Hi Everyone, i want to know the challanges or process involved in the connecting the Big Data, Hadoop or Mapr to Tableau. As the Big Data has Unstructured database how it needs to be coverted and connected to a Reporting Tool?
Check out MapR Hadoop Hive to connect tableau to a MapR Hadoop Hive database and set up the data source. Also recommend you to check out Apache Drill to understand why using Apache Drill for queries. Please let us know if you need additional information.
Tableau requires a JDBC connection to any Big Data tool. So you would have to use HiveServer2, or something that allows JDBC connections. I think MapR uses Simba.
There really isn't any unstructured data, just semi-structured.
If you're using Hive, its structure on read not on write. Other tools like Spark would require that your run their Thrift service. If you're using SQL then your data has to be structured or somewhat structured. Hive stores a schema and applies the schema to give structure to the underlying data source. (On Read) You could also use Drill as your JDBC connection too.
Are you still facing the challenge? Do any of the suggestions help? If they are, please help to mark them "Helpful" and "Correct" to show your appreciation.
Retrieving data ...