How To Integrate Tableau and Hadoop with Hortonworks Data Platform

Chances are you’ve already used Tableau Software if you’ve been involved with data analysis and visualization solutions for any length of time. Tableau 6.1.4 introduced the ability to visualize large, complex data stored in Hadoop with Hortonworks Data Platform via Hive and the Hortonworks Hive ODBC driver.

If you want to get hands on with Tableau as quickly as possible, we recommend using the Hortonworks Sandbox and the ‘Visualize Data with Tableau’ tutorial.

Additionally, Tableau have partnered with us to produce these two fantastic resources to assist with integrating their tools on HDP.

Using_Tableau_with_Hortonworks_Data_Platform.v1.0 Using Tableau Software with Hortonworks Data Platform
This whitepaper walks through the integration of Tableau Software with HDP and provides a reference architecture and solution set for modernizing your data architecture for big data analytics applications covering:

  • Use Cases for Big Data
  • Integrating Hortonworks Data Platform and Tableau Software
  • Reference Architecture for Big Data Analytics
  • Essential Technical Components
Best_Practices_for_Hadoop_Data_Analysis_with_Tableau.v1.0 Best Practices for Hadoop Data Analysis with Tableau
This how-to guide is packed with tips and tricks to get started and then get the most out of Tableau and HDP including unique features of the connector, performance best practices and a few gotchas covering:

  • Pre-requisites and basic installation.
  • Performing On-the-Fly ETL
  • Working with UDFs or MapReduce
  • Performance Techniques
  • Known Limitations

Categorized by :
HDP

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.