January 21, 2014

How To Use Microsoft Excel to Visualize Hadoop Data

Microsoft and Hortonworks have been working together for over two years now with the goal of bringing the power of Big Data to a billion people. As a result of that work, today we announced the General Availability of HDP 2.0 for Windows with the full power of YARN.

There are already over half a billion Excel users on this planet.

So we have put together a short tutorial on the Hortonworks Sandbox that walks through an end-to-end data pipeline using HDP and Microsoft Excel. It follows a data analyst at a financial services firm as she:

  • Cleans and aggregates 10 years of raw stock tick data from NYSE
  • Enriches the data model by looking up additional attributes from Wikipedia
  • Creates an interactive visualization on the model

You can find the tutorial here.

As part of this process you will experience how simple it is to integrate HDP with the Microsoft Power BI platform.

This integration is made possible by the community work to design and implement WebHDFS, an open REST API in Apache Hadoop. Microsoft used this API from Power Query for Excel to make integration with the Microsoft Business Intelligence platform seamless.
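To give a feel for what a WebHDFS request looks like, here is a minimal Python sketch that builds a WebHDFS URL and reads a file with the `OPEN` operation. It is only an illustration of the REST API, not how Power Query is implemented. The helper names `webhdfs_url` and `read_hdfs_file` are hypothetical, and the host, port, and path in the usage note are assumptions taken from a default Sandbox setup, so they may differ on your machine.

```python
from urllib.parse import quote, urlencode
from urllib.request import urlopen


def webhdfs_url(host, port, hdfs_path, op, params=None):
    """Build a WebHDFS URL of the form http://host:port/webhdfs/v1/<path>?op=<OP>.

    Hypothetical helper for illustration; hdfs_path must be an absolute HDFS path.
    """
    query = {"op": op}
    if params:
        query.update(params)
    # quote() keeps '/' intact and escapes characters that are unsafe in a URL path.
    return f"http://{host}:{port}/webhdfs/v1{quote(hdfs_path)}?{urlencode(query)}"


def read_hdfs_file(host, port, hdfs_path, user=None):
    """Read a whole HDFS file over WebHDFS and return its bytes.

    The NameNode answers op=OPEN with a redirect to a DataNode;
    urlopen follows that redirect automatically.
    """
    params = {"user.name": user} if user else None
    url = webhdfs_url(host, port, hdfs_path, "OPEN", params)
    with urlopen(url) as response:
        return response.read()
```

Assuming the Sandbox is running and exposes WebHDFS on port 50070, `read_hdfs_file("127.0.0.1", 50070, "/user/hue/nyse/stock_aggregates/000000_0")` would return the raw contents of that file, and swapping `OPEN` for `LISTSTATUS` in `webhdfs_url` gives the URL for a directory listing.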

Happy Hadooping!!!

Comments

  • Hello.
    When I try to load the data from HDFS, this error is shown:
    DataSource.Error: HDFS failed to get contents from ‘http://127.0.0.1:50070/webhdfs/v1/user/hue/nyse/stock_aggregates/000000_0’. Status code: 503, description: ‘Service Unavailable’.
    Any suggestion about what is happening?
