Microsoft and Hortonworks have been working together for over two years now with the goal of bringing the power of Big Data to a billion people. As a result of that work, today we announced the General Availability of HDP 2.0 for Windows with the full power of YARN.
There are already over half a billion Excel users on this planet.
So, we have put together a short tutorial on the Hortonworks Sandbox where we walk through the end-to-end data pipeline using HDP and Microsoft Excel in the shoes of a data analyst at a financial services firm where she:
- Cleans and aggregates 10 years of raw stock tick data from NYSE
- Enriches the data model by looking up additional attributes from Wikipedia
- Creates an interactive visualization on the model