Hortonworks Sandbox: Dreaming Up New Tutorials For You
We’re cooking up some new tutorials for you to play with in your Hortonworks Sandbox to help you learn more about the Hortonworks Data Platform, Apache Hadoop, Hive, Pig and HCatalog, with maybe a smattering of Mahout in there as well.
While you’re anxiously awaiting, we thought we’d give you some pointers to some resources so that you can experiment and play. After all, that’s what a Sandbox is all about, right?
First, if you’re looking to expand your skills, take a look at Hive Language Manual, the Pig Tutorial on the Apache Foundation website, and Command Line Interface information on HCatalog project incubator site.
Use Hive to SQLize
Pull In Your Own Data
You have datasets. We know you want to put them in Hadoop. It’s easy. You can import your own data into the Sandbox the same way you imported data sets in the tutorials.
Looking for other interesting data sets? There are many interesting sets for you:
- Public Data available on Google
- Open Data Initiative from the US Government
- Microsoft Bing Spatial Data Services
- US Government XML Data sources
- Free downloadable datasets from InfoChimps
In the meantime, we’re working hard to bring you new and interesting tutorials. We’d love to see what you’ve done. Show us your demos and tutorials — who knows, there might be one of the coveted stuff elephants in your future!