Get Started with Hadoop
Ready to Work with Hadoop?
We’ve brought together a collection of resources that are of particular interest for developers, analysts, and system administrators
The easiest way to get started with Enterprise Hadoop
Sandbox is a personal, portable Hadoop environment that comes with a dozen interactive Hadoop tutorials. Sandbox includes many of the most exciting developments from the latest HDP distribution, packaged up in a virtual environment that you can get up and running in 15 minutes!
Sandbox comes with a dozen hands-on tutorials that will guide you through the basics of Hadoop; tutorials built on the experience gained from training thousands of people in our Hortonworks University Training classes.
Build a Proof of Concept
The Sandbox includes the Hortonworks Data Platform in an easy to use form. You can add your own datasets, and connect it to your existing tools and applications. With this, you can prove out your use of Hadoop and plan the integration points for your first Hadoop project.
Test New Functionality
You can test new functionality with the Sandbox before you put it into production. Simply, easily and safely.
Work your way through the tutorials
Sandbox comes with a series of in-depth tutorials that provide an easy hands-on introduction to many of the common use cases of Hadoop
- Hello World – An overview of Hadoop with HCatalog, Hive and Pig
- How To Process Data with Apache Pig
- How to Process Data with Apache Hive
- How to Use HCatalog, Pig & Hive Commands
Training & Certification
Hortonworks offers public and private Hadoop training for business users, Java developers, Windows teams, data analysts, data scientists and administrators. Courses are designed by the leaders and committers of Apache Hadoop and students work through real-world, scenario-based projects.
See all courses
Students that successfully complete a Hortonworks training course are able to sit for the respective Hortonworks certification exam. Hortonworks certification identifies you as an expert in the Apache Hadoop ecosystem.
Apache Hadoop has a vibrant community of contributors developing and extending the codebase, along with a developers, data scientists and adminstrators building apps on top of Hadoop. Join the Apache mailing lists for Hadoop and monitor progress of JIRA tickets, submit bugs and contribute code.
An ideal way to get started. Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Connect with the Hortonworks experts and other Hadoop users in the Forums.
Find a Meetup near you
There are many Hadoop user groups across the world focused on learning, using and evolving Hadoop. Meet them here.
Sign up for the newsletter
If you drop us your email address below, then we’ll drop you a line every few weeks with the latest information from Hortonworks and the Hadoop ecosystem.