Get Started with Hadoop

Kick start your journey in Hadoop with these resources

Ready to work with Hadoop?

We’ve brought together a collection of resources that are of particular interest for developers, analysts, and system administrators

Learn how to collect and process data and build applications with Hadoop.
Learn how to explore, query and deliver insights with Hadoop.
Learn how to provision, manage and monitor Hadoop.

See Hadoop in action

Here’s a selection of introductory videos that show how Hadoop can be used to take advantage of new types of data :

Start using Hadoop with Hortonworks Sandbox

Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials. It’s a quick and easy way to start experimenting with Hadoop.

Get Sandbox

Hortonworks Sandbox

Work your way through the tutorials

Sandbox comes with a series of in-depth tutorials that provide an easy hands-on introduction to many of the common use cases of Hadoop

  1. Hello World – An overview of Hadoop with HCatalog, Hive and Pig
  2. How To Process Data with Apache Pig
  3. How to Process Data with Apache Hive
  4. How to Use HCatalog, Pig & Hive Commands
  5. More…

Training & Certification

Hortonworks offers public and private Hadoop training for business usersJava developers, Windows teamsdata analysts, data scientists and administrators. Courses are designed by the leaders and committers of Apache Hadoop and students work through real-world, scenario-based projects.

Students that successfully complete a Hortonworks training course are able to sit for the respective Hortonworks certification exam. Hortonworks certification identifies you as an expert in the Apache Hadoop ecosystem.

Contribute

Apache Hadoop has a vibrant community of contributors developing and extending the codebase, along with a developers, data scientists and adminstrators building apps on top of Hadoop. Join the Apache mailing lists for Hadoop and monitor progress of JIRA tickets, submit bugs and contribute code.

An ideal way to get started. Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.

Have Questions?

Connect with the Hortonworks experts and other Hadoop users in the Forums.

Find a Meetup near you

There are many Hadoop user groups across the world focused on learning, using and evolving Hadoop. Meet them here.

Sign up for the newsletter

If you drop us your email address below, then we’ll drop you a line every few weeks with the latest information from Hortonworks and the Hadoop ecosystem.

HDP 2.1 Webinar Series
Join us for a series of talks on some of the new enterprise functionality available in HDP 2.1 including data governance, security, operations and data access :
Contact Us
Hortonworks provides enterprise-grade support, services and training. Discuss how to leverage Hadoop in your business with our sales team.
Integrate with existing systems
Hortonworks maintains and works with an extensive partner ecosystem from broad enterprise platform vendors to specialized solutions and systems integrators.