cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
cta

Hortonworks Sandbox

cloud Ready to Get Started?

DOWNLOAD SANDBOX

Hortonworks Sandbox is a personal, portable Apache Hadoop® environment that comes with dozens of interactive Hadoop and it's ecosystem tutorials and the most exciting developments from the latest HDP distribution. Get up and running in 15 minutes!

Download Sandbox

Overview

If you are new to the Hortonworks Sandbox and using Apache open source tools to build modern data applications we suggest you are with the following tutorials.

Sandbox Basics

Getting Started with HDP®

Begin your Apache Hadoop journey with this tutorial aimed for users with limited experience in using the Sandbox.
Explore Sandbox on virtual machine and cloud environments and learn to navigate the Apache Ambari user interface.

This tutorial provides a section that describes the key concepts and series of tutorials where you move data into HDFS, explore the data with SQL in Apache Hive, do transformations with Apache Pig or Apache Spark and at the end generate a report with your choice of Microsoft Excel, Apache Zeppelin or Zoomdata tools.

Getting Started with HDP

Hands-on Tour of Apache Spark in 5 minutes

This will provide a quick introduction to Spark by creating an RDD for wikipedia inside an Apache Zeppelin notebook.

After you have gone through this tutorial you can find additional Spark tutorials here:

Apache Spark in 5 minutes

IoT Realtime Event Processing

Apache Hadoop is often used to process unstructured data, new data types or data at scale at rest. However, you can also process data-in-motion and this tutorial will introduce you to tools like Apache Storm, Apache Kafka and Apache HBase.

IoT Realtime Event Processing

Analyzing Social Media and Customer Sentiment

This tutorial will introduce you to consuming real time twitter data and doing some basic sentiment analysis. You will be introduced to Apache NiFi to connect and conduct streaming data from twitter and then you will persist the data into Apache Solr and Apache Hive.

Apache NiFi

Try More Tutorials

You can find additional tutorials here:

What's New in Hortonworks Data Platform 2.5

parallax slide

For Security Administrators & Data Stewards

  • Classification-based Policy. ​ Assign access to data assets based on reusable metadata tags such as PCI or PII.
  • Location-based Policy. Customize entitlements based on geography. A user trying to access the same data from different locations would be subject to unique geographical context.
  • Data Expiry-based Policy Assign expiration dates to data tag to automatically deny users access to the tagged data after the expiration date.
  • Prohibition-based Policy. Define security policy that restricts combining two data sets to help avoid privacy violations.
  • Row Level Security & Dynamic Data Masking. Restrict row access and anonymize sensitive data in real-time in Hive based on user characteristics and runtime context.
parallax slide

For Hadoop operators

  • Role-Based Access Control. Apache Ambari 2.4 includes additional cluster operational roles to provide more granular division of control for cluster operations.
  • Log Search (Technical preview). Automatically configures the collection of cluster operational metrics to aid with analysis and troubleshooting by including a new Log Search service.
  • Customizable Cluster Alerts. Tailor HDP to fit with your enterprise monitoring environment by configuring a set of predefined alerts that seamlessly integrates with your existing enterprise monitoring tools.
  • Activity Reporting and Visualization. Activity Reporting and Visualization in Hortonworks SmartSense 1.3 (available separately) helps Hadoop operators understand how their cluster operates.
Hortonworks Sandbox in the cloud

Hortonworks Sandbox in the cloud

Explore cloud vendors that can help you get started with Hadoop with minimum system requirements.
Learn More
Hortonworks Sandbox on a VM Download

No data center, no cloud service and no internet connection needed! Full control of the environment. Easily extend with additional components or try the various Hortonworks technical previews. Always updated with latest edition.

Try Hortonworks Sandbox on Azure

Azure provides an easy way to get started with Hadoop with minimum system requirements. This is a great solution if your personal machine doesn’t meet the minimum system requirements to run locally.