HDP : Archive Release

Version 2.0

What's New in 2.0

  • HDP 2.0 comprises the most recent innovation being delivered in the open Hadoop community. It delivers the latest releases across Hadoop and the key related projects into a single integrated and tested platform. While there are hundreds of new features found in each Apache project, some of the highlights for this release include:
  • Enterprise Ready YARN, the Hadoop Operating SystemWith Hadoop 2, Apache Hadoop YARN serves as the Hadoop operating system, and takes Hadoop from a single-use data platform for batch processing to a multi-use platform that enables batch, interactive, online and stream processing.
  • Stinger Phase 2; Interactive SQL Queries at Petabyte ScaleThe Stinger Initiative was launched at the beginning of 2013 as a broad community-based effort to enhance the speed, scale and breadth of SQL semantics supported by Apache Hive. Hive 0.12 represents phase 2 of the Stinger Initiative and HDP 2.0 is a significant step forward for Hive, the de-facto standard for SQL access in Hadoop.
  • Reliable NoSQL IN Hadoop with HBaseApache HBase 0.96 is the culmination of more than a year’s worth of effort that’s delivered important enterprise features such as Snapshots and improved MTTR
  • Manage & Monitor YARN and a Hadoop 2 clusterApache Ambari 1.4 allows you to provision, manage and monitor a cluster based on the Hadoop 2 stack. This includes YARN, MapReduce 2 and support for enabling native NameNode High Availability (HA).

Technical Specifications

Component Version
Apache Hadoop 2.2.0
Apache Hive 0.12.0
Apache HCatalog 0.12.0
Apache HBase 0.96.1
Apache ZooKeeper 3.4.5
Apache Pig 0.12.0
Apache Sqoop 1.4.4
Apache Flume 1.4.0
Apache Oozie 4.0.0
Apache Ambari 1.4.4
Apache Mahout 0.8.0
Hue 2.3.0

For the list of patches applied to the component
versions please refer to the Release Notes.

Download & Install

HDP Sandbox
Runs on VirtualBox or VMWare
Try HDP on a laptop. Sandbox is a virtualized HDP environment that includes interactive Hadoop tutorials. A simple and easy way to get started with Hadoop. More about Sandbox
Automated (Ambari)
RHEL/CentOS/SLES (64-bit)
The easiest way to set up HDP. Apache Ambari simplifies the provisioning, management and monitoring of your cluster.
Download links within documentation
Go straight to the page
Manual (RPMs)
Ubuntu/RHEL/CentOS/SLES (64-bit)
Roll up your sleeves and use this option to install and configure your cluster manually with the RPM packages.
Windows
Windows Server 2008 & 2012
The only release of Hadoop available for the Windows platform.
Full Details of this Windows Release

A variety of open source projects have been integrated, tested and combined as part of the Hortonworks Data Platform (HDP). The platform components comprising the Hortonworks Data Platform (HDP) are released under the Apache 2.0 License. HDP is also commonly used with 3rd-Party Components (ex. Oracle’s JDK – Java Platform) and Optional Add-Ons (ex. Hive ODBC Driver). When you choose to use those components, it is recommended you read and understand the licensing terms specific to each of those components :
Components & Licenses »

Previous Versions?

Downloads, Add-ons and Documentation for previous versions of HDP are available here

Search Docs

Have Questions?

Get help, answers to common questions, and collaborate with others in the forums :

Add-ons

Hortonworks Hive ODBC Driver (Win/Mac – v1.3.19)

The Hortonworks Hive ODBC Driver allows you to connect popular Business Intelligence (BI) tools to query, analyze and visualize data stored within the Hortonworks Data Platform.

Hortonworks Hive ODBC Driver (Linux – v1.3.19)

The Hortonworks Hive ODBC Driver allows you to connect popular Business Intelligence (BI) tools to query, analyze and visualize data stored within the Hortonworks Data Platform.

Quest Data Connector for Oracle and Hadoop

The Quest® Data Connector for Oracle and Hadoop is a Sqoop plugin that enables high-performance data transfer between Oracle Database and Hadoop. This version is compatible with Hortonworks Data Platform.

Hortonworks Connector for Teradata v1.1 for HDP2

The Hortonworks Connector for Teradata is the fastest and most scalable way to transfer data between Teradata Database and Apache Hadoop. Jointly developed by Teradata and Hortonworks, the Connector plugs into Hortonworks Data Platform and offers wire-speed, fully parallel data transfers between Teradata and Apache Hive, HBase, HCatalog or HDFS. The connector lets you move more data faster, freeing up more time for critical data processing.

Talend Open Studio for Big Data (v5.4)

Talend Open Studio for Big Data is a powerful and versatile open source data integration tool. Talend provides data managers, operators, and analysts a graphical tool that abstracts the underlying Hadoop complexities and dramatically improves the efficiency of job design through an easy-to-use Eclipse development environment.