Hortonworks Data Platform Product Dependencies and Licenses


Hortonworks is dedicated to providing a 100% open source, Apache licensed product for our users and community. A variety of open source projects have been integrated, tested and combined as part of the Hortonworks Data Platform (HDP). The specific versions of these projects have been selected because they have been tested and certified to be the most stable, compatible and production-ready releases available in their respective domains. A release of HDP may also include external dependencies. Some of those components are provided bundled with the software; we call those Dependencies.

This document lists the HDP dependencies, their versions and license types, and how those dependencies appear in the platform files.

Kit and Bundle Structure

The Hortonworks Data Platform (HDP) is designed to be easy to consume and use. It consists of a core product, HDP, and optional add-ons. The product and its add-ons are offered in common formats such as RPMs (Red Hat Package Manager packages) and tarballs (tar archives compressed with gzip).

Add-ons can be downloaded separately from HDP, and each comes with instructions on how to install it and use its features alongside HDP. These add-ons can be products, plug-ins, or other software that deliver significant operational, productivity, or usability gains but are not required in order to use HDP.

External Dependencies

Hortonworks Data Platform is bit-for-bit identical with Apache Hadoop: a 100% pure Apache Software Foundation, Apache-licensed set of components. HDP offers and/or installs some non-Apache software under various licenses to improve the user experience. You can choose to separate out every non-Apache component and install it yourself, or not use those components at all. The following tables specify the versions of the various components, the libraries in use in HDP, and the license terms of other software available in our product or offered as an add-on.

Open source projects are sourced from various open source communities, including the Apache Software Foundation, while commercial components are sourced from the corresponding third-party software provider.

Product Licensing

Hortonworks Data Platform (HDP) is released under an Apache 2.0 License. Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. You may obtain a copy of the License at: http://www.apache.org/licenses/LICENSE-2.0

With the exception of Dependent and Optional components, all components within HDP use or are compatible with the Apache license and are therefore covered under the Software Assurance offering that is available as an option to Hortonworks subscription customers. Dependent and Optional components obey their own license terms and are therefore excluded from coverage under Software Assurance, as are Alpha, Beta, Developer Preview, and Early Access releases. Hortonworks customers should refer to their Software Assurance contract for more details.

Hortonworks Data Platform Components

The following components have been integrated and tested with the Hortonworks Data Platform.

Hortonworks Data Platform 1.2

Component | Version | License Terms
Apache Hadoop – Core Hadoop (HDFS & MapReduce) | 1.1.2 | Apache 2
Apache Hive – Data Access & Query | 0.10.0 | Apache 2
Apache HCatalog – Metadata Services | 0.5.0 | Apache 2
Apache HBase – NoSQL Database | 0.94.5 | Apache 2
Apache ZooKeeper – Cluster Coordination | 3.4.5 | Apache 2
Apache Pig – Scripting Platform | 0.10.1 | Apache 2
Apache Sqoop – Data Integration Services | 1.4.2 | Apache 2
Apache Oozie – Workflow Scheduler | 3.2.0 | Apache 2
Apache Flume – Log management services | 1.3.1 | Apache 2
Apache Mahout – Machine learning library | 0.7.0 | Apache 2
Apache Ambari – Cluster install, management and monitoring | 1.2.2 | Apache 2
Talend Open Studio for Big Data – Data Integration Services | 5.1.1 | Apache 2
HDP High Availability for VMware – HA for VMware vSphere | 1.1.0 | Hortonworks License (Apache compatible)
HDP High Availability for Red Hat – HA for Red Hat Enterprise Linux | 1.1.0 | Apache 2
Teradata Connector for Apache Hadoop – Connector for data transfer | 0.1.1 | Hortonworks License (Apache compatible)

For the latest HDP 1.2 patch information, refer to the HDP 1.2.3 Release Notes.

Hortonworks Data Platform 1.1

Component | Version | License Terms
Apache Hadoop – Core Hadoop (HDFS & MapReduce) | 1.0.3 | Apache 2
Apache Hive – Data Access & Query | 0.9.0 | Apache 2
Apache HCatalog – Metadata Services | 0.4.0 | Apache 2
Templeton – HCatalog REST API | 0.1.4 | Apache 2
Apache HBase – NoSQL Database | 0.92.1 | Apache 2
Apache ZooKeeper – Cluster Coordination | 3.3.4 | Apache 2
Apache Pig – Scripting Platform | 0.9.2 | Apache 2
Apache Sqoop – Data Integration Services | 1.4.2 | Apache 2
Apache Oozie – Workflow Scheduler | 3.1.3 | Apache 2
Apache Flume – Log management services | 1.2.0 | Apache 2
HMC (Apache Ambari) – Cluster install, management and monitoring | 1.0.2 | Apache 2
Talend Open Studio for Big Data – Data Integration Services | 5.1.1 | Apache 2
HDP High Availability for VMware – HA for VMware vSphere | 1.1.0 | Hortonworks License (Apache compatible)
HDP High Availability for Red Hat – HA for Red Hat Enterprise Linux | 1.1.0 | Apache 2
Hortonworks Hive ODBC Driver | 1.0.0 | Hortonworks License (Apache compatible)

For the latest HDP 1.1 patch information, refer to the HDP 1.1.1 Release Notes.
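
When planning an upgrade, it can help to see which component versions differ between two releases. The following is a minimal sketch, not a Hortonworks tool: it transcribes a subset of the HDP 1.1 and HDP 1.2 tables above into dictionaries and reports what changed. The names `HDP_1_1`, `HDP_1_2`, `version_key`, and `compare_releases` are hypothetical, and components whose names differ between the two tables (for example HMC and Apache Ambari) are left out for simplicity.

```python
# Versions transcribed from the HDP 1.1 and HDP 1.2 tables in this document.
# Only a subset of components is listed; the dictionary names are illustrative.
HDP_1_1 = {
    "Apache Hadoop": "1.0.3",
    "Apache Hive": "0.9.0",
    "Apache HCatalog": "0.4.0",
    "Apache HBase": "0.92.1",
    "Apache ZooKeeper": "3.3.4",
    "Apache Pig": "0.9.2",
    "Apache Sqoop": "1.4.2",
    "Apache Oozie": "3.1.3",
    "Apache Flume": "1.2.0",
}
HDP_1_2 = {
    "Apache Hadoop": "1.1.2",
    "Apache Hive": "0.10.0",
    "Apache HCatalog": "0.5.0",
    "Apache HBase": "0.94.5",
    "Apache ZooKeeper": "3.4.5",
    "Apache Pig": "0.10.1",
    "Apache Sqoop": "1.4.2",
    "Apache Oozie": "3.2.0",
    "Apache Flume": "1.3.1",
    "Apache Mahout": "0.7.0",
}


def version_key(version):
    """Turn '0.10.0' into (0, 10, 0) so versions compare numerically."""
    return tuple(int(part) for part in version.split("."))


def compare_releases(old, new):
    """Map each component name to (old_version, new_version, status)."""
    changes = {}
    for name in sorted(set(old) | set(new)):
        before, after = old.get(name), new.get(name)
        if before is None:
            status = "added"
        elif after is None:
            status = "removed"
        elif version_key(after) > version_key(before):
            status = "upgraded"
        elif version_key(after) == version_key(before):
            status = "unchanged"
        else:
            status = "downgraded"
        changes[name] = (before, after, status)
    return changes


if __name__ == "__main__":
    for name, (before, after, status) in compare_releases(HDP_1_1, HDP_1_2).items():
        print(f"{name}: {before} -> {after} ({status})")
```

Versions are compared as tuples of integers rather than as strings because a plain string comparison would rank Hive 0.9.0 above 0.10.0. The sketch assumes purely numeric, dot-separated versions, so entries such as "0.10.0+" or "2.0.3 (Alpha)" in the other tables would need extra handling.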

Hortonworks Data Platform 1.0

Component | Version | License Terms
Apache Hadoop – Core Hadoop (HDFS & MapReduce) | 1.0.3 | Apache 2
Apache Hive – Data Access & Query | 0.9.0 | Apache 2
Apache HCatalog – Metadata Services | 0.4.0 | Apache 2
Templeton – HCatalog REST API | 0.1.4 | Apache 2
Apache HBase – NoSQL Database | 0.92.1 | Apache 2
Apache ZooKeeper – Cluster Coordination | 3.3.4 | Apache 2
Apache Pig – Scripting Platform | 0.9.2 | Apache 2
Apache Sqoop – Data Integration Services | 1.4.1 | Apache 2
Apache Oozie – Workflow Scheduler | 3.1.3 | Apache 2
HMC (Apache Ambari) – Cluster install, management and monitoring | 0.9 | Apache 2
Talend Open Studio for Big Data – Data Integration Services | 5.1.1 | Apache 2

For HDP 1.0.1 patch information, refer to the HDP 1.0.1 Release Notes.

Hortonworks Data Platform 1.1 for Windows (Beta)

Component | Version | License Terms
Apache Hadoop – Core Hadoop (HDFS & MapReduce) | 1.0.3 | Apache 2
Apache Hive – Data Access & Query | 0.9.0 | Apache 2
Templeton – HCatalog REST API | 0.1.4 | Apache 2
Apache HCatalog – Metadata Services | 0.4.1 | Apache 2
Apache Pig – Scripting Platform | 0.9.3 | Apache 2
Apache Oozie – Workflow Scheduler | 3.2.0 | Apache 2
Apache Sqoop – Data Integration Services | 1.4.2 | Apache 2

For HDP 1.1 for Windows (Beta) patch information, refer to the HDP 1.1 for Windows Release Notes.

Hortonworks Data Platform 2.0 Alpha 2

Component | Version | License Terms
Apache Hadoop – Core Hadoop (HDFS & YARN/MapReduce) | 2.0.3 (Alpha) | Apache 2
Apache Pig – Scripting Platform | 0.10.1 | Apache 2
Apache Hive – Data Access & Query | 0.10.0+ | Apache 2
Apache HCatalog – Metadata Services | 0.5.0 | Apache 2
Apache HBase – NoSQL Database | 0.94.5 | Apache 2
Apache ZooKeeper – Cluster Coordination | 3.4.5 | Apache 2
Apache Tez – Query processing | 0.1 | Apache 2

For HDP 2.0 Alpha 2 patch information, refer to the HDP 2.0 Alpha 2 Release Notes.

Hortonworks Data Platform 2.0 Alpha 1

Components Version License Terms
Apache Hadoop – Core Hadoop (HDFS & YARN/MapReduce) | 2.0.2 (Alpha) | Apache 2
Apache Pig – Scripting Platform | 0.9.2 | Apache 2
Apache Hive – Data Access & Query | 0.9.0 | Apache 2
Apache HCatalog – Metadata Services | 0.4.0 | Apache 2
Apache HBase – NoSQL Database | 0.94.2 | Apache 2
Apache ZooKeeper – Cluster Coordination | 3.4.3 | Apache 2
Apache Oozie – Workflow Scheduler | 3.2.0 | Apache 2
HMC (Apache Ambari) – Cluster install, management and monitoring | 2.0.0.1 | Apache 2

For HDP 2.0 Alpha 1 patch information, refer to the HDP 2.0 Alpha 1 Release Notes.

Hortonworks Data Platform 1.1 for Microsoft HDInsight (Developer Preview)

Component | Version | License Terms
Apache Hadoop – Core Hadoop (HDFS & MapReduce) | 1.0.3 | Apache 2
Apache Hive – Data Access & Query | 0.9.0 | Apache 2
Apache HCatalog – Metadata Services | 0.4.0 | Apache 2
Apache Pig – Scripting Platform | 0.9.3 | Apache 2

For HDP 1.1 for Microsoft HDInsight (Developer Preview) patch information, refer to the HDP 1.1 for HDInsight Release Notes.

Dependent and Optional Components

The following Dependent and Optional components have been included for dependency purposes or customer convenience only. Hortonworks supports HDP interactions with these components for both development and production purposes according to the terms of your HDP Subscription agreement. Please note that Dependent and Optional components obey their own license terms and are therefore excluded from coverage under Software Assurance; for more details, please obtain a copy of the Software Assurance contract from Hortonworks legal.

Dependent and Optional Components | Notes | License Terms
Oracle JDK – Java Platform | This is a required component that provides the Java platform to run Apache Hadoop and related components. | Oracle Corporation
MySQL – Hive + HCatalog Metastore Database | This component is used by Apache HCatalog and Apache Hive and provides a relational database for storing table, schema, and metadata information. | GPL v2
PostgreSQL – Ambari Configuration Database | This component is used by Apache Ambari to store cluster information such as configurations and topology. | PostgreSQL License
Nagios – System Monitoring and Alerts | This component is used by Apache Ambari to provide basic system-level monitoring and alerting capabilities. | GPL v2
Ganglia – Service Monitoring and Metrics | This component is used by Apache Ambari to provide service and application level monitoring and metric collection capabilities. | BSD (Apache 2 compatible)
Puppet – Installation and Configuration | This component is used by Apache Ambari / HMC to provide installation and configuration capabilities. | Apache
LZO – Compression Library | This component is provided for use as an optional compression algorithm that can be used to replace the default Snappy compression library if so desired. | GPL v2
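
For a license review, it may be useful to group the Dependent and Optional components by their license terms. The sketch below is illustrative only: the list is transcribed from the table above, and the names `DEPENDENCIES`, `group_by_license`, and `components_under` are hypothetical helpers rather than anything shipped with HDP.

```python
from collections import defaultdict

# (component, license terms) pairs transcribed from the table in this document.
DEPENDENCIES = [
    ("Oracle JDK", "Oracle Corporation"),
    ("MySQL", "GPL v2"),
    ("PostgreSQL", "PostgreSQL License"),
    ("Nagios", "GPL v2"),
    ("Ganglia", "BSD (Apache 2 compatible)"),
    ("Puppet", "Apache"),
    ("LZO", "GPL v2"),
]


def group_by_license(deps):
    """Return a dict mapping each license string to its component names."""
    groups = defaultdict(list)
    for name, license_terms in deps:
        groups[license_terms].append(name)
    return dict(groups)


def components_under(deps, license_prefix):
    """List the components whose license string starts with license_prefix."""
    return [name for name, terms in deps if terms.startswith(license_prefix)]


if __name__ == "__main__":
    for license_terms, names in group_by_license(DEPENDENCIES).items():
        print(f"{license_terms}: {', '.join(names)}")
    print("GPL components:", components_under(DEPENDENCIES, "GPL"))
```

The license strings are matched exactly as they appear in the table; this is a simple text grouping and does not assess license compatibility.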