Hortonworks is dedicated to providing a 100% open source, Apache licensed product for our users and community. A variety of open source projects have been integrated, tested and combined as part of the Hortonworks Data Platform (HDP). The specific versions of these projects have been selected because they have been tested and certified to be the most stable, compatible and production-ready releases available in their respective domains. A release of HDP may also include external dependencies. Some of those components are provided bundled with the software; we call those Dependencies.
This document provides list of HDP dependencies, version, license type, and how those dependencies are manifest in the platform files.
Kit and Bundle Structure
The Hortonworks Data Platform (HDP) is designed to be easy to consume and use. There is a core product HDP and there are Add-Ons. The product and its add-ons are offered in common formats such as RPMs (Redhat Package Manager) and tarballs (tar and gzip GNU Unix utilities).
Add-ons can be downloaded separately from HDP and each comes with instructions on how to install and leverage the features alongside HDP. These add-ons can be products, plug-ins, or other software that can deliver significant operational, productivity, or usability gains but should not be required in order to use HDP.
External Dependencies
Hortonworks Data Platform is bit-for-bit identical with Apache Hadoop – a 100% pure Apache-Foundation, and Apache licensed set of components. HDP offers and / or installs some non-Apache software of various licenses to improve user experience. You can choose to separate out every non-Apache component and install yourself or not use the components at all. The following set of tables specifies the versions of various components, the libraries in use in HDP, and the license terms of other software available in our product or offered as an Add-On.
Open source projects are sourced from various open source communities, including the Apache Software Foundation, while commercial components are sourced from the corresponding 3rd-party software provider.
Product Licensing
Hortonworks Data Platform (HDP) is released under an Apache 2.0 License. Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. You may obtain a copy of the License at: http://www.apache.org/licenses/LICENSE-2.0
With the exception of Dependent and Optional components, all Components within HDP use or are compatible with the Apache license and are therefore covered under the Software Assurance offering that is available as an option to Hortonworks subscription customers. Please note that Dependent and Optional Components obey their own license terms and are therefore excluded from coverage under Software Assurance. Please note that Alpha, Beta, Developer Preview and Early Access releases are excluded from coverage under Software Assurance. For Hortonworks customers, please refer to your Software Assurance contract for more details.
Hortonworks Data Platform Components
The following components have been integrated and tested with the Hortonworks Data Platform.
| Components | Version | License Terms |
| Apache Hadoop – Core Hadoop (HDFS & MapReduce) | 1.1.2 | Apache 2 |
| Apache Hive – Data Access & Query | 0.10.0 | Apache 2 |
| Apache HCatalog – Metadata Services | 0.5.0 | Apache 2 |
| Apache HBase – NoSQL Database | 0.94.5 | Apache 2 |
| Apache ZooKeeper – Cluster Coordination | 3.4.5 | Apache 2 |
| Apache Pig – Scripting Platform | 0.10.1 | Apache 2 |
| Apache Sqoop – Data Integration Services | 1.4.2 | Apache 2 |
| Apache Oozie – Workflow Scheduler | 3.2.0 | Apache 2 |
| Apache Flume – Log management services | 1.3.1 | Apache 2 |
| Apache Mahout – Machine learning library | 0.7.0 | Apache 2 |
| Apache Ambari – Cluster install, management and monitoring | 1.2.2 | Apache 2 |
| Talend Open Studio for Big Data – Data Integration Services | 5.1.1 | Apache 2 |
| HDP High Availability for VMware – HA for VMware vSphere | 1.1.0 | Hortonworks License (Apache compatible) |
| HDP High Availability for Red Hat – HA for Red Hat Enterprise Linux | 1.1.0 | Apache 2 |
| Teradata Connector for Apache Hadoop – Connector for data transfer | 0.1.1 | Hortonworks License (Apache compatible) |
| Components | Version | License Terms |
| Apache Hadoop – Core Hadoop (HDFS & MapReduce) | 1.0.3 | Apache 2 |
| Apache Hive – Data Access & Query | 0.9.0 | Apache 2 |
| Apache HCatalog – Metadata Services | 0.4.0 | Apache 2 |
| Templeton – HCatalog REST API | 0.1.4 | Apache 2 |
| Apache HBase – NoSQL Database | 0.92.1 | Apache 2 |
| Apache ZooKeeper – Cluster Coordination | 3.3.4 | Apache 2 |
| Apache Pig – Scripting Platform | 0.9.2 | Apache 2 |
| Apache Sqoop – Data Integration Services | 1.4.2 | Apache 2 |
| Apache Oozie – Workflow Scheduler | 3.1.3 | Apache 2 |
| Apache Flume – Log management services | 1.2.0 | Apache 2 |
| HMC (Apache Ambari) – Cluster install, management and monitoring | 1.0.2 | Apache 2 |
| Talend Open Studio for Big Data – Data Integration Services | 5.1.1 | Apache 2 |
| HDP High Availability for VMware – HA for VMware vSphere | 1.1.0 | Hortonworks License (Apache compatible) |
| HDP High Availability for Red Hat – HA for Red Hat Enterprise Linux | 1.1.0 | Apache 2 |
| Hortonworks Hive ODBC Driver | 1.0.0 | Hortonworks License (Apache compatible) |
| Components | Version | License Terms |
| Apache Hadoop – Core Hadoop (HDFS & MapReduce) | 1.0.3 | Apache 2 |
| Apache Hive – Data Access & Query | 0.9.0 | Apache 2 |
| Apache HCatalog – Metadata Services | 0.4.0 | Apache 2 |
| Templeton – HCatalog REST API | 0.1.4 | Apache 2 |
| Apache HBase – NoSQL Database | 0.92.1 | Apache 2 |
| Apache ZooKeeper – Cluster Coordination | 3.3.4 | Apache 2 |
| Apache Pig – Scripting Platform | 0.9.2 | Apache 2 |
| Apache Sqoop – Data Integration Services | 1.4.1 | Apache 2 |
| Apache Oozie – Workflow Scheduler | 3.1.3 | Apache 2 |
| HMC (Apache Ambari) – Cluster install, management and monitoring | 0.9 | Apache 2 |
| Talend Open Studio for Big Data – Data Integration Services | 5.1.1 | Apache 2 |
| Components | Version | License Terms |
| Apache Hadoop – Core Hadoop (HDFS & MapReduce) | 1.0.3 | Apache 2 |
| Apache Hive – Data Access & Query | 0.9.0 | Apache 2 |
| Templeton – HCatalog REST API | 0.1.4 | Apache 2 |
| Apache HCatalog – Metadata Services | 0.4.1 | Apache 2 |
| Apache Pig – Scripting Platform | 0.9.3 | Apache 2 |
| Apache Oozie – Workflow Scheduler | 3.2.0 | Apache 2 |
| Apache Sqoop – Data Integration Services | 1.4.2 | Apache 2 |
| Components | Version | License Terms |
| Apache Hadoop – Core Hadoop (HDFS & YARN/MapReduce) | 2.0.3 (Alpha) | Apache 2 |
| Apache Pig – Scripting Platform | 0.10.1 | Apache 2 |
| Apache Hive – Data Access & Query | 0.10.0+ | Apache 2 |
| Apache HCatalog – Metadata Services | 0.5.0 | Apache 2 |
| Apache HBase – NoSQL Database | 0.94.5 | Apache 2 |
| Apache ZooKeeper – Cluster Coordination | 3.4.5 | Apache 2 |
| Apache Tez – Query processing | 0.1 | Apache 2 |
| Components | Version | License Terms |
| Apache Hadoop – Core Hadoop (HDFS & YARN/MapReduce) | 2.0.2 (Alpha) | Apache 2 |
| Apache Pig – Scripting Platform | 0.9.2 | Apache 2 |
| Apache Hive – Data Access & Query | 0.9.0 | Apache 2 |
| Apache HCatalog – Metadata Services | 0.4.0 | Apache 2 |
| Apache HBase – NoSQL Database | 0.94.2 | Apache 2 |
| Apache ZooKeeper – Cluster Coordination | 3.4.3 | Apache 2 |
| Apache Oozie – Workflow Scheduler | 3.2.0 | Apache 2 |
| HMC (Apache Ambari) – Cluster install, management and monitoring | 2.0.0.1 | Apache 2 |
Hortonworks Data Platform 1.1 for Microsoft HDInsight (Developer Preview)
| Components | Version | License Terms |
| Apache Hadoop – Core Hadoop (HDFS & MapReduce) | 1.0.3 | Apache 2 |
| Apache Hive – Data Access & Query | 0.9.0 | Apache 2 |
| Apache HCatalog – Metadata Services | 0.4.0 | Apache 2 |
| Apache Pig – Scripting Platform | 0.9.3 | Apache 2 |
Dependent and Optional Components
The following Dependent and Optional components have been included for dependency purposes or customer convenience only. Hortonworks supports HDP interactions with these components for both development and production purposes according to the terms of your HDP Subscription agreement. Please note that Dependent and Optional components obey their own license terms and are therefore excluded from coverage under Software Assurance; for more details, please obtain a copy of the Software Assurance contract from Hortonworks legal.
| Dependent and Optional Components | Notes | License Terms |
| Oracle JDK – Java Platform | This is a required component that provides the Java platform to run Apache Hadoop and related components. | Oracle Corporation |
| MySQL – Hive + HCatalog Metastore Database | This component is used by Apache HCatalog and Apache Hive and provides a relational database for storing table, schema, and metadata information. | GPL v2 |
| PostgreSQL – Ambari Configuration Database | This component is used by Apache Ambari to store cluster information such as configurations and topology. | PostgreSQL License |
| Nagios – System Monitoring and Alerts | This component is used by Apache Ambari to provide basic system-level monitoring and alerting capabilities. | GPL v2 |
| Ganglia – Service Monitoring and Metrics | This component is used by Apache Ambari to provide service and application level monitoring and metric collection capabilities. | BSD (Apache 2 compatible) |
| Puppet – Installation and Configuration | This component is used by Apache Ambari / HMC to provide installation and configuration capabilities. | Apache |
| LZO – Compression Library | This component is provided for use as an optional compression algorithm that can be used to replace the default Snappy compression library if so desired. | GPL v2 |