Hortonworks Data Platform, powered by Apache Hadoop, is a massively scalable 100% open source platform for storing, processing, and analyzing large volumes of data. It is designed to deal with data from many sources and formats in a quick, easy, and cost-effective manner. Hortonworks Data Platform provides an open, stable and highly extensible platform that makes it easier to integrate Apache Hadoop with existing data architectures and maximize the value of the data flowing through your business.
Who is it for?
Hortonworks Data Platform is ideal for organizations that prefer a 100% Apache-licensed, open source platform that consists of the essential Apache Hadoop projects. It is highly recommended for anyone that has encountered difficulty installing and integrating Hadoop projects downloaded directly from Apache or organizations that want to avoid getting locked into a proprietary vendor’s platform.
It is also ideal for solution providers that wish to integrate or extend their solutions for Apache Hadoop thanks to the openness and extensibility of the platform.
What is included?
Hortonworks Data Platform includes the most popular and essential Apache Hadoop projects including the Hadoop Distributed File System (HDFS), MapReduce, Pig, Hive, HBase and Zookeeper. In addition to these components, Hortonworks Data Platform includes open source technologies that make the Hadoop platform more manageable, open, and extensible.
Unlike other Hadoop solutions that lock away management features within proprietary extensions, Hortonworks Data Platform includes Ambari, an open source installation and management system out of the box. Hortonworks Data Platform also includes HCatalog, a metadata management service for simplifying data sharing between Hadoop and other enterprise information systems, along with a complete set of open APIs, including WebHDFS and those for Ambari and HCatalog, to make it easier for ISVs to integrate and extend Apache Hadoop.
All of these components have been integrated and tested as part of the Hortonworks Data Platform release process. Installation and configuration tools have also been included to make it easier to install, deploy and use the Hortonworks Data Platform.
The initial release of Hortonworks Data Platform (version 1) is based on Apache Hadoop 0.20.205, a stable release and the first Apache Hadoop release to support security and HBase. There has also been significant progress on Next Generation MapReduce in Apache Hadoop 0.23, and within the coming weeks we expect to release an early technology preview of Hortonworks Data Platform version 2 which includes this important and emerging technology.
When will Hortonworks Data Platform be available?
The Hortonworks Data Platform version 1 (based upon Apache Hadoop 0.20.205) will be released via a private Technology Preview Program in November 2011 with plans to release a public technology preview in early 2012. Customers, partners and community members are encouraged to sign up for the Technology Preview Program.
