Hortonworks Data Platform 2.0 Now Generally Available

YARN-Based Architecture of HDP 2.0 Enables New Workloads in the Modern Data Architecture

PALO ALTO, Calif.—October 23, 2013—Hortonworks, the leading contributor to and provider of enterprise Apache™ Hadoop®, today announced the general availability of the Hortonworks Data Platform (HDP) 2.0, the first commercial distribution built on the recent Hadoop 2 GA release from the Apache Software Foundation. HDP 2.0 represents the culmination of five years of work across the Hadoop community and delivers the YARN-based architecture of Hadoop 2, phase 2 of the Stinger initiative and includes the very latest innovations from the broader Hadoop ecosystem in a single integrated and tested platform appropriate for mainstream enterprise use.

“Hortonworks Data Platform 2.0 GA release is a huge milestone of progress for Hadoop. Through our use of the HDP 2.0 and through the efforts of both AT&T and Hortonworks, AT&T is developing some of the top Hadoop experts in the world, along with leading edge technology,” said Victor Nilson, AT&T’s senior vice president for data sciences.

Click to Tweet: HDP 2.0 now GA; first #Hadoop 2 GA distribution features YARN-based architecture for modern data architectures #BigData

With Hadoop 2, Apache Hadoop YARN serves as the Hadoop operating system, and takes Hadoop beyond simply a single-use data platform for batch processing to a multi-use platform that enables batch, interactive, online and stream processing. By acting as the primary resource manager and mediator of access to data stored in HDFS, YARN enables enterprises to store data in a single place and interact with it in multiple ways simultaneously and with consistent levels of service.

The Stinger Initiative was launched at the beginning of 2013 as a broad community-based effort to enhance the speed, scale and breadth of SQL semantics supported by Apache Hive. By including the recently released Hive 0.12 which is the culmination of phase 2 of the Stinger Initiative, HDP 2.0 represents a significant step forward for Hive, the de-facto standard for SQL access in Hadoop today and the only SQL interface designed for queries that scale from gigabytes to petabytes. Microsoft has been a critical partner in the development of HDP 2.0 and has contributed more than 6,000 engineering hours across various Apache projects, as well as porting HDP 2.0 to Windows, which will be available next month.

“The YARN based architecture of HDP 2.0 delivers on our mission to enable the modern data architecture by providing an enterprise Hadoop platform that deeply integrates with existing and future data center technologies,” said Shaun Connolly, vice president of corporate strategy, Hortonworks. “Hortonworks remains committed to delivering a tested, stable, and 100-percent open source Hadoop distribution of the most recent Apache project releases. Our approach ensures that HDP always includes the most proven community-driven innovations that are driving the enterprise deployments shaping the data architectures of tomorrow.

HDP 2.0 is the first enterprise Hadoop platform to include the latest enterprise features delivered in Hadoop 2 and all the related Apache projects, many of which had significant GA community releases within the last few weeks.  The key projects in HDP include:

  • Apache Hadoop 2.2.0
  • Apache Hive 0.12.0
  • Apache HCatalog
  • Apache Pig 0.12.0
  • Apache HBase 0.96
  • Apache Ambari 1.4.1
  • Apache ZooKeeper 3.4.5
  • Apache Oozie 4.0.0
  • Apache Sqoop 1.4.4
  • Apache Flume 1.4.0
  • Apache Mahout 0.8.0

A complete list of HDP features and enhancements can be found at: http://hortonworks.com/products/hdp/

HDP 2.0 is supported by a broad ecosystem of customers and partners, including the following strategic partners:

Microsoft
“We are excited to see the next generation of Hadoop delivered with HDP 2.0,” said Eron Kelly, General Manager, SQL Server at Microsoft. “The new YARN architecture is designed to give the Windows community the flexibility to run multiple application types in Hadoop, all sharing a common resource management infrastructure. We are proud to have had the opportunity to contribute significantly to key Apache projects with Hortonworks such as Stinger and Hive to make this a reality. ”
Teradata

“Hortonworks Data Platform 2.0 is a big leap in Hadoop capabilities,” said Scott Gnau, President, Teradata Labs. “Customers require and expect resource management to maintain consistent performance and service level agreements between IT and the business. HDP 2.0 fits with our approach and within the Teradata Unified Data Architecture to enable organizations to unlock value from all their data assets, and power deeper insights and new business applications.”

Availability

The Hortonworks Data Platform 2.0 is available now for download from the Hortonworks website at http://hortonworks.com/products/hdp/. HDP 2.0 for Windows will be available in the coming weeks.

Hortonworks is hosting a webinar on Tuesday, November 12 to discuss HDP 2.0. More information is available at http://info.hortonworks.com/HDP20_Webinar.html

About Hortonworks

Hortonworks is the only 100-percent open source software provider to develop, distribute and support an Apache Hadoop platform explicitly architected, built and tested for enterprise-grade deployments. Developed by the original architects, builders and operators of Hadoop, Hortonworks stewards the core and delivers the critical services required by the enterprise to reliably and effectively run Hadoop at scale. Our distribution, Hortonworks Data Platform, provides an open and stable foundation for enterprises and a growing ecosystem to build and deploy big data solutions. Hortonworks also provides unmatched technical support, training and certification programs. For more information, visit www.hortonworks.com. Get started and Go from Zero to Hadoop in 15 Minutes with the Hortonworks’ Sandbox.

###

For Additional Information Contact:

Keith Giannini or Anna Vaverka

Schwartz MSL

hortonworks@schwartzmsl.com

Tel: (415) 817-2500