Posts by Jeff Sposetti:


Hortonworks Data Platform 1.3 Release: The community continues to power innovation in Hadoop

HDP 1.3 release delivers on community-driven innovation in Hadoop with SQL-IN-Hadoop, and continued ease of enterprise integration and business continuity features.

Almost one year ago (50 weeks to be exact) we released Hortonworks Data Platform 1.0, the first 100% open source Hadoop platform into the marketplace.  The past year has been dynamic to say the least!  However, one thing has remained constant: the steady, predictable cadence of HDP releases.  In September 2012 we released 1.1, this February gave us 1.2 and today we’re delighted to release HDP 1.3.

HDP 1.3 represents yet another significant step forward and allows customers to harness the latest innovation around Apache Hadoop and its related projects in the open source community.  In addition to providing a tested, integrated distribution of these projects, HDP 1.3 includes a primary focus on enhancements to Apache Hive, the de-facto standard for SQL access in Hadoop as well as numerous improvements that simplify ease of use.

The Relentless March of Community Driven Innovation

Consistent with our approach and together with many others in the community, Hortonworks has been working hard to progress the Hadoop projects at the Apache Software Foundation.  We believe that identifying enterprise requirements, introducing them into the community and working within those projects at the ASF is the fastest path to innovation and HDP 1.3 represents that philosophy realized.

Hortonworks Data Platform Releases

By incorporating all of the latest relevant and stable Apache project releases in HDP 1.3 we are able to provide our customers with the most up-to-date Hadoop platform available.  And because it is 100% open source, it eliminates any notion of vendor lock-in

In fact, the graphic above illustrates the progress we have made in a very short time.

By applying our consistent approach to innovation and maintaining a cadence of releases we believe that we can greatly accelerate Hadoop adoption and enable an ever-larger number of customers to adopt Apache Hadoop as a core component of their enterprise data architecture.

HDP 1.3, SQL-IN-Hadoop: Phase 1 of the Stinger Initiative

Stinger InitiativeApache Hive is the defacto standard for SQL access in Hadoop, and the Stinger Initiative is a coordinated effort by Hortonworks and many others to enhance Hive for the emerging requirement for interactive queries in Hadoop.

HDP 1.3 is the first distribution to include Apache Hive 0.11 which delivers a 50x improvement in performance for queries and broadens the range of SQL semantics supported in Hadoop as part of the Stinger Initiative.  Incorporating over 350 enhancements contributed by a broad community of over 55 developers from more than 10 organizations, Hive 0.11 is a phenomenal demonstration of the power of the community!

Ease of Use and Business Continuity

As the user base for Hadoop expands quickly, HDP 1.3 continues the focus on ease of use to include the following set of capabilities:

Ease Of Use

  • This release provides NFS v3 standards-based access to HDFS so that file system can be accessed as a mounted drive on the network, simplifying movement of data in and out of Hadoop.
  • HDP 1.3 provides more access to enterprise data from Hadoop with optimized Oracle and Netezza connectors, enhanced HCatalog support for Sqoop and the ability to transfer Sqoop direct loads to/from RCFile and ORCFile.
  • Apache Ambari, the open source management and provisioning solution for Apache Hadoop was upgraded to include job diagnostic improvements, more customization options, new heatmaps and broader support for existing enterprise platforms.

Business Continuity

  • HDP 1.3 delivers file dataset (HDFS) and HBase snapshots for point-in-time disaster recovery functionality.
  • An upgrade to HBase 0.94.6.1 provides multi-master high availability, table snapshots and shortened recovery times for online applications built on Hadoop.

We are very pleased to bring you HDP 1.3, and encourage you to download it today.

 

Hortonworks Data Platform 2.0 Alpha is Now Available for Preview!

We are very excited to announce the Alpha release of the Hortonworks Data Platform 2.0 (HDP 2.0 Alpha).

HDP 2.0 Alpha is built around Apache Hadoop 2.0, which improves availability of HDFS with High Availability for the NameNode along with several performance and reliability enhancements. Apache Hadoop 2.0 also significantly advances data processing in the Hadoop ecosystem with the introduction of YARN, a generic resource-management and application framework to support MapReduce and other paradigms such as real-time processing and graph processing.

In addition to Apache Hadoop 2.0, this release includes the essential Hadoop ecosystem projects such as Apache HBase, Apache Pig, Apache Hive, Apache HCatalog, Apache ZooKeeper and Apache Oozie to provide a fully integrated and verified Apache Hadoop 2.0 stack

Apache Hadoop 2.0 is well on the path to General Availability, and is already deployed at scale in several organizations; but it won’t get to the current maturity levels of the Hadoop 1.0 stack (available in Hortonworks Data Platform 1.x) without feedback and contributions from the community.

Hortonworks strongly believes that for open source technologies to mature and become widely adopted in the enterprise, you must balance innovation with stability. With HDP 2.0 Alpha, Hortonworks provides organizations an easy way to evaluate and gain experience with the Apache Hadoop 2.0 technology stack, and it presents the perfect opportunity to help bring stability to the platform and influence the future of the technology.

Learn More
Please take a look at the Hortonworks Documentation to learn more about installing and using HDP 2.0 Alpha.

To learn more about Apache Hadoop YARN, Arun Murthy — Chair of Apache Hadoop PMC and YARN/MapReduce lead – and the rest of Hortonworks YARN development team, have a great four-part Blog series on the technology: one, two, three and four.

Download It
You can download the HDP 2.0 Alpha bits from the Hortonworks Download site.

Tell Us About It
Please visit the HDP 2.0 Alpha Forum to ask questions, get help, provide feedback and hear what others are doing with HDP.

Note: This Alpha release is early access and not for production use. Support is only available via Forums. Additionally, this is an early access release, you might find some incomplete features or a bit of instability.

We are excited about the opportunities that Hadoop 2.0 provides for the future of Hadoop and Big Data. The HDP 2.0 Alpha release is just the beginning. Enjoy!