Hortonworks Data Platform
HDP 2.3 - Another major advance for Open Enterprise Hadoop.
Hortonworks Data Platform 2.3 represents yet another major step forward for Hadoop as the enterprise data platform. This release incorporates the most recent innovations that have happened in Hadoop and its supporting ecosystem of projects. HDP 2.3 packages more than a hundred new features across all our existing projects. Every component is updated and we have added some key technologies and capabilities to HDP 2.3.
Key highlights of HDP 2.3 include:
Breakthrough Usability for Hadoop
HDP 2.3 eliminates much of the complexity of administering Hadoop and improves developer productivity.
HDP 2.3 leverages the Ambari Views Framework to deliver new user views and a breakthrough user experience for both cluster operators and developers.
...and for developers
- Fast and easy SQL Editor for Hive. An integrated experience that allows for SQL query building, displays a visual “explain plan”, and offers an extended debugging experience when using the Tez execution engine.
- Easy Pig editor and web-based HDFS browser. In addition to the SQL builder, a Pig Latin Editor brings a modern browser-based IDE experience to Pig. There is also a File Browser for HDFS.
- An entirely new user experience for Apache Falcon. A web-forms based approach allows for rapid development of feeds and processes. The new Falcon UI also allows you to search and browse processes that have executed, visualize lineage, and set up mirroring jobs to replicate files and databases between clusters or to cloud storage such as Microsoft Azure Storage.
Impressive improvements across all data access engines
Consolidating access to data with YARN as its architectural center. As organizations strive to efficiently store their data in a single repository and interact with it simultaneously in different ways, they need SQL, streaming, data science, batch and more, all in the same cluster. HDP 2.3 adds new engines including:
New capabilities for feature-rich Spark applications
Apache Spark on YARN is enhanced with the new DataFrame API, machine learning algorithms such as clustering and frequent pattern mining, and a technology preview of SparkSQL.
Advances towards comprehensive security
Security administrators can define and manage security policies and capture security audit information for HDFS, Hive, HBase, Knox and Storm, and now also for Solr, Kafka and YARN.
Security administrators can manage keys and authorization policies for the key management store (KMS). HDP 2.3 introduces transparent data encryption for data in HDFS files, combined with the open source Hadoop KMS embedded in Apache Ranger.
HDP 2.3 provides a framework for users to control access based on dynamic rules, such as time or location, by introducing hooks to manage dynamic policies.
Introducing Apache Atlas
A common approach to Hadoop data governance from the open source community
As enterprises across all major industries deploy Hadoop into corporate data and processing environments, a common approach to working with metadata and data governance becomes a necessity.
Apache Atlas was created by a consortium of enterprises and Hortonworks to meet this need. Atlas enhances governance capabilities in Hadoop for both prescriptive and forensic models enriched by taxonomical metadata. Atlas, at its core, is designed to exchange metadata with other tools and processes within and outside of the Hadoop stack. Atlas enables platform-agnostic governance controls that effectively address enterprise compliance requirements.
Proactive monitoring and maintenance with your HDP Support Subscription
Deploy HDP with proactive and intelligent support. Hortonworks SmartSense gathers insight, provides recommendations, and helps optimize cluster utilization and health. Hortonworks SmartSense is included with every HDP Support Subscription.
By easily capturing log files and metrics for insight and resolution.
Via an intelligent stream of cluster analytics and data-driven recommendations.
Through a proactive view into the customer’s cluster utilization that can be used to drive capacity planning.