Data Lifecycle & Governance
Ready to Get Started?READ THE BLOG
HDP is the industry's only true secure, enterprise-ready open source Apache™ Hadoop® distribution based on a centralized architecture (YARN). HDP addresses the complete needs of data-at-rest, powers real-time customer applications and delivers robust analytics that accelerate decision making and innovation.START SUBSCRIPTION
YARN and Hadoop Distributed File System (HDFS) are the cornerstone components of Hortonworks Data Platform (HDP). While HDFS provides the scalable, fault-tolerant, cost-efficient storage for your big data lake, YARN provides the centralized architecture that enables you to process multiple workloads simultaneously. YARN provides the resource management and pluggable architecture for enabling a wide variety of data access methods.
Hortonworks Data Platform includes a versatile range of processing engines that empower you to interact with the same data in multiple ways, at the same time. This means applications can interact with the data in the best way: from batch to interactive SQL or low latency access with NoSQL. Emerging use cases for data science, search and streaming are also supported with Apache Spark, Storm and Kafka.
HDP extends data access and management with powerful tools for data governance and integration. They provide a reliable, repeatable, and simple framework for managing the flow of data in and out of Hadoop. This control structure, along with a set of tooling to ease and automate the application of schema or metadata on sources is critical for successful integration of Hadoop into your modern data architecture.
Hortonworks has engineering relationships with many leading data management providers to enable their tools to work and integrate with HDP.
Security is woven and integrated into HDP in multiple layers. Critical features for authentication, authorization, accountability and data protection are in place to help secure HDP across these key requirements. Consistent with this approach throughout all of the enterprise Hadoop capabilities, HDP also ensures you can integrate and extend your current security solutions to provide a single, consistent, secure umbrella over your modern data architecture.
Operations teams deploy, monitor and manage a Hadoop cluster within their broader enterprise data ecosystem. Apache Ambari simplifies this experience. Ambari is an open source management platform for provisioning, managing, monitoring, and securing the Hortonworks Data Platform. It enables Hadoop to fit seamlessly into your enterprise environment.
Cloudbreak, as part of Hortonworks Data Platform and powered by Apache Ambari, allows you to simplify the provisioning of clusters in any cloud environment including; Amazon Web Services, Microsoft Azure, Google Cloud Platform and OpenStack. It optimizes your use of cloud resources as workloads change.
Speed up Spark StreamingExperience performance gains up to 10 times for applications that store large datasets such as state management, through a revamped Spark Streaming state tracking API.
Seamless Data AccessAchieve higher performance with Spark 1.6 through a new Dataset API which is an extension of DataFrame API and also supports compile-time type checking.
Dynamic Executor Allocation Utilize cluster resources efficiently through Dynamic Executor Allocation functionality that automatically expands and shrinks resources based on utilization.
More Flexible UpgradesAmbari 2.2 provides Hadoop operators a faster way to upgrade their clusters by automating both maintenance and feature releases, while the cluster is down.
Simplified Security OperationsService configurations for Ranger provides a continuation of the new user experience. In addition, optional storage of Kerberos credentials and customizable security settings simplify administration and provide additional security.
Improved Troubleshooting Ambari 2.2 makes it easier and faster to perform troubleshooting with customizable metric widget graph display timezone and the ability to export metrics to identify and respond to problems quickly.
Try out the latest HDP features and functionality with Hortonworks Sandbox, or set HDP up for a production environment, install and configure your clusters.
Check out HDP add-ons for connecting with popular BI tools, powering search queries and more.
Progressive Insurance is one of the largest U.S. auto insurance companies. The team turned to Hortonworks Data Platform to transform its business with massive ingest of new types of data. Progressive uses HDP for ad placement and to store driving data for its usage-based insurance products.