Working within the community...
...for the enterprise

Hortonworks is committed to delivering HDP completely in the open. We introduce enterprise feature requirements into the public domain and we code to address those requirements. We contribute all of that code back to the wide array of Hadoop ecosystem projects managed within the Apache Software Foundation.

Advancing Hadoop Together

Community initiatives rally Hadoop users, developers and vendors towards common objectives

The Data Governance Initiative is a coalition of enterprise Hadoop users and vendors, working to develop an extensible foundation for comprehensive data governance.
The Stinger Initiative has successfully rallied contributions from hundreds of developers across dozens of companies in order to improve the speed, scale and semantics of SQL in Hadoop.
The Open Data Platform initiative aims to rally end users and vendors around a common core platform (the ODP Core), freeing the ecosystem to focus on building applications for important use cases.

Supporting Projects to Meet Enterprise Needs

Projects introduced to the ASF address security, management, operations and data access

Ambari is a framework for provisioning, deploying, managing, and monitoring Apache Hadoop clusters. Its intuitive tools and APIs simplify Hadoop operations.
Falcon enables automation of data movement and processing for ingest, pipelines, replication and compliance use cases.
Ranger delivers a comprehensive approach to security across Hadoop with centralized administration, authorization, audit and data protection.
Tez is an extensible framework for building high-performance batch and interactive data processing applications, coordinated by YARN in Apache Hadoop.
Slider is a YARN-based framework for deployment and management of long-running data access applications such as HBase, Accumulo and Storm.

YARN: Engineering Hadoop from the core

YARN is the ‘data operating system’ within Hadoop that enables common sets of data to be processed simultaneously by multiple applications.

Introduced as MR-279 by Arun Murthy in 2009, work on Hadoop’s next-generation architecture powered by YARN culminated in 2013 with the release of Hadoop 2.

Today, YARN is the center of innovation in and around Hadoop. It is the technology that allows multiple applications to share a common cluster and dataset, which in turn makes the modern data architecture possible.

Arun Murthy and more than 29 Hortonworks engineers are committers to core Hadoop

From Open-Source to Enterprise Data Platform

HDP: Completely Open-Source Apache Hadoop

Hortonworks develops within the governance model of the Apache Software Foundation, contributing to and advancing the individual components of the Hadoop ecosystem and ultimately integrating them into the Hortonworks Data Platform (HDP).

Assembling a complete platform like HDP requires choosing the right stable version of Apache Hadoop as the foundation and then integrating and packaging the optimal versions of all the other ASF components into a well-tested, certified data platform.

Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade, having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower-cost, higher-capacity infrastructure.