The Hadoop Operating System
Apache Hadoop YARN is the data operating system for Hadoop 2.0. YARN enables a user to interact with all data in multiple ways simultaneously, making Hadoop a true multi-use data platform and allowing it to take its place in a modern data architecture.
Hadoop 2.0 is a fundamental architecture change, one that makes Hadoop significantly more than just a batch platform.
Speed, Scale and SQL Semantics
The Stinger Initiative is a broad, community-based effort to drive the future of Apache Hive, delivering 100x performance improvements at petabyte scale with familiar SQL semantics.
The performance changes we are making today will transform Hive into a single tool that Hadoop users can use for report generation, ad hoc queries, and large batch jobs spanning tens or hundreds of terabytes.
Stream Data Processing
Early adopters are using stream processing engines such as Apache Storm to analyze data in real time. Hortonworks has made an engineering commitment to deeply integrate Storm with Hadoop.
We are committed to deeply integrating Storm with Hadoop, specifically as a supported component of the 100% open source Hortonworks Data Platform.
Simplified Data Processing for Hadoop
The goal of the Data Management Initiative is to simplify the creation of data processing solutions for Hadoop. This effort will help enterprises construct solutions that maximize reuse and consistency.
As organizations move more and more data into Hadoop, the need to categorize and move data intelligently and automatically has become paramount. Projects like Apache Falcon have been created to meet these needs.
Security for Enterprise Hadoop
A roadmap for flexible, accountable, integrated enterprise security in Hadoop. The roadmap is organized around security best practices for authentication, authorization, accounting and data protection.
Open, Integrated & Intuitive IT Tools
A completely open set of features for provisioning, managing and monitoring Enterprise Hadoop clusters. These features will integrate with existing IT systems behind a single pane of glass, providing operational control and deep insight into cluster performance.