Innovation in the Open

Working with the Community, for the Enterprise
These labs offer an open roadmap for the development of Hortonworks Data Platform, outlining what we have delivered and what we are continuing to deliver to lead innovation within the Hadoop ecosystem.

Engineering Hadoop at its core

Hortonworks engineers are spearheading initiatives to fundamentally improve the performance and capabilities of Hadoop:
Speed, Scale and SQL Semantics
The Stinger Initiative is a broad, community-based effort to drive the future of Apache Hive, delivering 100x performance improvements at petabyte scale with familiar SQL semantics.
The performance changes we are making today will transform Hive into a single tool that Hadoop users can use for report generation, ad hoc queries, and large batch jobs spanning tens or hundreds of terabytes.
The Hadoop Operating System
Apache Hadoop YARN is the data operating system for Hadoop 2.0. YARN enables a user to interact with all data in multiple ways simultaneously, making Hadoop a true multi-use data platform and allowing it to take its place in a modern data architecture.
Hadoop 2.0 is truly a fundamental architectural change, one that makes Hadoop significantly more than just a batch platform.
Stream Data Processing
Early adopters are using stream processing engines such as Apache Storm to analyze data in real time. Hortonworks has initiated an engineering commitment to deeply integrate Storm with Hadoop.
We are committed to deeply integrating Storm with Hadoop, specifically as a supported component of the 100% open source Hortonworks Data Platform.
Simplified Data Processing for Hadoop
The goal of the Data Management Initiative is to simplify the creation of data processing solutions for Hadoop. This effort will help enterprises construct solutions that maximize reuse and consistency.
As organizations move more and more data into Hadoop, the requirement to intelligently and automatically categorize and move data has become paramount. Projects like Apache Falcon have been created to meet these needs.
Security for Enterprise Hadoop
A roadmap for flexible, accountable, integrated enterprise security in Hadoop. The roadmap is organized around security best practices for authentication, authorization, accounting and data protection.
Open, Integrated & Intuitive IT Tools
A completely open set of features for provisioning, managing and monitoring Enterprise Hadoop clusters. These will easily integrate with existing IT systems, behind a single pane of glass, providing operational control and deep insight into cluster performance.

Enabling the enterprise with deep engineering partnerships

Working with partners, Hortonworks is bringing Hadoop to new platforms and new environments:
Elastic Hadoop on OpenStack
Project Savanna aims to provide operational agility and deployment flexibility for Hadoop across public and private clouds.
Expanding Hadoop with Microsoft
Microsoft and Hortonworks are collaborating in the open to expand the reach of Apache Hadoop and its ecosystem components.
Collaboration for Enterprise Data Apps
The expanded strategic alliance between Hortonworks and Red Hat has two open source leaders collaborating to develop the best in enterprise data solutions.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.
Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade, having been built, tested and hardened with enterprise rigor.
