Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.


Sign up for the Developers Newsletter

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.


Get Started


Ready to Get Started?

Download sandbox

How can we help you?

* I understand I can unsubscribe at any time. I also acknowledge the additional information found in Hortonworks Privacy Policy.
closeClose button
Open Source Projects




Druid is an open-source analytics data store designed for business intelligence (OLAP) queries on event data. Druid provides low latency (real-time) data ingestion, flexible data exploration, and fast data aggregation. Existing Druid deployments have scaled to trillions of events and petabytes of data. Druid is most commonly used to power user-facing analytic applications.

Druid and the Druid logo are copyright Metamarkets Group Inc.
Druid is a registered trademark of Metamarkets Group Inc.

What Druid Does?

Key Features:

Feature Description
Sub-Second Queries Druid delivers sub-second queries, even when you have terabytes of data and dozens of dimensions.
Real-Time Data Ingestion Druid makes real-time a reality. Query data seconds after it arrives. Native integration with Apache Kafka makes it simple to enable real-time analytics.
Integrated with Apache Hive Build OLAP cubes and run sub-second SQL queries using any Hive-compatible tool.
Apache Ambari Integration Apache Ambari makes deploying, configuring and monitoring Druid a breeze..

How Druid Works

Druid is fast because data is converted into a heavily indexed columnar format that is ideal for typical OLAP query patterns. Druid is queried through Hive SQL, using the Druid to Hive connector included in HDP, or through a native REST API.

Hortonworks Focus for Druid

Hortonworks focuses on enabling fast, scalable analytics that seamlessly combines historical and real-time data.

  • Real-Time Analytics: The Druid / Hive connector lets you build OLAP cubes using SQL, or tap in to existing Druid cubes. Or take advantage of Hive’s powerful SQL support to perform deep analytics on your Druid data.
  • Management: Apache Ambari makes it easy to deploy, configure, monitor and manage Druid clusters.
  • Security: Druid now fully supports Kerberos and secure Hadoop, and Apache Ambari manages all the heavy lifting of securing your Druid cluster.

Related Videos


Druid in the Press

Webinars & Presentations