Hortonworks DataFlow (HDF)

The Answer to All Your Real-Time Streaming Data Problems

Overview

Hortonworks DataFlow (HDF) is a scalable, real-time streaming analytics platform that ingests, curates and analyzes data to deliver key insights and immediately actionable intelligence. DataFlow addresses the main challenges enterprises face with data-in-motion: real-time stream processing at high volume and scale, data provenance, and ingestion from IoT devices, edge applications and streaming sources.

Benefits

Drastically reduce your data integration development time

Imagine a no-code approach to building complex data pipelines with minimal effort. HDF offers a simple visual user interface for building sophisticated data flows that handle data ingestion, transformation and enrichment from a variety of streaming sources. Powered by Apache NiFi, HDF can ingest data from a range of sources that generate real-time streaming data, including devices, enterprise applications, partner systems and edge applications.


Blog: What’s new in Hortonworks DataFlow (HDF) 3.2?

Manage and secure your data from edge to enterprise

HDF enables high-volume data collection at the edge, including directly from edge devices using MiNiFi. You can set up widely distributed IoT deployment models for regional data collection, using NiFi together with MiNiFi to stream data from the edge. Tight integration with Apache Ranger gives HDF consistent security across all your data-in-motion and data-at-rest.

Get real-time insights and actionable intelligence faster than ever

Real-time insights mean you can act sooner. Using the powerful streaming platform Apache Kafka, HDF can process several million transactions per second, identify key patterns, compare against machine learning models and offer predictive or prescriptive analytics to help business leadership make key decisions and seize opportunities.


Streaming Analytics Manager provides a visual way to build complex streaming applications, enabling data analysts and data scientists to understand key insights and gain actionable intelligence from real-time data.

White paper: Get started building streaming apps without writing a single line of code

Build a data architecture that adapts to IoT-scale

HDF is 100% open source technology, so you can design a future-proof architecture without vendor lock-in. It is a proven technology that hundreds of customers have chosen for mission-critical use cases. Customers can implement IoT solutions for sectors such as automotive, manufacturing, transportation, utilities, retail and the public sector. You can adopt a data strategy that handles large, highly diverse data volumes at high velocity.


White paper: The Essential Guide to Data-Driven IOT

Stay compliant with full governance of any streaming data

HDF is the only product in the industry offering data provenance and edge-to-enterprise data governance out of the box. In the age of GDPR and other compliance regulations, it is important to track data lineage, even for streaming data. NiFi within HDF provides data provenance tracking without any extra configuration or setup. With tight Apache Atlas integration, you have complete governance of data from the edge to the enterprise.

Customers

Clearsense
Clearsense is a smart data organization based in Jacksonville, Florida that is re-imagining and simplifying data analytics to help healthcare organizations realize measurable value from their data. They have developed a...
Johns Hopkins
Johns Hopkins University is an American private research university, founded in 1876 and located in Baltimore, Maryland. It is considered the first research university in the United States, and is organized into 10...
Hilton
Hilton is an American multinational hospitality company, founded in 1919 and headquartered in Tysons Corner, Virginia. Currently, its portfolio includes 5,500 hotels across 110 countries. Hilton has 14 brands across...