cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button

We just concluded our highly attended 7-part Data-In-Motion webinar series. The final installment was a very informative session on how Apache NiFi, Kafka and Storm work together. Slides and Q&A below. Should you have any more questions, anytime, we encourage you to check out the Data Ingestion & Streaming track of Hortonworks Community Connection where […]

We recently hosted a webinar on the newest features of Hortonworks DataFlow 2.0 highlighting: the new user interface new processors in Apache NiFi Apache NiFi multi-tenancy Apache NiFi zero master clustering architecture Apache MiNiFi One of the first things you may have noticed in Hortonworks DataFlow 2.0 is the new user interface based on Apache […]

We recently hosted a webinar on the topic of  HDF 2.0 and the integration between Apache NiFi, Apache Ambari and Apache Ranger.  We thought we would share the questions & answers from the webinar, and also compile relevant data into a single place to make it easy to find and reference. Should you have any […]

We recently concluded this webinar series, with 7 webinars and 77 questions answered. All webinars, slides, Q&A and related info are available below. Should you have any more questions, anytime, we encourage you to check out the Data Ingestion & Streaming track of Hortonworks Community Connection where an entire community of folks are monitoring and […]

My life as part of a high performance team Last week we released Hortonworks DataFlow HDF 2.0. It was a great 1 year anniversary present for me – a new release of the product I’ve been supporting since I joined Hortonworks a year ago. I’ve had the privilege of working with the most talented, quick-thinking, […]

Enterprise Productivity and Integration of Apache NiFi, Kafka and Storm, together with Ambari and Ranger We are pleased to announce that Hortonworks DataFlow (HDF™) Version 2.0 is now generally available for download!  As part of a Open and Connected Data Platforms offering from Hortonworks, HDF 2.0 provides a new level of enterprise integration for data […]

Streaming analytics to create an accurate single buyer identity in real-time The 4th and final demo of the Data Hacks & Demos session, at Hadoop Summit San Jose, was done by Simon Ball and it showcased how Apache NiFi moved parallel streams of streaming data into Spark and then more analysis could be done by […]

Use IoT to get real-time feedback on customer preferences and respond to them During the 3rd demo of the Data Hacks & Demos session, at Hadoop Summit San Jose, it was audience participation time! Kay Lerch demonstrated how to interact with the audience, through specific twitter and SMS messages sent to a specific phone number, […]

Hortonworks Dataflow (HDF) offers a combination Apache NiFI, Kafka and Storm. HDF 2.0 has significant architecture and enterprise productivity features to make it faster and easier to deploy, manage and analyze streaming data. In the next few weeks, we will go into more details, but for now, here are the three highlights to take note […]

Apache NiFi to prioritize which images should be sent to Spark in the cloud for computer vision machine learning During the 2nd demo of the Data Hacks & Demos session, at Hadoop Summit San Jose, Simon Ball demonstrated how to take data received from the edge, and run facial recognition on a more powerful cloud […]

Match image to an identifier, correlate with data and initiate personalized, real time electronic convo with customer in store During the 1st demo of the Data Hacks & Demos session, at Hadoop Summit San Jose, Jeremy Dyer modelled the scenario of a customer walking into a store, where a retailer can find out who they […]

So, it’s been a month since Hadoop Summit San Jose, where over 5000 of the leading tech innovators in big data came together to share their inventions, wisdom and know-how. One of the sessions – a powerpoint free zone, was Data Hacks & Demos, a keynote session hosted by Joe Witt and starring an international […]

Significant Throughput and Latency Gains Between Apache Storm 0.9 and 1.0 The release of version 1.0 marks another major milestone for Apache Storm. Since becoming an Apache project in Sept 2013, much work has gone into maturing the feature set and also improving performance by reworking or tweaking various components. (See A Brief History of […]

Debugging distributed systems can be difficult largely because they are designed to run on many (possibly thousands) of hosts in a cluster. This process typically involves monitoring and analyzing log files spread across the cluster, and if the necessary information is not being logged, service restarts and job redeployment may be required. Not only is […]

Part 1: A Little History In this series of blog posts, we will provide an in-depth look select features introduced with the release of Apache Storm (Storm) 1.0. To kick off the series, we’ll take a look how Storm has evolved over the years from its beginnings as an open source project, up to the […]