We just concluded our highly attended 7-part Data-In-Motion webinar series. The final installment was a very informative session on how Apache NiFi, Kafka and Storm work together. Slides and Q&A below. Should you have any more questions, anytime, we encourage you to check out the Data Ingestion & Streaming track of Hortonworks Community Connection where […]

We recently hosted a webinar on the newest features of Hortonworks DataFlow 2.0, highlighting the new user interface, new processors in Apache NiFi, Apache NiFi multi-tenancy, the Apache NiFi zero-master clustering architecture, and Apache MiNiFi. One of the first things you may have noticed in Hortonworks DataFlow 2.0 is the new user interface based on Apache […]

We recently hosted a webinar on HDF 2.0 and the integration between Apache NiFi, Apache Ambari and Apache Ranger. We thought we would share the questions and answers from the webinar and compile the relevant information in a single place, so it is easy to find and reference. Should you have any […]

One of the most enjoyable parts of my job is working with customers and partners who have innovated on the Hortonworks Connected Data Platform, companies like Servient. Here’s a great real-world example of a recent use case we worked on together for a customer in the energy vertical. I’ve removed the actual name for obvious reasons. […]

We recently concluded this webinar series, with 7 webinars and 77 questions answered. All webinars, slides, Q&A and related info are available below. Should you have any more questions, anytime, we encourage you to check out the Data Ingestion & Streaming track of Hortonworks Community Connection where an entire community of folks are monitoring and […]

My life as part of a high-performance team
Last week we released Hortonworks DataFlow (HDF) 2.0. It was a great one-year anniversary present for me: a new release of the product I’ve been supporting since I joined Hortonworks a year ago. I’ve had the privilege of working with the most talented, quick-thinking, […]

Enterprise Productivity and Integration of Apache NiFi, Kafka and Storm, together with Ambari and Ranger
We are pleased to announce that Hortonworks DataFlow (HDF™) Version 2.0 is now generally available for download! As part of an Open and Connected Data Platforms offering from Hortonworks, HDF 2.0 provides a new level of enterprise integration for data […]

Streaming analytics to create an accurate single buyer identity in real time
The 4th and final demo of the Data Hacks & Demos session at Hadoop Summit San Jose was presented by Simon Ball. It showcased how Apache NiFi moved parallel streams of streaming data into Spark and then more analysis could be done by […]

Use IoT to get real-time feedback on customer preferences and respond to them
The 3rd demo of the Data Hacks & Demos session at Hadoop Summit San Jose was audience participation time! Kay Lerch demonstrated how to interact with the audience through specific Twitter and SMS messages sent to a specific phone number, […]

Hortonworks DataFlow (HDF) combines Apache NiFi, Kafka and Storm. HDF 2.0 has significant architecture and enterprise productivity features that make it faster and easier to deploy, manage and analyze streaming data. In the next few weeks we will go into more detail, but for now, here are the three highlights to take note […]

Apache NiFi to prioritize which images should be sent to Spark in the cloud for computer vision machine learning
During the 2nd demo of the Data Hacks & Demos session at Hadoop Summit San Jose, Simon Ball demonstrated how to take data received from the edge, and run facial recognition on a more powerful cloud […]

Match image to an identifier, correlate with data and initiate a personalized, real-time electronic conversation with the customer in store
During the 1st demo of the Data Hacks & Demos session at Hadoop Summit San Jose, Jeremy Dyer modelled the scenario of a customer walking into a store, where a retailer can find out who they […]

So, it’s been a month since Hadoop Summit San Jose, where over 5000 of the leading tech innovators in big data came together to share their inventions, wisdom and know-how. One of the sessions, a PowerPoint-free zone, was Data Hacks & Demos, a keynote session hosted by Joe Witt and starring an international […]

In preparation for Hadoop Summit San Jose, I asked the Chair of the Apache Committer Insights track, Andy Feng, VP Architecture at Yahoo!, which top 3 sessions he would recommend. Although it was tough to choose only 3, he recommended: HDFS: Optimization, Stabilization and Supportability Speakers: Chris Nauroth from Hortonworks and Arpit Agarwal […]

Apache Hadoop® exists within a broader ecosystem of enterprise analytical packages. This includes ETL tools, ERP and CRM systems, enterprise data warehouses, data marts and others. Modern workloads flow from these various traditional analytical sources into Hadoop and then often back out again. What dataset came from which system, when and how did it change over […]