Apache Storm: Real-Time Processing in Hadoop
I recently sat down with Himanshu Bari to discuss how Apache Ambari will serve as the single point of management for Hadoop 2 clusters integrated with Apache Storm and its real-time, streaming event processing.
Himanshu discusses Apache Storm’s five key benefits and how those will add to the power and stability of a Hadoop 2 stack, providing analysis of huge data flows from the second data is created and then for decades of historical analysis of that data stored in HDFS.
Other highlights include:
- The reasons for adding Apache Storm to Hortonworks Data Platform
- How Apache Hadoop YARN opened the door for integration of Storm into Hadoop
- Two general use case patterns with Storm and specific uses in transportation and advertising
- How Ambari will provide a single view and common operational platform for enterprises to distribute cluster resources across different workloads