Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics, offering information and knowledge of the Big Data.

cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
cta
HDP Developer: Real-time Development

cloud Register For Upcoming Courses

Schedule

OVERVIEW

This 4 day training course is designed for developers who need to create real-time applications to ingest and process streaming data sources using Hortonworks Data Platform (HDP) and Hortonworks Data Flow (HDF) environments. Specific technologies covered includes: Apache Hadoop, Apache Kafka, Apache Storm & Trident, Apache Spark and Apache HBase as well as Apache NiFi. The highlight of the course is the custom workshop-styled labs that will allow participants to build streaming applications with Storm and Spark Streaming.

PREREQUISITES

Students should be familiar with programming principles and have experience in software development. Java programming experience is required. SQL and light scripting knowledge is also helpful. No prior Hadoop knowledge is required.

TARGET AUDIENCE

Developers and data engineers who need to understand and develop real-time / streaming applications on HDP and HDF.

Day 1: HDP Real-Time Architecture and Components

Objectives

*

Real-time architecture & overview of the class

*

Identify the relevant HDP/HDF components

*

Spark ecosystem overview

*

RDD Programming

Hands-on Labs

*

Using HDFS Commands

*

Introduction to SPARK REPLs and Zeppelin

*

Create and Manipulate RDDs

Day 2: Real-Time Processing with Spark Streaming

Objectives

*

Pair RDD Programming

*

Spark Streaming

*

Kafka Architecture

*

HBase Architecture

Hands-On Labs

*

Create and Manipulate Pair RDDs

*

Spark Streaming Using HDFS Directories and TCP Sockets

*

Spark Streaming Transformations

*

Spark Streaming Window Transformations

Day 3: Real-Time Processing with Storm

Objectives

*

Storm Architecture

*

Building Storm Topologies

*

Advanced Storm Features

*

Storm Integrations

Hands-On Labs

*

Storm WordCount

*

Integrating Storm with Kafka

*

Storm Workshop with Kafka and HBase

Day 4: Building DataFlows with HDF/NiFi

Objectives

*

Introduction to HDF/NiFi

*

NiFi Architecture

*

Developing DataFlows

Hands-On Labs / Demos

*

NiFi User Interface

*

Building a NiFi Data Flow

*

Processor Group

*

Remote Processor Group