HDP Developer: Real-time Development


OVERVIEW

This 4-day training course is designed for developers who need to create real-time applications that ingest and process streaming data sources in Hortonworks Data Platform (HDP) and Hortonworks DataFlow (HDF) environments. Technologies covered include Apache Hadoop, Apache Kafka, Apache Storm and Trident, Apache Spark, Apache HBase, and Apache NiFi. The highlight of the course is a set of custom workshop-style labs in which participants build streaming applications with Storm and Spark Streaming.

REGISTER NOW FOR UPCOMING TRAINING IN LONDON
Hortonworks University is delighted to announce specialized public training classes in London. The courses will be delivered by Hortonworks instructors coming directly from HQ. These offerings are usually reserved for private onsite training, so this is a unique opportunity for all customers to attend, either in class or remotely via our virtual ILT option. A 4-day session of HDP Developer: Real-time Development will run from Monday, November 20 to Thursday, November 23. Click the button below for additional information, or contact Thai Thanh.

Click Here for Additional Information or to Reserve Your Seat

PREREQUISITES

Students should be familiar with programming principles and have experience in software development. Java programming experience is required. Knowledge of SQL and light scripting is also helpful. No prior Hadoop knowledge is required.

TARGET AUDIENCE

Developers and data engineers who need to understand and develop real-time / streaming applications on HDP and HDF.

Day 1: HDP Real-Time Architecture and Components

Objectives

* Real-time architecture & overview of the class
* Identify the relevant HDP/HDF components
* Spark ecosystem overview
* RDD Programming

Hands-On Labs

* Using HDFS Commands
* Introduction to Spark REPLs and Zeppelin
* Create and Manipulate RDDs

Day 2: Real-Time Processing with Spark Streaming

Objectives

* Pair RDD Programming
* Spark Streaming
* Kafka Architecture
* HBase Architecture

Hands-On Labs

* Create and Manipulate Pair RDDs
* Spark Streaming Using HDFS Directories and TCP Sockets
* Spark Streaming Transformations
* Spark Streaming Window Transformations

Day 3: Real-Time Processing with Storm

Objectives

* Storm Architecture
* Building Storm Topologies
* Advanced Storm Features
* Storm Integrations

Hands-On Labs

* Storm WordCount
* Integrating Storm with Kafka
* Storm Workshop with Kafka and HBase

Day 4: Building DataFlows with HDF/NiFi

Objectives

* Introduction to HDF/NiFi
* NiFi Architecture
* Developing DataFlows

Hands-On Labs / Demos

* NiFi User Interface
* Building a NiFi Data Flow
* Process Group
* Remote Process Group