Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.

cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
cta
HDP Developer: Real-time Development

OVERVIEW

This 4 day training course is designed for developers who need to create real-time applications to ingest and process streaming data sources using Hortonworks Data Platform (HDP) and Hortonworks Data Flow (HDF) environments. Specific technologies covered includes: Apache Hadoop, Apache Kafka, Apache Storm & Trident, Apache Spark and Apache HBase as well as Apache NiFi. The highlight of the course is the custom workshop-styled labs that will allow participants to build streaming applications with Storm and Spark Streaming.

PREREQUISITES

Students should be familiar with programming principles and have experience in software development. Java programming experience is required. SQL and light scripting knowledge is also helpful. No prior Hadoop knowledge is required.

TARGET AUDIENCE

Developers and data engineers who need to understand and develop real-time / streaming applications on HDP and HDF.

1
Day

An Overview of Hadoop, HDFS and Zeppelin

Objectives

  • Real-time Architecture Overview
  • Apache Hadoop Primer
  • HDFS Architecture Overview
  • Apache Zeppelin and Apache Spark Overview
  • RDD Programming

Labs

  • Validating the Lab Environment
  • Using HDFS Commands
  • Introduction to SPARK REPLs and Zeppelin
  • Creating and ManipulatingRDDs

An Overview of Spark Streaming, Apache Kafka and Apache HBase

Working with Apache Storm

An Introduction to Hortonworks Data Flow

Live Training

LIVE CLASS
DATE & TIME
LOCATION
REGISTER