HDP Developer: Quick Start

Overview

This 4-day training course is designed for developers who need to create applications that analyze Big Data stored in Apache Hadoop using Apache Pig and Apache Hive, and who need to develop applications on Apache Spark.

Topics include an essential understanding of HDP and its capabilities; Hadoop, YARN, HDFS, and MapReduce/Tez; data ingestion; using Pig and Hive to perform data analytics on Big Data; and an introduction to Spark Core, Spark SQL, Apache Zeppelin, and additional Spark features.

Prerequisites

Students should be familiar with programming principles and have experience in software development. Knowledge of SQL and light scripting is also helpful. No prior Hadoop knowledge is required.

Target Audience

Developers and data engineers who need to understand and develop applications on HDP.

Day 1

An Introduction to Apache Hadoop and HDFS

Objectives

  • The Case for Hadoop
  • The Hadoop Ecosystem
  • The HDFS Architecture
  • Ingesting Data Into HDFS
  • Parallel Processing Fundamentals
  • YARN Architecture
  • Introduction to Apache Pig

Labs

  • Starting an HDP Cluster
  • Using HDFS Commands
  • Demonstration: Understanding Apache Pig
  • Getting Started with Apache Pig
  • Exploring Data with Pig

Advanced Apache Pig Programming

Advanced Apache Hive Programming

Working with Pair RDDs and Building YARN Applications
