HDP Overview: Apache Hadoop Essentials

A Technical Understanding for Business Users and Decision Makers
This course provides a technical overview of Apache Hadoop. It includes high-level information about the concepts, architecture, operation, and uses of the Hortonworks Data Platform (HDP) and the Hadoop ecosystem. The course also serves as an optional primer for those who plan to attend a hands-on, instructor-led course.


8 hours – online only

Target Audience:

Data architects, data integration architects, managers, C-level executives, decision makers, technical infrastructure teams, and Hadoop administrators or developers who want to understand the fundamentals of Big Data and the Hadoop ecosystem.

Course Objectives:

After completing this course, students should be able to:

  • Describe what makes data “Big Data”
  • List data types stored and analyzed in Hadoop
  • Describe how Big Data and Hadoop fit into your current infrastructure and environment
  • Describe fundamentals of:
    – the Hadoop Distributed File System (HDFS)
    – YARN
    – MapReduce
    – Hadoop frameworks (Pig, Hive, HCatalog, Storm, Solr, Spark, HBase, Oozie, Ambari, ZooKeeper, Sqoop, Flume, and Falcon)
  • Recognize use cases for Hadoop
  • Describe the business value of Hadoop
  • Describe new technologies like Tez and the Knox Gateway


Prerequisites:

No previous Hadoop or programming knowledge is required. Students will need browser access to the Internet.

Lab Content:

No labs are required to complete this course. Attendees have the option of installing the Hortonworks Sandbox and working through the demonstrations.

