8 hours (online)
1 day (VILT)
100% self-paced, online exploration (for employees, partners, or support subscription customers)
100% instructor-led discussion
No previous Hadoop or programming knowledge is required. Students will need browser access to the Internet.
Data architects, data integration architects, managers, C-level executives, decision makers, technical infrastructure team, and Hadoop administrators or developers who want to understand the fundamentals of Big Data and the Hadoop ecosystem.
Hortonworks University provides an immersive, real-world experience through scenario-based training courses. Classes are available in the classroom or online, from anywhere in the world.
Upon completing the course, students will be able to:
Describe the case for Hadoop
Identify the Hadoop Ecosystem architecture
Data Management - HDFS, YARN
Data Access - Pig, Hive, HBase, Storm, Solr, Spark
Data Governance & Integration - Falcon, Flume, Sqoop, Kafka, Atlas
Security - Kerberos, Falcon, Knox
Operations - Ambari, Zookeeper, Oozie, Cloudbreak
Observe popular data transformation and processing engines in action: Apache Hive, Apache Pig, Apache Spark
Detail the architecture and features of YARN
Describe backup and recovery options
Describe how to secure Hadoop
Explain the fundamentals of parallel processing
Describe data ingestion options and frameworks for batch and real-time streaming
Detail the HDFS architecture