Certifications: Exam Objectives

Candidates should be able to perform all of the following objectives:

HDPCD Exam

Data Ingestion
Import data from a table in a relational database into HDFS
SQOOP-IMPORT
Import the results of a query from a relational database into HDFS
FREE-FORM QUERY IMPORTS
Import a table from a relational database into a new or existing Hive table
IMPORTING DATA INTO HIVE
Insert or update data from HDFS into a table in a relational database
SQOOP-EXPORT
Given a Flume configuration file, start a Flume agent
FLUME AGENT
Given a configured sink and source, configure a Flume memory channel with a specified capacity
MEMORY CHANNEL
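
As a rough illustration of the ingestion objectives above, the following Python sketch assembles the command lines and the channel settings involved. The JDBC URL, user name, table names, paths, agent name `a1`, and channel name `c1` are placeholders invented for this example. The flags and property keys are standard Sqoop and Flume options, but they should be checked against the versions installed on the cluster.

```python
"""Sketch: build Sqoop and Flume command lines for the ingestion objectives.

Every host, user, table, path, and agent name here is a made-up placeholder.
"""
import shlex

JDBC_URL = "jdbc:mysql://db.example.com/sales"  # hypothetical database
DB_USER = "etl_user"                            # hypothetical account


def _sqoop(tool):
    # Common prefix; -P makes Sqoop prompt for the password interactively.
    return ["sqoop", tool, "--connect", JDBC_URL, "--username", DB_USER, "-P"]


def sqoop_import_table(table, target_dir, mappers=1):
    """Import one relational table into an HDFS directory."""
    return _sqoop("import") + ["--table", table,
                               "--target-dir", target_dir,
                               "--num-mappers", str(mappers)]


def sqoop_import_query(query, split_by, target_dir):
    """Free-form query import; the query must contain the $CONDITIONS token."""
    if "$CONDITIONS" not in query:
        raise ValueError("free-form queries must contain $CONDITIONS")
    return _sqoop("import") + ["--query", query,
                               "--split-by", split_by,
                               "--target-dir", target_dir]


def sqoop_import_hive(table, hive_table):
    """Import a table straight into a new or existing Hive table."""
    return _sqoop("import") + ["--table", table,
                               "--hive-import", "--hive-table", hive_table]


def sqoop_export(table, export_dir, update_key=None):
    """Export HDFS data; with update_key, update matching rows or insert new ones.

    The allowinsert mode depends on support in the target database.
    """
    cmd = _sqoop("export") + ["--table", table, "--export-dir", export_dir]
    if update_key:
        cmd += ["--update-key", update_key, "--update-mode", "allowinsert"]
    return cmd


def flume_agent(conf_file, agent_name="a1"):
    """Start the Flume agent named agent_name from a configuration file."""
    return ["flume-ng", "agent", "--conf", "conf",
            "--conf-file", conf_file, "--name", agent_name]


def memory_channel_properties(agent="a1", channel="c1", capacity=1000):
    """Render the properties that declare a memory channel with a capacity."""
    return "\n".join([
        f"{agent}.channels = {channel}",
        f"{agent}.channels.{channel}.type = memory",
        f"{agent}.channels.{channel}.capacity = {capacity}",
    ])


if __name__ == "__main__":
    print(shlex.join(sqoop_import_table("orders", "/user/demo/orders")))
    print(shlex.join(flume_agent("conf/demo.conf")))
    print(memory_channel_properties(capacity=500))
```

Returning argument lists rather than shell strings keeps quoting explicit; a list can be passed to `subprocess.run` unchanged on a machine where the tools are installed.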
Data Transformation
Write and execute a Pig script
Load data into a Pig relation without a schema
Load data into a Pig relation with a schema
Load data from a Hive table into a Pig relation
Use Pig to transform data into a specified format
Transform data to match a given Hive schema
Group the data of one or more Pig relations
Use Pig to remove records with null values from a relation
Store the data from a Pig relation into a folder in HDFS
Store the data from a Pig relation into a Hive table
Sort the output of a Pig relation
Remove the duplicate tuples of a Pig relation
Specify the number of reduce tasks for a Pig MapReduce job
Join two datasets using Pig
Perform a replicated join using Pig
Run a Pig job using Tez
Within a Pig script, register a JAR file of User Defined Functions
Within a Pig script, define an alias for a User Defined Function
Within a Pig script, invoke a User Defined Function
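
The replicated join objective above is easy to see in miniature. The sketch below models it in plain Python: the small relation is held entirely in memory and the large relation is streamed past it, which is the idea behind Pig's `JOIN ... USING 'replicated'`. The relation contents and field positions are invented for illustration, and this models the semantics only, not Pig's implementation.

```python
from collections import defaultdict


def replicated_join(large, small, large_key, small_key):
    """Inner-join two relations (iterables of tuples) on the given field positions.

    The small relation is indexed in memory; the large one is read once.
    """
    index = defaultdict(list)
    for row in small:
        index[row[small_key]].append(row)
    for row in large:
        # .get avoids creating empty entries for keys with no match
        for match in index.get(row[large_key], ()):
            yield row + match


if __name__ == "__main__":
    # Hypothetical data: (order_id, customer_id, amount) and (customer_id, name)
    orders = [(1, "c1", 20.0), (2, "c2", 5.0), (3, "c9", 7.5)]
    customers = [("c1", "Ada"), ("c2", "Grace")]
    for joined in replicated_join(orders, customers, large_key=1, small_key=0):
        print(joined)
```

The approach only pays off when the indexed relation fits comfortably in memory, which is why the small relation is the one that gets replicated.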
Data Analysis
Write and execute a Hive query
Define a Hive-managed table
Define a Hive external table
Define a partitioned Hive table
Define a bucketed Hive table
Define a Hive table from a select query
Define a Hive table that uses the ORCFile format
Create a new ORCFile table from the data in an existing non-ORCFile Hive table
Specify the storage format of a Hive table
Specify the delimiter of a Hive table
Load data into a Hive table from a local directory
Load data into a Hive table from an HDFS directory
Load data into a Hive table as the result of a query
Load a compressed data file into a Hive table
Update a row in a Hive table
Delete a row from a Hive table
Insert a new row into a Hive table
Join two Hive tables
Run a Hive query using Tez
Run a Hive query using vectorization
Output the execution plan for a Hive query
Use a subquery within a Hive query
Output data from a Hive query that is totally ordered across multiple reducers
Set a Hadoop or Hive configuration property from within a Hive query
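
To make a few of the Hive objectives above concrete, this sketch renders a short HiveQL script from Python: it sets configuration properties, defines a partitioned ORC table, loads it from a query, and asks for an execution plan. The table and column names (`sales_text`, `sales_orc`, `id`, `amount`, `region`) are placeholders. `hive.execution.engine` and `hive.vectorized.execution.enabled` are standard Hive properties, though their availability depends on the Hive version in use.

```python
def set_statements(props):
    """Render one SET statement per configuration property."""
    return [f"SET {key}={value};" for key, value in props.items()]


def build_script():
    """Return a HiveQL script as text; all table names are hypothetical."""
    lines = set_statements({
        "hive.execution.engine": "tez",               # run on Tez
        "hive.vectorized.execution.enabled": "true",  # vectorized execution
    })
    lines += [
        "CREATE TABLE IF NOT EXISTS sales_orc (id INT, amount DOUBLE)",
        "  PARTITIONED BY (region STRING)",
        "  STORED AS ORC;",
        "INSERT INTO TABLE sales_orc PARTITION (region = 'emea')",
        "  SELECT id, amount FROM sales_text WHERE region = 'emea';",
        "EXPLAIN SELECT region, SUM(amount) FROM sales_orc GROUP BY region;",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(build_script(), end="")
```

The output can be saved to a file and run with `hive -f`, assuming a source table shaped like the hypothetical `sales_text` exists.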

HDPCA Exam

Installation
Configure a local HDP repository
Install ambari-server and ambari-agent
Install HDP using the Ambari install wizard
Add a new node to an existing cluster
Decommission a node
Add an HDP service to a cluster using Ambari
Configuration
Define and deploy a rack topology script
Change the configuration of a service using Ambari
Configure the Capacity Scheduler
Create a home directory for a user and configure permissions
Configure the include and exclude DataNode files
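
For the rack topology objective above, a topology script can be as small as the sketch below. Hadoop calls the script named by the `net.topology.script.file.name` property with one or more host names or IP addresses as arguments and expects one rack path per argument on standard output. The addresses and rack names in the table are invented for this example.

```python
#!/usr/bin/env python3
"""Sketch of a rack topology script with a hard-coded, hypothetical mapping."""
import sys

# Hypothetical node-to-rack assignments; a real script might read a data file.
RACKS = {
    "10.0.1.11": "/dc1/rack1",
    "10.0.1.12": "/dc1/rack1",
    "10.0.2.21": "/dc1/rack2",
}
DEFAULT_RACK = "/default-rack"  # fallback for nodes not listed above


def resolve(hosts):
    """Return the rack path for each host, in the order the hosts were given."""
    return [RACKS.get(host, DEFAULT_RACK) for host in hosts]


if __name__ == "__main__":
    print("\n".join(resolve(sys.argv[1:])))
```

Returning a default rack for unknown hosts keeps the script from failing when a new node is added before the mapping is updated.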
Troubleshooting
Restart an HDP service
View an application’s log file
Configure and manage alerts
Troubleshoot a failed job
High Availability
Configure NameNode HA
Configure ResourceManager HA
Copy data between two clusters using distcp
Create a snapshot of an HDFS directory
Recover a snapshot
Configure HiveServer2 HA
Security
Install and configure Knox
Install and configure Ranger
Configure HDFS ACLs

HDPCD: Java Exam

Write a Hadoop MapReduce application in Java
Run a Java MapReduce application on a Hadoop cluster
Write and configure a Combiner for a MapReduce job
Write and configure a custom Partitioner for a MapReduce job
Sort the output of a MapReduce job
Write and configure a custom key class for a MapReduce job
Configure the input and output formats of a MapReduce job
Perform a join of two or more datasets
Many online examples and resources

Perform a map-side join of two datasets
Many online examples and resources
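
The roles of the combiner and the custom partitioner in the objectives above can be seen in a small simulation. The sketch below is plain Python, not the Hadoop Java API: it runs a word count over in-memory input splits, pre-aggregates each map task's output with a combiner, and routes keys to reducers with a custom partitioning rule. The function names and the first-letter partitioning rule are choices made for this illustration.

```python
from collections import Counter, defaultdict


def mapper(line):
    """Emit (word, 1) for every word on the line."""
    for word in line.lower().split():
        yield word, 1


def combiner(pairs):
    """Pre-aggregate one map task's output before it is shuffled."""
    counts = Counter()
    for word, n in pairs:
        counts[word] += n
    return counts.items()


def partitioner(key, num_reducers):
    """Custom rule: route by the key's first character (deterministic)."""
    return ord(key[0]) % num_reducers


def run_job(splits, num_reducers=2):
    """Simulate map, combine, shuffle, and reduce over a list of input splits."""
    shuffled = [defaultdict(list) for _ in range(num_reducers)]
    for split in splits:  # one simulated map task per split
        mapped = (pair for line in split for pair in mapper(line))
        for word, n in combiner(mapped):
            shuffled[partitioner(word, num_reducers)][word].append(n)
    # Each reducer sees its keys in sorted order and sums their values.
    return [{key: sum(values) for key, values in sorted(part.items())}
            for part in shuffled]


if __name__ == "__main__":
    for reducer_id, output in enumerate(run_job([["a b a"], ["b c"]])):
        print(reducer_id, output)
```

Summation works as a combiner because it is associative and commutative, so pre-aggregating on the map side does not change the final totals. In a real Java job the equivalent classes would be registered on the `Job` with `setCombinerClass` and `setPartitionerClass`.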

HDPCD: Spark Exam

Core Spark
Write a Spark Core application in Python or Scala
Initialize a Spark application
Run a Spark job on YARN
Create an RDD
Create an RDD from a file or directory in HDFS
Persist an RDD in memory or on disk
Perform Spark transformations on an RDD
Perform Spark actions on an RDD
Create and use broadcast variables and accumulators
Configure Spark properties
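
Two of the Core Spark objectives above, running a job on YARN and configuring Spark properties, come together on the `spark-submit` command line. The sketch below assembles such a command in Python. The application file, its arguments, and the property value are placeholders, and the `--master yarn --deploy-mode ...` form is assumed here; older Spark releases also accepted `yarn-client` and `yarn-cluster` as master values.

```python
def spark_submit(app, *app_args, deploy_mode="cluster", conf=None):
    """Build a spark-submit command that runs `app` on YARN."""
    if deploy_mode not in ("client", "cluster"):
        raise ValueError("deploy_mode must be 'client' or 'cluster'")
    cmd = ["spark-submit", "--master", "yarn", "--deploy-mode", deploy_mode]
    # Each Spark property becomes its own --conf key=value pair.
    for key, value in sorted((conf or {}).items()):
        cmd += ["--conf", f"{key}={value}"]
    return cmd + [app, *app_args]


if __name__ == "__main__":
    # Hypothetical application and HDFS paths
    print(" ".join(spark_submit(
        "wordcount.py", "/user/demo/in", "/user/demo/out",
        conf={"spark.executor.memory": "2g"},
    )))
```

Properties passed with `--conf` can also be set in the application's own configuration object or in a defaults file; the command line is simply the easiest place to vary them per run.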
Spark SQL
Create Spark DataFrames from an existing RDD
Perform operations on a DataFrame
Write a Spark SQL application
Use Hive with ORC from Spark SQL
Write a Spark SQL application that reads and writes data from Hive tables