newsletter

Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.

AVAILABLE NEWSLETTERS:

Sign up for the Developers Newsletter

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.

cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

* I understand I can unsubscribe at any time. I also acknowledge the additional information found in Hortonworks Privacy Policy.
closeClose button
cta

Certifications: Exam Objectives

Candidates should be able to perform all of the following objectives

HDP Certified Developer (HDPCD) Exam

Data Ingestion
Import data from a table in a relational database into HDFS
SQOOP-IMPORT
Import the results of a query from a relational database into HDFS
FREE-FORM QUERY IMPORTS
Import a table from a relational database into a new or existing Hive table
IMPORTING DATA INTO HIVE
Insert or update data from HDFS into a table in a relational database
SQOOP-EXPORT
Data Transformation
Write and execute a Pig script
LEARN MORE
Load data into a Pig relation without a schema
LEARN MORE
Load data into a Pig relation with a schema
LEARN MORE
Load data from a Hive table into a Pig relation
LEARN MORE
Use Pig to transform data into a specified format
LEARN MORE
Transform data to match a given Hive schema
LEARN MORE
Group the data of one or more Pig relations
LEARN MORE
Use Pig to remove records with null values from a relation
LEARN MORE
Store the data from a Pig relation into a folder in HDFS
LEARN MORE
Store the data from a Pig relation into a Hive table
LEARN MORE
Sort the output of a Pig relation
LEARN MORE
Remove the duplicate tuples of a Pig relation
LEARN MORE
Specify the number of reduce tasks for a Pig MapReduce job
LEARN MORE
Join two datasets using Pig
LEARN MORE
LEARN MORE
Perform a replicated join using Pig
LEARN MORE
Run a Pig job using Tez
LEARN MORE
Within a Pig script, register a JAR file of User Defined Functions
LEARN MORE
LEARN MORE
Within a Pig script, define an alias for a User Defined Function
LEARN MORE
Within a Pig script, invoke a User Defined Function
LEARN MORE
Data Analysis
Write and execute a Hive query
LEARN MORE
Define a Hive-managed table
LEARN MORE
Define a Hive external table
LEARN MORE
Define a partitioned Hive table
LEARN MORE
Define a bucketed Hive table
LEARN MORE
Define a Hive table from a select query
LEARN MORE
Define a Hive table that uses the ORCFile format
LEARN MORE
Create a new ORCFile table from the data in an existing non-ORCFile Hive table
LEARN MORE
Specify the storage format of a Hive table
LEARN MORE
Specify the delimiter of a Hive table
LEARN MORE
Load data into a Hive table from a local directory
LEARN MORE
Load data into a Hive table from an HDFS directory
LEARN MORE
Load data into a Hive table as the result of a query
LEARN MORE
Load a compressed data file into a Hive table
LEARN MORE
Update a row in a Hive table
LEARN MORE
Delete a row from a Hive table
LEARN MORE
Insert a new row into a Hive table
LEARN MORE
Join two Hive tables
LEARN MORE
Run a Hive query using Tez
LEARN MORE
Run a Hive query using vectorization
LEARN MORE
Output the execution plan for a Hive query
LEARN MORE
Use a subquery within a Hive query
LEARN MORE
Output data from a Hive query that is totally ordered across multiple reducers
LEARN MORE
Set a Hadoop or Hive configuration property from within a Hive query
LEARN MORE

HDP Certified Administrator (HDPCA) Exam

Installation
Configure a local HDP repository
LEARN MORE
Install ambari-server and ambari-agent
LEARN MORE
LEARN MORE
Install HDP using the Ambari install wizard
LEARN MORE
Add a new node to an existing cluster
LEARN MORE
Decommission a node
LEARN MORE
Add an HDP service to a cluster using Ambari
LEARN MORE
Configuration
Change the configuration of a service using Ambari
LEARN MORE
Configure the Capacity Scheduler
LEARN MORE
Create a home directory for a user and configure permissions
LEARN MORE
Troubleshooting
Restart an HDP service
LEARN MORE
View an application’s log file
LEARN MORE
LEARN MORE
Configure and manage alerts
LEARN MORE
Troubleshoot a failed job
LEARN MORE
High Availability
Configure NameNode HA
LEARN MORE
Configure ResourceManager HA
LEARN MORE
Copy data between two clusters using distcp
LEARN MORE
Create a snapshot of an HDFS directory
LEARN MORE
Recover a snapshot
LEARN MORE
Configure HiveServer2 HA
LEARN MORE
Security
Install and configure Knox
LEARN MORE
Install and configure Ranger
LEARN MORE

HDP Certified Spark Developer (HDPCSD) Exam

Core Spark
Write a Spark Core application in Python or Scala
LEARN MORE
Initialize a Spark application
LEARN MORE
Run a Spark job on YARN
LEARN MORE
Create an RDD
LEARN MORE
Create an RDD from a file or directory in HDFS
LEARN MORE
Persist an RDD in memory or on disk
LEARN MORE
Perform Spark transformations on an RDD such as filtering and aggregations
LEARN MORE
Perform Spark actions on an RDD
LEARN MORE
Create and use broadcast variables and accumulators
LEARN MORE
Configure Spark properties
LEARN MORE
Ingest data using SparkSession
Sort results and write out to HDFS or other supported destinations
Spark SQL
Create Spark DataFrames from an existing RDD
LEARN MORE
Perform operations on a DataFrame
LEARN MORE
Write a Spark SQL application
LEARN MORE
Use Hive with ORC from Spark SQL
LEARN MORE
Write a Spark SQL application that reads and writes data from Hive tables
LEARN MORE
Invoke SQL API or SparkSession SQL functionality to select and produce results
Using join capabilities produce analytic results
Rename DataFrame/Dataset columns to produce best results
Spark Streaming
Use Spark structured streaming to ingest data in real time
Invoke streaming transformations and aggregations to produce analytic results
Invoke spark-submit utility on existing Spark application using proper arguments