Tutorials for Hadoop with HDP 2.1: Hive, Tez, Falcon, Knox, Storm
If you’re excited to get started with the new features in Hortonworks Data Platform 2.1, we’ve included four tutorials for you to try out, Sandbox-style.
You can download the HDP 2.1 Technical Preview here, and then get stuck into these great tutorials.
Interactive Query with Apache Hive and Apache Tez
OK, so you’re not going to get huge performance out of a one-node VM, but you can try out Hive on Tez and see the performance gains versus MapReduce. You can also try out features such as Vectorized Query and a host of new SQL features. Get supercharged here.
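If you want a feel for what the comparison looks like before opening the tutorial, here is a minimal sketch in Python that builds a HiveQL script running the same query once on MapReduce and once on Tez. It assumes the standard Hive session properties `hive.execution.engine` and `hive.vectorized.execution.enabled`; the helper name `engine_comparison_script` and the table `my_table` are hypothetical placeholders, not something the tutorial ships with.

```python
# Sketch only: generates HiveQL text, it does not connect to Hive.
# `my_table` and the function name are hypothetical placeholders.

def engine_comparison_script(query, vectorized=True):
    """Return HiveQL that runs `query` once per execution engine."""
    statements = []
    flag = "true" if vectorized else "false"
    for engine in ("mr", "tez"):
        # hive.execution.engine selects the engine for the session
        statements.append(f"SET hive.execution.engine={engine};")
        # Vectorized execution is toggled by its own property
        statements.append(f"SET hive.vectorized.execution.enabled={flag};")
        # Normalize the query so it ends with exactly one semicolon
        statements.append(query.rstrip().rstrip(";") + ";")
    return "\n".join(statements)


if __name__ == "__main__":
    print(engine_comparison_script("SELECT COUNT(*) FROM my_table"))
```

You could paste the printed script into the Hive shell on the Sandbox and compare the elapsed times Hive reports for each run. Bear in mind that in this Hive release vectorized execution only applies to tables stored as ORC, so on other formats the second setting may make no difference.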
Defining and Processing Data Pipelines with Apache Falcon
Processing Stream data in near real-time with Apache Storm
Secure your Hadoop infrastructure with Apache Knox
With data flying around in all directions, it’s probably worth taking a look at Apache Knox to provide perimeter security for your cluster, even if it is just one node. Batten down the hatches here.
We hope you have some fun testing out the new features of HDP 2.1 with these tutorials, and that they provide the inspiration for your own production work. If you have any comments, let us know below, or in the forums. And if you’d like a Hortonworks elephant, be sure to add your own tutorial over here.
Try it with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.