Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.

cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
June 30, 2016
prev slideNext slide

Quickly Launch Hortonworks Data Platform in Amazon Web Services

Big data is changing the way enterprises interact with and consume data. Modern data platforms, such as Hortonworks Data Platform (HDP) and Hortonworks Data Flow (HDF), are driving a data revolution by powering new workloads and analytic applications.

This week, there are thousands of attendees in San Jose at Hadoop Summit 2016 learning about the technologies and business drivers that are transforming how the enterprise harnesses data. We also announced the latest innovations coming in Hortonworks Data Platform 2.5 and provided a look at the new and exciting technologies in the pipeline, such as Hive LLAP.

While Big Data and Hadoop have been eating the world, the Cloud has been steadfast at work enabling the enterprise to save time, save money and scale fast.

Separately, Big Data and Cloud are each creating new opportunities and efficiencies for the enterprise. But when used together, enterprises can realize business value and achieve insight into data more quickly and with greater flexibility than ever before. To make this combination achieve it’s full potential, Big Data and Cloud needs an experience that marries ease of use with infrastructure agility so that a user can get their analytics “tool of choice” in their hands exactly when they need (and want) it.

Introducing the Hortonworks Connected Data Cloud Technical Preview

To this end, we are introducing the “Hortonworks Connected Data Cloud” Technical Preview. This Technical Preview gives you a way to quickly spin up Apache Hive and Apache Spark clusters that are ready to run ephemeral workloads in your Amazon Web Services (AWS) environment.

Blog1

Using “Hortonworks Cloud for AWS”, you can create clusters by choosing from a set of prescriptive cluster configurations. It’s not meant for the infinite configuration possibilities that Hadoop provides. Instead, it’s about hiding those complexities under-the-hood so people can get Spark and Hive running quickly in their AWS environments to start modeling and analyzing data sets. When you are done your analysis, you can give back the resources to the Cloud just as easily as you got them.

How Do I Get Started?

Instructions on how to spin up the Technical Preview in your AWS environment are found here: http://hortonworks.github.io/hdp-aws/. The high-level points are:

  1. Start with your AWS account and launch the cloud controller into your AWS environment.
  2. Log into the cloud controller and start creating Hive or Spark clusters that are ready to use for analysis.
  3. Scale-up, scale-down and clone those clusters…and when done, terminate.

What Hive and Spark Cluster Configurations are Available?

The Technical Preview includes a prescriptive set of cluster configurations for Hive and Spark, from the most stable to the more experimental. If you are looking to grab HDP 2.4 with the Hive and Spark versions you know and love, go for it. Or if you are looking to explore the latest HDP innovations, try the with Spark 2.0-preview or Hive LLAP.

Blog2

Where Do I Get Help?

Once you start working with the Technical Preview, Hortonworks Community Connection is a great resource for help. Hortonworks cloud subject matter experts are moderating the “Cloud & Operations” Track for questions related to this Technical Preview. When asking a question related to this Technical Preview, be sure to select the “Cloud & Operations” Track and add the following tag: hortonworks-cloud.

Blog3

What’s Next?

We are excited for you to try the Technical Preview and look forward to seeing your feedback on the Hortonworks Community Connection. And if you want to see the Technical Preview in action, you still have time to attend the Hadoop Summit 2016 Keynote on Thursday, June 30 where Sanjay Radia will discuss “The Elephant in the Clouds” and the technical underpinnings aimed at enabling Hadoop in the Cloud.

Happy Hadooping!

Comments

    • Hi Vikram,
      Microsoft Azure HDInsight is our Premier Hadoop Managed Platform Service offering in the market. HDInsight is for users that want a Managed Hadoop-as-a-Service using the best of breed Hadoop distribution from Hortonworks. In the recent Forrester Wave for Big Data Hadoop Cloud Solutions, Azure HDInsight was rated higher than the other solutions in almost every category.

      The AWS Technical Preview is not a hosted or managed offering. It’s an Infrastructure-as-a-Service offering (IaaS) that you can launch and run in your AWS environment. The focus of the AWS offering is to provide a simple experience aimed initially at Apache Hive and Apache Spark ephemeral cluster use cases. It is not as broad or feature-rich as HDInsight.

      Since the AWS Technical Preview is just a Technical Preview, support is not available. Please try it. We would like to get your feedback!

  • This is great news and quite an interesting read. Thank you for explaining the features and benefits of using Hortonworks on Amazon Web Services. This article helped get a better understanding of how these platforms can be used together. We hope to see it improve more in the future!

  • I wasted three hours of very valuable time trying to find out that you need to have two subscriptions from amazon market in order for this tutorial to work for Hortonworks cloud controller and hortonworks hdp services

  • Leave a Reply

    Your email address will not be published. Required fields are marked *