cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
February 23, 2017
prev slideNext slide

Accelerating Time to Market with Hortonworks Data Cloud

With the introduction of the Hortonworks Data Cloud (HDCloud), deploying clusters and starting to process data has become an order of magnitude faster. When Apache Hadoop evolved from being an on premise solution to a cloud based solution, the time it took to make a cluster went from weeks to days. The same magnitude of improvements has happened with HDCloud. All of the complex technical setup and management is transparently handled by HDCloud.

One of the first major use cases for HDCloud I became involved with was with a global retailer’s business intelligence team. This retailer needed to analyze terabytes of data but had no experience doing so. When they asked about the traditional solutions in the Hadoop ecosystem, they encountered a serious challenge: their IT teams had no internalized Hadoop knowledge and were unable to start the initiative without increasing headcount or attending additional training. This was a non-starter because to even begin to prove value, large budgets needed to be set aside and any project would be delayed by months. This is a classic chicken-and-egg situation.

So to eliminate this challenge, we decided to leverage HDCloud. I’m pleased to report that after only a few hours with our team, real data was loaded into a running cluster that was ready to process with Apache Hive. One of the major time-savings was that the same table DDLs were able to be exported from the legacy SQL database straight into Hive. Being able to leverage existing SQL assets brought an added benefit of eliminating a tremendous amount of technical risk.

At the end of our engagement, we were at a point that an experienced database administrator with no specific Hadoop skills was able to manipulate the data with ease. Overall, we found by using HDCloud and this extremely agile deployment strategy enabled teams to begin their big data journey with almost no barrier to entry. Plus, as the team began to gain confidence, Hortonworks is able to help them become power users and get the best performance out of the system.

Overall, Hortonworks Data Cloud provided me and this retailer the following advantages:

  • Pre-configured and optimized templates that match common use cases such as Enterprise Data Warehousing and Data Science. These configurations take all the guesswork out doing an initial deployment.
  • A shortened list of suggested cloud instance types to deploy on. The cloud is a general-purpose utility but specific node types are optimized for Hadoop. We help guide you to that selection.
  • With an increase in digital attacks targeting companies, a robust but easy to deploy security solution is critical. At the most simplistic level, HDCloud exposed a minimal set of cluster ports that you can configure to only allow connections from your home office or IP range.
  • OS level dependencies, networking and hostnames are managed through automatic procedures.
  • The ability to create multiple isolated environments for dev, test, and production all from the same management console.
  • Elastic sizing of your cluster so you can start small and effortlessly add more nodes so you do not have to plan capacity for the future years just to get started.

I believe the combination of ease of use, rapid deployment, and scalability will change the way companies can explore and process data, which translates how they go to market over the coming years.

If you are ready to get started with Hortonworks Data Cloud for AWS, go here for a 5 day free trial from the AWS Marketplace listing. For more information, please refer to the following links.

Product Webpage https://hortonworks.com/products/cloud/aws
Product Documentation http://docs.hortonworks.com/HDPDocuments/HDCloudAWS/HDCloudAWS-1.11.0/index.html
“How To Get Started” Webinar https://hortonworks.com/webinar/hadoop-in-the-cloud-aws/

Leave a Reply

Your email address will not be published. Required fields are marked *