Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.


Get Started


Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
Open Source Projects




A tool for provisioning and managing Apache Hadoop clusters in the cloud

Cloudbreak, as part of the Hortonworks Data Platform, makes it easy to provision, configure and elastically grow HDP clusters on cloud infrastructure. Cloudbreak can be used to provision Hadoop across cloud infrastructure providers including Amazon Web Services, Microsoft Azure, Google Cloud Platform and OpenStack.

What Cloudbreak Does

Cloudbreak is a tool for provisioning Hadoop clusters on cloud infrastructure such as Amazon Web Services, Microsoft Azure. As part of the Hortonworks Data Platform and powered by Apache Ambari, Cloudbreak allows enterprises to simplify the provisioning of clusters in the cloud and optimize the use of cloud resources with elastic scaling. With Cloudbreak. Hadoop operators get the following core benefits:

  • Simplified Cluster Provisioning. Dynamically provision and configure clusters on the cloud. With Ambari Blueprints, build the clusters you need in a consistent, repeatable fashion.
  • Automated Cluster Scaling. Optimize cloud resource usage by seamlessly adjusting the cluster as workload and activity changes. Allows you to respond faster to new business requirements.
  • Choice of Clouds. Supports Amazon Web Services, Microsoft Azure, Google Cloud Platform and OpenStack. You can extend even further with the cloud provider “plug-in” model.
  • Highly Extensible. Recipes for scripting extensions that run before/after cluster provisioning. The Cloudbreak Command Line Interface (CLI) and REST API are ideal for automation.

How Cloudbreak Works

Cloudbreak launches Hortonworks Data Platform clusters on the cloud in 3 easy steps:

  1. Pick a Blueprint: Cloudbreak uses Ambari Blueprints to have declarative Hadoop cluster definition. Blueprints can be designed for specialized applications and workloads (such as Data Science or IoT Apps). Cloudbreak includes a few default Blueprints for common cluster configurations but you can always upload your own Blueprint to build the cluster just the way you like it.
  2. Choose a Cloud: Cloudbreak is configured to work with cloud infrastructure resources (such as servers, network setup and security options). Choose the cloud infrastructure you want to use for the cluster.
  3. Launch HDP: In this step, Cloudbreak obtains the chosen cloud infrastructure platform, installs Apache Ambari and applies the desired Blueprint. The result: your cluster is launched and ready to go!

Internally, Cloudbreak is built on the foundation of cloud providers APIs (Amazon Web Services, Microsoft Azure, Google Cloud Platform, OpenStack), Apache Ambari, Docker containers, Swarm and Consul. 

Collaboration and Focus

Hortonworks is focused on going to market with a 100% open source solution. This focus allows us to collectively provide the product management guidance for Enterprise Grade Hadoop to mainstream enterprises, our partner ecosystem, and further innovate the core of Hadoop.

  • Open. Deliver a complete set of features for Hadoop cloud deployment, in the public and with the community, by defining the operational framework and lifecycle.
  • Flexible. Support a wider array of cloud providers with a common set of API’s to deploy Hadoop.
  • Integrated. Ensure that Hadoop cloud deployment can be integrated with existing IT tools, behind a single pane of glass, by providing Recipes, a CLI and a REST API.

Given our strong open source heritage, we believe Hortonworks is uniquely qualified to ensure that the Cloudbreak technologies continue to flourish in the open. Our strategy is squarely focused on a 100% open-source model with no proprietary extensions. With this approach, we are never conflicted about which capabilities, features, or components to incorporate. We listen to our customers’ and partners’ requirements and work together with them in the open to deliver the best the community has to offer.

Recent Improvements

Cloudbreak, which is part of Hortonworks Data Platform, serves as the unifying system for enterprises looking to easily and securely provision HDP workloads on cloud infrastructure. It simplifies the experience of provisioning and managing cloud resources together with Hadoop environments. This enables the IT Operator to deliver data and processing with agility and flexibility while optimizing their use of cloud resources.

Customizable Provisioning

Cloudbreak supports the use of “Recipes” to execute custom scripts before and after cluster provisioning. When Cloudbreak goes to provision the cluster, it can bring in “recipes” to run pre and post cluster creation to help prepare the machines.

Support for OpenStack

Cloudbreak add supports OpenStack Juno and Kilo. Also available is a Service Provider Interface (SPI) for plugging in new cloud providers.

Recent Cloudbreak Releases

Get started today and download Cloudbreak

Cloudbreak Version Notable Enhancements
  • Support for HDP 2.6
  • Azure support for Private IPs
  • Ability to designate Host Group for Ambari Server
  • Azure support for Canada East and Canada Central Regions
  • Adds support for OpenStack Juno and Kilo
  • Recipes for custom before/after cluster provisioning scripts
  • Service Provider Interface (SPI) for cloud “plugin” model
  • First release of Cloudbreak
  • Support for Amazon Web Services, Microsoft Azure and Google Cloud Platform
  • Support for Hortonworks Data Platform 2.3


Cloudbreak in our Blog

Webinars & Presentations