Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.

Sign up for the Developers Newsletter

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.


Get Started


Ready to Get Started?

Download sandbox

How can we help you?

* I understand I can unsubscribe at any time. I also acknowledge the additional information found in Hortonworks Privacy Policy.
closeClose button
Open Source Projects




Cloudbreak simplifies the deployment of Hortonworks platforms in cloud environments such as Amazon Web Services, Microsoft Azure and Google Cloud Platform. Cloudbreak enables the enterprise to quickly run big data workloads in the cloud while optimizing the use of cloud resources.

What Cloudbreak Does

With Cloudbreak, the Big Data Platform Owner will get the following core benefits:

  • Simplified Cluster Provisioning. Dynamically provision and configure clusters on the cloud. With Ambari Blueprints, build the clusters you need in a consistent, repeatable fashion.
  • Automated Cluster Scaling. Optimize cloud resource usage by seamlessly adjusting the cluster as workload and activity changes. Allows you to respond faster to new business requirements.
  • Choice of Clouds. Supports Amazon Web Services, Microsoft Azure, Google Cloud Platform and OpenStack.
  • DevOps: Automate deployment using the integrated Command Line Interface (CLI) and REST API.

How Cloudbreak Works

Cloudbreak launches clusters on the cloud in 3 easy steps:

  1. Pick a Blueprint: Cloudbreak uses Ambari Blueprints to have declarative Hadoop cluster definition. Blueprints can be designed for specialized applications and workloads (such as Data Science or IoT Apps). Cloudbreak includes a few default Blueprints for common cluster configurations but you can always upload your own Blueprint to build the cluster just the way you like it.
  2. Choose a Cloud: Cloudbreak is configured to work with cloud infrastructure resources (such as servers, network setup and security options). Choose the cloud infrastructure you want to use for the cluster.
  3. Launch Cluster: In this step, Cloudbreak obtains the chosen cloud infrastructure platform, installs Apache Ambari and applies the desired Blueprint. The result: your cluster is launched and ready to go!

Internally, Cloudbreak is built on the foundation of cloud providers APIs (Amazon Web Services, Microsoft Azure, Google Cloud Platform, OpenStack), Apache Ambari, Docker containers, Swarm and Consul. 

Collaboration and Focus

Hortonworks is focused on going to market with a 100% open source solution. This focus allows us to collectively provide the product management guidance for Enterprise Grade Hadoop to mainstream enterprises, our partner ecosystem, and further innovate the core of Hadoop.

  • Open. Deliver a complete set of features for Hadoop cloud deployment, in the public and with the community, by defining the operational framework and lifecycle.
  • Flexible. Support a wider array of cloud providers with a common set of API’s to deploy Hadoop.
  • Integrated. Ensure that Hadoop cloud deployment can be integrated with existing IT tools, behind a single pane of glass, by providing Recipes, a CLI and a REST API.

Given our strong open source heritage, we believe Hortonworks is uniquely qualified to ensure that the Cloudbreak technologies continue to flourish in the open. Our strategy is squarely focused on a 100% open-source model with no proprietary extensions. With this approach, we are never conflicted about which capabilities, features, or components to incorporate. We listen to our customers’ and partners’ requirements and work together with them in the open to deliver the best the community has to offer.

Recent Improvements

Some recent improvements to Cloudbreak include:

Custom Images

Cloudbreak includes a set of standard default images as the “base” for launching clusters in the cloud. These images help enable a simple out-of-the-box experience but the typical enterprise requires customization of these images. Cloudbreak allows you to “bring your own image” so you can customize to meet your enterprise requirements.


Authenticating and establishing a user’s identity is the basis for secure access to a cluster. Users need to be able to reliably “identify” themselves and then have that identity propagated throughout the cluster. Kerberos is the foundational technology to enable this authentication. Cloudbreak enables you to create clusters that are enabled with Kerberos.

Recent Cloudbreak Releases

Get started today and download Cloudbreak

Cloudbreak Version Notable Enhancements
  • Custom Images
  • Kerberos
  • New Recipe Types
  • New and Simplified UI
  • Support for HDP 2.6
  • Azure support for Private IPs
  • Ability to designate Host Group for Ambari Server
  • Azure support for Canada East and Canada Central Regions
  • Adds support for OpenStack Juno and Kilo
  • Recipes for custom before/after cluster provisioning scripts
  • Service Provider Interface (SPI) for cloud “plugin” model
  • First release of Cloudbreak
  • Support for Amazon Web Services, Microsoft Azure and Google Cloud Platform
  • Support for Hortonworks Data Platform 2.3



Cloudbreak in our Blog

Webinars & Presentations