HDP on Linux – Installation Forum

Set up 2 node cluster using HDP

  • #51182
    Vidy G

    I am trying to set up a two node cluster using HDP 2.0 sandbox. I believe we need to use two different VM or physical machines to set up a 2 node cluster. Is it correct?

    I set up a HDP sandbox VM and cloned it to create a second VM. I enabled Ambari in sandbox 1 to configure the sandbox2 as the second node in the cluster. But Ambari failed to register the second sandbox. The log file said issues with host name. I tried to modify host-name of second VM with no luck. Has anyone tried this before? If so what will be a simple way of setting up a 2 node cluster of HDP ?

to create new topics or reply. | New User Registration

  • Author
  • #51204

    Hi Vidy,

    You should build a 2 node cluster using Ambari.
    Trying to connect 2 sandboxes together is not recommended as they contain ALL HDP components.
    Sandbox is designed to work as a standalone test system.

    Please review the ambari install documentation on how to setup a 2 node cluster – it will be quicker than trying to get the 2 sandboxes working – even if it is possible as Ambari is configured on both.



    Jem Matriano

    Hi Vidy,

    Try ro check the ip-address and mac-address of you cloned vm.

    Vidy G

    Thanks for the response Dave. Does that mean I can set up a 2-node cluster of HDP 2.x via Ambari ?
    I don;t see any specific instructions on that in Ambari configuration help page. Could you please point me to the right documentation?

    Vidy G

    My actual question is “Does that mean I can set up a 2-node cluster of HDP 2.x (via Ambari) on a single HDP sandbox VM ?”

You must be to reply to this topic. | Create Account

Support from the Experts

A HDP Support Subscription connects you experts with deep experience running Apache Hadoop in production, at-scale on the most demanding workloads.

Enterprise Support »

Become HDP Certified

Real world training designed by the core architects of Hadoop. Scenario-based training courses are available in-classroom or online from anywhere in the world

Training »

Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.