Windows only or extension to Linux


This topic contains 3 replies, has 3 voices, and was last updated by Larry Liu 2 years, 5 months ago.

  • Creator
    Topic
  • #16067

    I reviewed the MSI parameters and noticed that NAMENODE_HOST is a mandatory field. How does this work if no Hadoop instances (namenode, jobtracker) have been configured yet? Do I first need to set up a namenode and jobtracker instance and then use the MSI for the remaining tasktracker and datanode machines? Or can the MSI also be used to configure the first instances? The manual doesn't say anything about this.

Viewing 3 replies - 1 through 3 (of 3 total)


  • Author
    Replies
  • #16251

    Larry Liu
    Moderator

    Hi, Piethein

    You are right about the clusterproperties.txt. Once the installation is completed, each node knows the configuration. The purpose of clusterproperties.txt is to tell the installer the layout of the cluster and configure each node accordingly.

    Thanks

    Larry
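
To make the idea in this reply concrete, here is a minimal Python sketch of how one shared layout file could yield a different role on each node. It is an illustration only, not the installer's actual logic: only NAMENODE_HOST appears in this thread, so the other keys (JOBTRACKER_HOST, SLAVE_HOSTS), the host names, and the helper functions are assumptions made for the example.

```python
def parse_cluster_properties(text):
    """Parse KEY=VALUE lines, skipping blank lines and '#' comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        props[key.strip()] = value.strip()
    return props


def roles_for_host(props, hostname):
    """Return the roles the layout assigns to `hostname` (case-insensitive)."""
    host = hostname.lower()
    roles = []
    if props.get("NAMENODE_HOST", "").lower() == host:
        roles.append("namenode")
    # JOBTRACKER_HOST and SLAVE_HOSTS are assumed key names for this sketch.
    if props.get("JOBTRACKER_HOST", "").lower() == host:
        roles.append("jobtracker")
    slaves = [h.strip().lower()
              for h in props.get("SLAVE_HOSTS", "").split(",") if h.strip()]
    if host in slaves:
        roles.extend(["datanode", "tasktracker"])
    return roles


# Hypothetical layout; every host name below is a placeholder.
SAMPLE = """\
# Same file is copied to every machine before running the installer
NAMENODE_HOST=master1.example.com
JOBTRACKER_HOST=master1.example.com
SLAVE_HOSTS=worker1.example.com, worker2.example.com
"""

if __name__ == "__main__":
    props = parse_cluster_properties(SAMPLE)
    for host in ("master1.example.com", "worker2.example.com"):
        print(host, "->", roles_for_host(props, host))
```

The point of the sketch is that the file is written once and is identical on every machine; each node only compares its own name against the entries to find out which roles apply to it.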

    #16241

    @tedr

    Thanks for your answer. So how does it work when I want to install a namenode host and multiple datanodes? Do I configure clusterproperties.txt once and then run the MSI on all machines?

    Is it correct that the MSI reads clusterproperties.txt and then decides, based on the server name and the information in that file, whether to configure the machine as a namenode or a datanode, for instance?

    #16092

    tedr
    Member

    Hi Piethein,

    Thanks for looking at HDP for Windows.

    The NAMENODE_HOST variable is not meant to point at an existing Hadoop namenode instance; instead, it should point at the host you want to become the namenode during cluster installation.

    Thanks again,
    Ted.
