- PREPARE: Confirm your system meets the Requirements, you have Prepared the cluster and performed the Pre-Flight Operations.
- DOWNLOAD: Download and Install the software.
- PROVISION: Use the wizard to Provision Your Cluster.
- LEARN: Browse Additional Resources to learn more.
| Component | Platforms |
|---|---|
| Hortonworks Management Center (HMC) (1) | Operating Systems RHEL / CentOS 5 (64-bit) RHEL / CentOS 6 (64-bit) Web Browsers |
| Hadoop Components | Operating Systems (2) RHEL / CentOS 5 (64-bit) RHEL / CentOS 6 (64-bit) |
- The Hortonworks Management Center runs on the HMC Server.
- To install the Hadoop Components on different Operating Systems (such as SUSE Linux Enterprise Server), use gsInstaller or one of the Manual Install options.
In order to prepare your cluster for HDP, you will need to perform steps on each host that will be part of your cluster, as well as prepare the entire cluster to accept installation of the HDP software. This section provides information on those configurations.
Perform the following steps on the HMC Server and each host you plan to include as part of your cluster.
- Confirm the Fully Qualified Domain Name (FQDN) for each host using the command
hostname -f.If deploying your cluster to Amazon EC2, be sure to use the Private DNS host name.
- Confirm each host has Internet access via HTTP, HTTPS and FTP. When performing the HDP install, each host in the cluster will access the Internet to obtain software packages required for installation.
If your hosts will use a proxy to access the Internet, configure each host machine to use an Internet proxy. Check with your IT or network team for these settings.
If you do not have Internet access available to your cluster hosts refer to the Hortonworks Documentation on how to setup a Local Mirror Repository.
- Remove or disable any existing Puppet agent configurations. HDP performs the software installation (and ongoing cluster management) using Puppet. With HDP, the HMC Server is the Puppet master and each host in your cluster acts as a Puppet Agent.
- Disable SELinux
- Enable NTP on the cluster to synchronize the clocks across the hosts.
- Prepare Password-less SSH Login for
rootuser between the HMC Server and each host in the cluster. This enables the HMC Server to reach each host in the cluster via SSH without prompting for a password.Password-less SSH Login is required for the HMC Server to access each host in the cluster and install the necessary software components. For more information, please refer to the Hortonworks Documentation.
Confirm the HMC Server can SSH to itself without prompting for a password. This can be done using the
ssh root@localhostcommand. - Check the dependencies on each host in the cluster using the
yum info [dependency]command. Confirm the following are either not installed, or if installed, they are these versions based on your operating system.
Name Dependency Version-Release Ruby ruby[RHEL + CentOS 5]
1.8.5-24.el5
[RHEL + CentOS 6]
1.8.7.*.el6Puppet puppet[RHEL + CentOS 5]
2.7.9-2
[RHEL + CentOS 6]
2.7.9-2.el6Ruby Rack rubygem-rack[RHEL + CentOS 5]
1.1.0-2.el5
[RHEL + CentOS 6]
1.1.0-2.el6Ruby Passenger rubygem-passenger[RHEL + CentOS 5]
3.0.12-1.el5.centos
[RHEL + CentOS 6]
3.0.12-1.el6Nagios nagios[RHEL + CentOS 5]
3.0.12-1.el5.centos
[RHEL + CentOS 6]
3.2.3-2.el6Nagios Plugins nagios-plugins[RHEL + CentOS 5]
1.4.15-2.el5
[RHEL + CentOS 6]
1.4.9-1Nagios Common nagios-common[RHEL + CentOS 5]
2.12-10.el5
[RHEL + CentOS 6]
2.12-10.el6MySQL mysql[RHEL + CentOS 5 + 6]
5.*Ganglia ganglia-gmond
ganglia-gmetad[RHEL + CentOS 5 + 6]
3.2.0
Complete the Preparing Your Cluster steps above and confirm you have the following handy before you install and start HMC:
| Check | Operation | Description |
|---|---|---|
| SSH Private Key | Obtain the SSH Private Key (typically id_rsa) to use during the installation. Refer to the Hortonworks Documentation for more information on how the Private Key is used during cluster provisioning. |
|
| Host names text file | Create a text file of host names that will be part of your cluster. This file should contain a list of target host names, separated by newline, for the cluster. Refer to the Preparing Your Cluster section for more information on obtaining the hostname for each host in your cluster.
The host name should be the FQDN for the host, not the IP address. For more information, refer to the Hortonworks Documentation. |
- On the server you plan to use to host HMC, download and install the appropriate RPM based on your HMC Server platform.
HMC Server Platform HDP RPM RHEL / CentOS 5 (64-bit) rpm -Uvh http://public-repo-1.hortonworks.com/HDP-1.1.1.16/repos/centos5/hdp-release-1.1.1.16-1.el5.noarch.rpm RHEL / CentOS 6 (64-bit) rpm -Uvh http://public-repo-1.hortonworks.com/HDP-1.1.1.16/repos/centos6/hdp-release-1.1.1.16-1.el6.noarch.rpm - Install Extra Packages for Enterprise Linux (EPEL) with the following:
yum install epel-release
- ** RHEL / CentOS 5 HMC Server installs only **. Install the “PHP Extension Community Library for JSON” with the following:
yum install php-pecl-json
- Install HMC using the following:
yum install hmc
- Confirm HMC is installed by querying
hmcfrom the RPM list with the following:rpm -qa | grep hmc
- Start the HMC service. You will be prompted to agree to the Oracle Java License and download the binaries:
service hmc start
- Stop the firewall with the following:
/etc/init.d/iptables stop
- Proceed to Provisioning Your Cluster.
- Be sure you have performed steps in the Pre-Flight Operations Checklist and confirm you have your Host names text file and HMC Server SSH Private Key file handy.
- Browse to HMC start page:
http://{your.hmc.server}/hmc/html - Click the “Get Started” button.
- Follow the wizard instructions to provision your cluster.
Learn more about our products, Hadoop and participate in the community with the following resources:
- Hortonworks Documentation
- Knowledgebase
- Community Forums
- Apache Hadoop Training Courses (public, private and online courses)
- Expert Technical Support