Home Forums HDP on Linux – Installation Ambari Cluster Provisioning Error – Server at https://HDPMachine1:8440 is not re

This topic contains 13 replies, has 5 voices, and was last updated by  Seth Lyubich 1 year, 8 months ago.

  • Creator
    Topic
  • #14451

    suaroman
    Participant

    I am attempting to use Ambari to deploy\provision a cluster.
    From the Ambari server machine I enter a target host called hdpmaster.
    I use Browse to find the private key located on the Ambari server, then select Register and Confirm green button.
    I have already deployed Ambari client to the target machine ( hdpmaster ) and verified its running.

    When I select ‘Register and Confirm’ button, I’m taken to the Confirm Hosts . After a while, the progress bar turns red and the status is failed.

    Registration log for hdpmaster (error message)

    erifying Python version compatibility…
    Using python /usr/bin/python2.6
    Checking for previously running Ambari Agent…
    ERROR: ambari-agent already running
    Check /var/run/ambari-agent/ambari-agent.pid for PID.
    (‘hostname: ok HDPMaster
    ip: ok 127.0.0.1 192.168.0.15
    cpu: ok Intel(R) Core(TM) i7-2640M CPU @ 2.80GHz
    memory: ok 1.47172 GB
    disks: ok
    Filesystem Size Used Avail Use% Mounted on
    /dev/mapper/vg_hdpmaster-lv_root
    36G 4.0G 31G 12% /
    tmpfs 754M 276K 754M 1% /dev/shm
    /dev/sda1 485M 55M 406M 12% /boot
    os: ok CentOS release 6.3 (Final)
    iptables: ok
    Chain INPUT (policy ACCEPT 0 packets, 0 bytes)
    pkts bytes target prot opt in out source destination
    5563 3527K ACCEPT all — * * 0.0.0.0/0 0.0.0.0/0 state RELATED,ESTABLISHED
    2 168 ACCEPT icmp — * * 0.0.0.0/0 0.0.0.0/0
    6 540 ACCEPT all — lo * 0.0.0.0/0 0.0.0.0/0
    22 1144 ACCEPT tcp — * * 0.0.0.0/0 0.0.0.0/0 state NEW tcp dpt:22
    1296 127K REJECT all — * * 0.0.0.0/0 0.0.0.0/0 reject-with icmp-host-prohibited

    Chain FORWARD (policy ACCEPT 0 packets, 0 bytes)
    pkts bytes target prot opt in out source destination
    0 0 REJECT all — * * 0.0.0.0/0 0.0.0.0/0 reject-with icmp-host-prohibited

    Chain OUTPUT (policy ACCEPT 6601 packets, 514K bytes)
    pkts bytes target prot opt in out source destination
    selinux: ok SELINUX=enforcing
    SELINUXTYPE=targeted
    yum: ok yum-3.2.29-30.el6.centos.noarch
    rpm: ok rpm-4.8.0-27.el6.x86_64
    openssl: ok openssl-1.0.0-25.el6_3.1.x86_64
    curl: ok curl-7.19.7-26.el6_2.4.x86_64
    wget: ok wget-1.12-1.4.el6.x86_64
    net-snmp: ok net-snmp-5.5-41.el6_3.1.x86_64
    net-snmp-utils: UNAVAILABLE
    ntpd: UNAVAILABLE
    ruby: UNAVAILABLE
    puppet: UNAVAILABLE
    nagios: UNAVAILABLE
    ganglia: UNAVAILABLE
    passenger: UNAVAILABLE
    hadoop: UNAVAILABLE
    yum_repos: ok
    AMBARI-1.x Ambari 1.x 6
    HDP-UTILS-1.1.0.15 Hortonworks Data Platform Utils Version – HDP-UTILS-1. 52
    zypper_repos: UNAVAILABLE
    ‘, None)
    (‘INFO 2013-01-26 20:13:16,966 NetUtil.py:77 – Server at https://HDPMachine1:8440 is not reachable, s

Viewing 13 replies - 1 through 13 (of 13 total)

You must be logged in to reply to this topic.

  • Author
    Replies
  • #14731

    Seth Lyubich
    Keymaster

    Hi Shrek,

    From your output below it appears that HDPMachine1 is not reachable. Can you please make sure that you can ping the machine from all hosts in the cluster and that name resolves to correct IP (also from all nodes in the cluster)? Also you can check what command ‘hostname -f’ resolves this hostaname to. Is it HDPMachine1, or some other name.

    Since other issue on this post is resolved, please start new post if you are not able to resolve your issue with suggestions above.

    Hope this helps and thanks for using HDP,

    Thanks,
    Seth

    Collapse
    #14724

    Shrek Fan
    Member

    Hi,
    I have encounter the same issue,and I try many method to resolve ,but can’t work.
    If there are some method to work through this issue, or someone have resolve ?

    I just use a single computer and have /etc/ambari-agent/ambari-agent.ini to make sure the server host can reachable.

    some time my failed log is like below:
    INFO 2013-01-28 06:50:38,783 Controller.py:91 – Registered with the server with {u\’response\’: u\’OK\’,
    u\’responseId\’: 0,
    u\’responseStatus\’: u\’OK\’,
    u\’statusCommands\’: []}
    INFO 2013-01-28 06:50:38,787 Controller.py:96 – Got status commands on registration []
    INFO 2013-01-28 06:50:38,787 Controller.py:116 – No commands from the server : []
    INFO 2013-01-28 06:50:38,787 Controller.py:224 – Response from server = OK
    STDERR
    Connection to hdpmaster closed.
    Registering with the server…
    Registration with the server failed.

    and some times like :

    INFO 2013-01-26 20:14:29,978 NetUtil.py:58 – Failed to connect to https://HDPMachine1:8440/cert/ca due to [Errno 110] Connection timed out

    But all the end of log is :

    Connection to hdpmaster closed.
    Registering with the server…
    Registration with the server failed.

    Collapse
    #14512

    tedr
    Member

    Hi Sauroman,

    Thanks for letting us know.

    Ted.

    Collapse
    #14507

    suaroman
    Participant

    Thanks. Fixing the case of the hostnames corrected the problem.

    Collapse
    #14501

    tedr
    Member

    Hi Sauroman,

    No problem. It’s a little thing I miss quite a bit having come from the non-case sensitive windows world.

    Thanks,
    Ted.

    Collapse
    #14497

    suaroman
    Participant

    Oh geez… Thanks for pointing out the host name case sensitivity.
    Let me fix this . Thx

    Collapse
    #14491

    tedr
    Member

    Hi Sauromon,

    When I look at your logs and replies in this thread I see 3 different hostnames (hdpmaster, HDPMaster, & HDPMachine1). You need to make sure you have in the list of hostnames to install on the hostnames that you get when you run "hostname – f" on each of the nodes. Also remember that this host name is case sensitive, thus the first two names are different.

    Thanks,
    Ted.

    Collapse
    #14479

    suaroman
    Participant

    (3rd part )

    “netmask”: “255.255.255.0″, “uptime_days”: “0″, “uniqueid”: “a8c00b00″, “kernelrelease”: “2.6.32-279.19.1.el6.x86_64″, “path”: “/usr/lib/ambari-agent/lib/ruby-1.8.7-p370/bin:/usr/lib/ambari-server/*:/usr/lib64/qt-3.3/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin”, “ipaddress”: “192.168.0.11″, “lsbdistdescription”: “CentOS release 6.3 (Final)”, “manufacturer”: “VMware, Inc.”, “kernelversion”: “2.6.32″, “macaddress”: “00:0C:29:F9:50:84″, “operatingsystemrelease”: “6.3″, “boardserialnumber”: “None”, “processor0″: “Intel(R) Core(TM) i7-2640M CPU @ 2.80GHz”, “uptime_seconds”: “628″, “network_eth0″: “192.168.0.0″, “uptime_hours”: “0″, “lsbdistcodename”: “Final”, “productname”: “VMware Virtual Platform”, “lsbdistrelease”: “6.3″, “ipaddress_eth0″: “192.168.0.11″, “netmask_eth0″: “255.255.255.0″, “mounts”: [{"available": "31149524", "used": "4647336", "percent": "13%", "device": "/dev/mapper/vg_hdpmaster-lv_root", "mountpoint": "/", "type": "ext4", "size": "37712556"}, {"available": "771340", "used": "264", "percent": "1%", "device": "tmpfs", "mountpoint": "/dev/shm", "type": "tmpfs", "size": "771604"}, {"available": "414769", "used": "55475", "percent": "12%", "device": "/dev/sda1", "mountpoint": "/boot", "type": "ext4", "size": "495844"}]}, “timestamp”: 1359373835337, “hostname”: “HDPMaster”, “responseId”: -1, “publicHostname”: “HDPMaster”}\’
    INFO 2013-01-28 06:50:38,343 security.py:48 – SSL Connect being called.. connecting to the server
    INFO 2013-01-28 06:50:38,783 Controller.py:91 – Registered with the server with {u\’response\’: u\’OK\’,
    u\’responseId\’: 0,
    u\’responseStatus\’: u\’OK\’,
    u\’statusCommands\’: []}
    INFO 2013-01-28 06:50:38,787 Controller.py:96 – Got status commands on registration []
    INFO 2013-01-28 06:50:38,787 Controller.py:116 – No commands from the server : []
    INFO 2013-01-28 06:50:38,787 Controller.py:224 – Response from server = OK
    ‘, None)

    STDERR
    Connection to hdpmaster closed.
    Registering with the server…
    Registration with the server failed.

    Collapse
    #14478

    suaroman
    Participant

    (2nd part of log )

    INFO 2013-01-28 06:50:38,343 Controller.py:87 – Registering with the server \’{“hardwareProfile”: {“lsbrelease”: “:core-4.0-amd64:core-4.0-noarch:graphics-4.0-amd64:graphics-4.0-noarch:printing-4.0-amd64:printing-4.0-noarch”, “kernel”: “Linux”, “ipaddress_lo”: “127.0.0.1″, “memoryfree”: 1237319, “memorytotal”: 1541406, “serialnumber”: “VMware-56 4d cf 4c 7d b6 ed 7e-f5 2e d4 65 84 f9 50 84″, “processorcount”: “1″, “is_virtual”: true, “timezone”: “EST”, “hardwareisa”: “x86_64″, “id”: “root”, “netmask_lo”: “255.0.0.0″, “lsbmajdistrelease”: “6″, “uptime”: “0:10 hours”, “boardproductname”: “440BX Desktop Reference Platform”, “macaddress_eth0″: “00:0C:29:F9:50:84″, “rubyversion”: “1.8.7″, “hostname”: “HDPMaster”, “facterversion”: “1.6.10″, “lsbdistid”: “CentOS”, “virtual”: “vmware”, “operatingsystem”: “CentOS”, “network_lo”: “127.0.0.0″, “boardmanufacturer”: “Intel Corporation”, “sshdsakey”: “AAAAB3NzaC1kc3MAAACBAIsp2Cu5MSKLpY348PyK/EFjtQASogSZJx9j6FKy4tnoDxbvu9jJxpyBYk0biSOVMV9agNzfxDZPh5XtBkw2ewDUBQ7uD7Of+TQgyAUS1XByG7EC6nBWDywCjMCWiV4v+VDP2r9PzpE6k2Dlskf9CUKsk2oBAbv73JCRUyn7By0XAAAAFQCdAqv6g3VIi9MWhP2yQHypatJkXQAAAIEAhH2JuOjPdkxlc5q43g8qkIyP73+Z9pjn73/lhHS49GIhdw/U80j1/quEunTcDxqXrh5kL8DhBySAOY6aSJyp6z9MfpPejg5g/Q3b7zvX5r6gWkbhjU3ri2E9fC6M64iSABxrJ+KbZ2qMksnLrtlsACV/vpMH5l+5C4rs/o1vRwAAAACAT3ZeUfaF/4VDGMKyNSIFzZ7r9v0+kpjeOJVVHRtjSgS9qO/PRKhItR1ybqsLqtG7/MfJLf2PxCqKAnIEflB0VM55PA7QmzN8HCVhhJj3x/KIERld/U1IRmc773tQbB0UwGEEgdtBe6M2FyHFmxCoWRDRkndAXKjL+qLCxPFJlzY=”, “selinux”: “false”, “hardwaremodel”: “x86_64″, “kernelmajversion”: “2.6″, “type”: “Other”, “rubysitedir”: “/usr/lib/ambari-agent/lib/ruby-1.8.7-p370/lib/ruby/site_ruby/1.8″, “architecture”: “x86_64″, “osfamily”: “RedHat”, “swapfree”: “2.97 GB”, “sshrsakey”: “AAAAB3NzaC1yc2EAAAABIwAAAQEA109niJnmH6xEDm7gWKexI+CJ1wk3ZBd5PKlKJxYGoOkUfNFRlFNc1TBk12TdNQ0qp8ZKbcBlE3qpGoRtHirg7EeqGLS0v1rXqP5O+oqCOAmPPIcypWBaQhML2DOdpWpChip23pDnFZt8LZZrhi8/JJ+DtnP49utFAuCZvllNPn8wetXN8m9pN9oZ/aK4WbrdtdW9+bhs07q0YZ6+8Sr5lBZLE9PihFFF/YFwk9TnyD9mD/bdc1zB9ZrqmKGC8GVxsVPtfGvHW/MNCtmqaCkzQfh4aqtYoBBMK81qoQrG8W+dG1pUstzlYajSNkZwDNktowj3Y6IdNORBwuraI0xbgQ==”, “ps”: “ps -ef”, “memorysize”: 1541406, “interfaces”: “eth0,lo”, “physicalprocessorcount”: 1, “swapsize”: “2.97 GB”,

    Collapse
    #14477

    suaroman
    Participant

    Thanks for the response.
    I cleaned things up and making it a little further.
    Can you take a peek at the log and let me know what you think? I can’t seem to make it past the ‘Confirm Host’ part of the wizard. Host machine is called hdpmaster. Progress bar runs for a while then eventually fails.

    Verifying Python version compatibility…
    Using python /usr/bin/python2.6
    Checking for previously running Ambari Agent…
    Starting ambari-agent
    Verifying ambari-agent process status…
    Ambari Agent successfully started
    Agent PID at: /var/run/ambari-agent/ambari-agent.pid
    Agent log at: /var/log/ambari-agent/ambari-agent.out
    (‘hostname: ok HDPMaster
    ip: ok 192.168.0.11
    cpu: ok Intel(R) Core(TM) i7-2640M CPU @ 2.80GHz
    memory: ok 1.47172 GB
    disks: ok
    Filesystem Size Used Avail Use% Mounted on
    /dev/mapper/vg_hdpmaster-lv_root
    36G 4.5G 30G 13% /
    tmpfs 754M 264K 754M 1% /dev/shm
    /dev/sda1 485M 55M 406M 12% /boot
    os: ok CentOS release 6.3 (Final)
    iptables: ok
    Chain INPUT (policy ACCEPT 0 packets, 0 bytes)
    pkts bytes target prot opt in out source destination

    Chain FORWARD (policy ACCEPT 0 packets, 0 bytes)
    pkts bytes target prot opt in out source destination

    Chain OUTPUT (policy ACCEPT 0 packets, 0 bytes)
    pkts bytes target prot opt in out source destination
    selinux: ok SELINUX=disabled
    SELINUXTYPE=targeted
    yum: ok yum-3.2.29-30.el6.centos.noarch
    rpm: ok rpm-4.8.0-27.el6.x86_64
    openssl: ok openssl-1.0.0-25.el6_3.1.x86_64
    curl: ok curl-7.19.7-26.el6_2.4.x86_64
    wget: ok wget-1.12-1.4.el6.x86_64
    net-snmp: ok net-snmp-5.5-41.el6_3.1.x86_64
    net-snmp-utils: UNAVAILABLE
    ntpd: UNAVAILABLE
    ruby: UNAVAILABLE
    puppet: UNAVAILABLE
    nagios: UNAVAILABLE
    ganglia: UNAVAILABLE
    passenger: UNAVAILABLE
    hadoop: UNAVAILABLE
    yum_repos: ok
    AMBARI-1.x Ambari 1.x 6
    HDP-UTILS-1.1.0.15 Hortonworks Data Platform Utils Version – HDP-UTILS-1. 52
    zypper_repos: UNAVAILABLE
    ‘, None)
    (‘ \’sshrsakey\’: \’AAAAB3NzaC1yc2EAAAABIwAAAQEA109niJnmH6xEDm7gWKexI+CJ1wk3ZBd5PKlKJxYGoOkUfNFRlFNc1TBk12TdNQ0qp8ZKbcBlE3qpGoRtHirg7EeqGLS0v1rXqP5O+oqCOAmPPIcypWBaQhML2DOdpWpChip23pDnFZt8LZZrhi8/JJ+DtnP49utFAuCZvllNPn8wetXN8m9pN9oZ/aK4WbrdtdW9+bhs07q0YZ6+8Sr5lBZLE9PihFFF/YFwk9TnyD9mD/bdc1zB9ZrqmKGC8GVxsVPtfGvHW/MNCtmqaCkzQfh4aqtYoBBMK81qoQrG8W+dG1pUstzlYajSNkZwDNktowj3Y6IdNORBwuraI0xbgQ==\’,
    \’swapfree\’: \’2.97 GB\’,
    \’swapsize\’: \’2.97 GB\’,
    \’timezone\’: \’EST\’,
    \’type\’: \’Other\’,
    \’uniqueid\’: \’a8c00b00\’,
    \’uptime\’: \’0:10 hours\’,
    \’uptime_days\’: \’0\’,
    \’uptime_hours\’: \’0\’,
    \’uptime_seconds\’: \’628\’,
    \’virtual\’: \’vmware\’}

    Collapse
    #14457

    Sasha J
    Moderator

    Suaroman,
    It seems like your did not follow steps defined in installation manual…
    Your iptables is not disabled, (which is mandatory),
    SELINUX is in enforcing mode, should be disabled,
    You should not install agent manually anywhere, Ambari do it for you automatically,
    It seems like your name resolution is not working.

    Please, clean up your machine and start from the beginning, follow steps in installation guide precisely:

    http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-1.2.0/bk_using_Ambari_book/content/ambari-chap1.html

    Thank you!
    Sasha

    Collapse
    #14453

    suaroman
    Participant

    Seems like the failure is occurring because of this error:

    DEBUG:: Connecting to the following url https://HDPMachine1:8440/cert/ca

    INFO 2013-01-26 20:14:29,978 NetUtil.py:58 – Failed to connect to https://HDPMachine1:8440/cert/ca due to [Errno 110] Connection timed out

    Can someone help me understand who is supposed to be listening on port 8440 ? I can’t seem to find this anywhere. Ambari server machine and target machines are clean (new installs ) of Centos 6.3.

    Collapse
    #14452

    suaroman
    Participant

    Hi Everyone,

    Ambari host machine is failing to deploy host.
    Host machine is called hdpmaster

    erifying Python version compatibility…
    Using python /usr/bin/python2.6
    Checking for previously running Ambari Agent…
    ERROR: ambari-agent already running
    Check /var/run/ambari-agent/ambari-agent.pid for PID.
    (‘hostname: ok HDPMaster
    ip: ok 127.0.0.1 192.168.0.15
    cpu: ok Intel(R) Core(TM) i7-2640M CPU @ 2.80GHz
    memory: ok 1.47172 GB
    disks: ok
    Filesystem Size Used Avail Use% Mounted on
    /dev/mapper/vg_hdpmaster-lv_root
    36G 4.0G 31G 12% /
    tmpfs 754M 276K 754M 1% /dev/shm
    /dev/sda1 485M 55M 406M 12% /boot
    os: ok CentOS release 6.3 (Final)
    iptables: ok
    Chain INPUT (policy ACCEPT 0 packets, 0 bytes)
    pkts bytes target prot opt in out source destination
    5563 3527K ACCEPT all — * * 0.0.0.0/0 0.0.0.0/0 state RELATED,ESTABLISHED
    2 168 ACCEPT icmp — * * 0.0.0.0/0 0.0.0.0/0
    6 540 ACCEPT all — lo * 0.0.0.0/0 0.0.0.0/0
    22 1144 ACCEPT tcp — * * 0.0.0.0/0 0.0.0.0/0 state NEW tcp dpt:22
    1296 127K REJECT all — * * 0.0.0.0/0 0.0.0.0/0 reject-with icmp-host-prohibited

    Chain FORWARD (policy ACCEPT 0 packets, 0 bytes)
    pkts bytes target prot opt in out source destination
    0 0 REJECT all — * * 0.0.0.0/0 0.0.0.0/0 reject-with icmp-host-prohibited

    Chain OUTPUT (policy ACCEPT 6601 packets, 514K bytes)
    pkts bytes target prot opt in out source destination
    selinux: ok SELINUX=enforcing
    SELINUXTYPE=targeted
    yum: ok yum-3.2.29-30.el6.centos.noarch
    rpm: ok rpm-4.8.0-27.el6.x86_64
    openssl: ok openssl-1.0.0-25.el6_3.1.x86_64
    curl: ok curl-7.19.7-26.el6_2.4.x86_64
    wget: ok wget-1.12-1.4.el6.x86_64
    net-snmp: ok net-snmp-5.5-41.el6_3.1.x86_64
    net-snmp-utils: UNAVAILABLE
    ntpd: UNAVAILABLE
    ruby: UNAVAILABLE
    puppet: UNAVAILABLE
    nagios: UNAVAILABLE
    ganglia: UNAVAILABLE
    passenger: UNAVAILABLE
    hadoop: UNAVAILABLE
    yum_repos: ok
    AMBARI-1.x Ambari 1.x 6
    HDP-UTILS-1.1.0.15 Hortonworks Data Platform Utils Version – HDP-UTILS-1. 52
    zypper_repos: UNAVAILABLE
    ‘, None)
    (‘INFO 2013-01-26 20:13:16,966 NetUtil.py:77 – Server at https://HDPMachine1:8440 is not reachable, sleeping for 10 seconds…
    INFO 2013-01-26 20:13:26,977 NetUtil.py:44 – DEBUG:: Connecting to the following url https://HDPMachine1:8440/cert/ca
    INFO 2013-01-26 20:14:29,978 NetUtil.py:58 – Failed to connect to https://HDPMachine1:8440/cert/ca due to [Errno 110] Connection timed out
    INFO 2013-01-26 20:14:29,978 NetUtil.py:77 – Server at https://HDPMachine1:8440 is not reachable, sleeping for 10 seconds…(error truncated above).

    Collapse
Viewing 13 replies - 1 through 13 (of 13 total)