HDP on Linux – Installation Forum

HDP 1.2 Installation Failed

  • #18324

    Hello,

    I am running into some issues in the host registration step of the Cluster Install Wizard of HDP 1.2. I am creating a 3-node cluster with ambari-server running on one of the three nodes. The OS is 64-bit Red Hat Linux.

    The following log is from ambari-agent.log on the master, and it is the same on all boxes. For some reason, the agent can't connect and register over HTTPS. I have seen other posts similar to this one, but none with a resolution. Does anyone know how to resolve this issue? I can provide a personal email and phone number to be contacted offline if needed. Any help you can provide is greatly appreciated.

    INFO 2013-03-20 09:38:48,240 security.py:49 - SSL Connect being called.. connecting to the server
    INFO 2013-03-20 09:38:48,367 Controller.py:103 - Unable to connect to: https://dev01.myserver.com:8441/agent/v1/register/dev01.myserver.com
    Traceback (most recent call last):
    File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 88, in registerWithServer
    response = self.sendRequest(self.registerUrl, data)
    File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 237, in sendRequest
    self.cachedconnect = security.CachedHTTPSConnection(self.config)
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 77, in __init__
    self.connect()
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 82, in connect
    self.httpsconn.connect()
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 66, in connect
    ca_certs=server_crt)
    File "/usr/lib64/python2.6/ssl.py", line 338, in wrap_socket
    suppress_ragged_eofs=suppress_ragged_eofs)
    File "/usr/lib64/python2.6/ssl.py", line 120, in __init__
    self.do_handshake()
    File "/usr/lib64/python2.6/ssl.py", line 279, in do_handshake
    self._sslobj.do_handshake()
    SSLError: [Errno 8] _ssl.c:491: EOF occurred in violation of protocol

    Cheers,

    Brandon

  • #18384
    Larry Liu
    Moderator

    Hi, Brandon,

    This issue will be fixed in the next release, which should be out next week.

    Before installing the new release, please remove the old installation and repos from the server.

    Thanks
    Larry

    #18387

    Hi Larry,

    Thanks for your time.

    Can you point me to the details of the defect so that I can take a look? Also, do you have instructions on how to remove the old installation safely?

    Thanks,

    -Phi

    #18389
    Larry Liu
    Moderator

    Hi, Brandon,

    Here is the bug:
    https://issues.apache.org/jira/browse/AMBARI-1432

    You need to remove the following:

    1. ambari agent
    2. ambari server
    3. postgresql
    4. Ambari and HDP repos in /etc/yum.repos.d
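
The removal list above could be scripted roughly as follows. This is only a sketch in Python that prints the commands instead of running them. The package names are taken from the list as written and may differ on a given host (for example, the PostgreSQL package may carry a different name), and the rule that repo files have "ambari" or "hdp" in their file name is an assumption. The helper name removal_plan is hypothetical.

```python
import os

# Package names as given in the list above; the exact names installed on a
# host are an assumption and can be checked with `yum list installed`.
PACKAGES = ["ambari-agent", "ambari-server", "postgresql"]


def removal_plan(repo_dir="/etc/yum.repos.d"):
    """Return the removal commands in order, without executing anything."""
    plan = [["yum", "erase", "-y", pkg] for pkg in PACKAGES]
    # Assumption: the repo files have "ambari" or "hdp" in their file name.
    if os.path.isdir(repo_dir):
        for name in sorted(os.listdir(repo_dir)):
            lowered = name.lower()
            if lowered.endswith(".repo") and ("ambari" in lowered or "hdp" in lowered):
                plan.append(["rm", "-f", os.path.join(repo_dir, name)])
    # Drop cached repo metadata so the next install reads the fresh repo.
    plan.append(["yum", "clean", "all"])
    return plan


if __name__ == "__main__":
    # Dry run: review the printed commands before running any of them as root.
    for command in removal_plan():
        print(" ".join(command))
```

Printing the plan first makes it easy to confirm that only the intended repo files are matched before anything is deleted.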

    Larry

    #19882

    Hi Larry

    I have the same issue as Brandon:

    INFO 2013-04-02 12:29:54,299 security.py:49 - SSL Connect being called.. connecting to the server
    INFO 2013-04-02 12:29:54,386 Controller.py:103 - Unable to connect to: https://nodemaster.localdomain:8441/agent/v1/register/node2.localdomain
    Traceback (most recent call last):
    File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 88, in registerWithServer
    response = self.sendRequest(self.registerUrl, data)
    File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 237, in sendRequest
    self.cachedconnect = security.CachedHTTPSConnection(self.config)
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 77, in __init__
    self.connect()
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 82, in connect
    self.httpsconn.connect()
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 66, in connect
    ca_certs=server_crt)
    File "/usr/lib64/python2.6/ssl.py", line 338, in wrap_socket
    suppress_ragged_eofs=suppress_ragged_eofs)
    File "/usr/lib64/python2.6/ssl.py", line 120, in __init__
    self.do_handshake()
    File "/usr/lib64/python2.6/ssl.py", line 279, in do_handshake
    self._sslobj.do_handshake()
    SSLError: [Errno 8] _ssl.c:490: EOF occurred in violation of protocol

    Do you know if the patch is out? If not, is it possible to apply it manually?
    Is there any workaround in the meantime?

    Thanks
    Paolo

    #19956
    Seth Lyubich
    Moderator

    Hi Paolo,

    This issue should be resolved in 1.2.2. Can you please check that you are using the latest version?

    Thanks,
    Seth

    #19967

    This is the repo I used to install:
    rpm -Uvh http://public-repo-1.hortonworks.com/ambari/centos6/1.x/GA/ambari-1.x-1.el6.noarch.rpm
    from the documentation at:
    http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-1.2.2/bk_using_Ambari_book/content/index.html

    This is the result of: yum list installed | grep ambari
    ambari.noarch 1.x-1.el6 installed
    ambari-agent.x86_64 1.2.2.3-1 @Updates-ambari-1.x
    ambari-server.noarch 1.2.2.3-1 @Updates-ambari-1.x
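
As a rough way to read a listing like the one above, the following Python sketch extracts the version of each Ambari package from the output of yum list installed and compares it with 1.2.2.4, the version that a later post in this thread reports installing. The function name ambari_versions and the parsing rules are illustrative assumptions, not part of Ambari or yum.

```python
def ambari_versions(yum_output):
    """Map each ambari-* package name to its version as a tuple of ints."""
    versions = {}
    for line in yum_output.splitlines():
        fields = line.split()
        # Skip the "ambari.noarch" repo package, whose version is not numeric.
        if len(fields) < 2 or not fields[0].startswith("ambari-"):
            continue
        name = fields[0].rsplit(".", 1)[0]     # drop ".x86_64" / ".noarch"
        upstream = fields[1].split("-", 1)[0]  # drop the "-1" release suffix
        versions[name] = tuple(int(part) for part in upstream.split("."))
    return versions


listing = """ambari.noarch 1.x-1.el6 installed
ambari-agent.x86_64 1.2.2.3-1 @Updates-ambari-1.x
ambari-server.noarch 1.2.2.3-1 @Updates-ambari-1.x"""

installed = ambari_versions(listing)
# Tuples compare element by element, so (1, 2, 2, 3) sorts before (1, 2, 2, 4).
older_than_1_2_2_4 = all(v < (1, 2, 2, 4) for v in installed.values())
print(installed, older_than_1_2_2_4)
```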

    This should be the latest version. Can you please confirm?

    Thanks,
    Paolo

    #19971
    Seth Lyubich
    Moderator

    Hi Paolo,

    It looks like there is a newer version. Can you please uninstall ambari-agent and ambari-server with yum erase ambari-agent and yum erase ambari-server, remove the Ambari repositories, clean the repositories, and try the installation again with the newest repo?

    Please let us know if we can help with anything else.

    Thanks,
    Seth

    #19972

    Hi Seth
    Can you please share the repository URL to use and the correct version to install?

    Thanks
    Paolo

    #20140
    tedr
    Member

    Hi Paolo,

    The newest version is at the URL that you used. It looks like you may have just done your install prior to the repo being updated.

    Thanks,
    Ted.

    #20144

    Hi Ted
    Thanks for your reply.

    I finally managed to install version 1.2.2.4-1, but I'm getting the same error:
    INFO 2013-04-03 18:28:54,061 security.py:49 - SSL Connect being called.. connecting to the server
    INFO 2013-04-03 18:28:54,131 Controller.py:103 - Unable to connect to: https://nodemaster:8441/agent/v1/register/nodemaster.localdomain
    Traceback (most recent call last):
    File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 88, in registerWithServer
    response = self.sendRequest(self.registerUrl, data)
    File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 237, in sendRequest
    self.cachedconnect = security.CachedHTTPSConnection(self.config)
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 77, in __init__
    self.connect()
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 82, in connect
    self.httpsconn.connect()
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 66, in connect
    ca_certs=server_crt)
    File "/usr/lib64/python2.6/ssl.py", line 338, in wrap_socket
    suppress_ragged_eofs=suppress_ragged_eofs)
    File "/usr/lib64/python2.6/ssl.py", line 120, in __init__
    self.do_handshake()
    File "/usr/lib64/python2.6/ssl.py", line 279, in do_handshake
    self._sslobj.do_handshake()
    SSLError: [Errno 8] _ssl.c:490: EOF occurred in violation of protocol

    Is there anything I can do to move ahead with the installation?
    Thanks
    Paolo

    #20145
    tedr
    Member

    Hi Paolo,

    Looking over the log snippet you just posted, I see it trying to connect to this URL: https://nodemaster:8441/agent/v1/register/nodemaster.localdomain
    This suggests that you may have entered the short hostname (nodemaster) in the list of hosts for your cluster. If so, that is where the problem lies: the fully qualified hostnames must be entered in the list of hosts. In your case the fully qualified name would be nodemaster.localdomain.
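
A quick way to catch short hostnames before pasting a host list into the wizard is sketched below in Python. The check is deliberately simple (a name counts as fully qualified when it has at least two non-empty dot-separated labels), and the helper names are hypothetical. It does not verify that the names resolve.

```python
def is_fully_qualified(hostname):
    """True when the name has at least two non-empty dot-separated labels."""
    labels = hostname.strip().rstrip(".").split(".")
    return len(labels) >= 2 and all(labels)


def short_names(hosts):
    """Return the entries that look like short (unqualified) hostnames."""
    return [host for host in hosts if not is_fully_qualified(host)]


# Host names taken from the log snippets in this thread.
hosts = ["nodemaster", "nodemaster.localdomain", "node2.localdomain"]
print(short_names(hosts))
```

On each node, the name the machine reports for itself can be compared against the list with hostname -f on the command line or socket.getfqdn() in Python.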

    Thanks,
    Ted.

    #20147

    Hi Ted, I see the same error with the fully qualified name as well:

    INFO 2013-04-03 20:28:53,032 security.py:49 - SSL Connect being called.. connecting to the server
    INFO 2013-04-03 20:28:53,144 Controller.py:103 - Unable to connect to: https://nodemaster.localdomain:8441/agent/v1/register/node2.localdomain

    Paolo

    #20208
    abdelrahman
    Moderator

    Hi Paolo,

    How is your day so far? Can you please verify that the firewall is off on every host by running the following from the command line?
    # /etc/init.d/iptables stop
    Once this step is completed, please make an HTTPS request from every host using the following commands:
    wget https://nodemaster.localdomain:8441
    wget https://nodemaster.localdomain:8441/agent/v1/register/node2.localdomain
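
The same check can be split into its two stages (TCP connect, then TLS handshake) to see where it fails. The sketch below is not Ambari code. It assumes a Python 3 interpreter is available on the machine running the check (the agent hosts in this thread run Python 2.6), and the function name probe_tls is hypothetical. Certificate verification is switched off on purpose, because the question is only whether the handshake completes.

```python
import socket
import ssl


def probe_tls(host, port, timeout=5.0):
    """Try a TCP connect, then a TLS handshake; return (stage, detail)."""
    try:
        sock = socket.create_connection((host, port), timeout=timeout)
    except OSError as exc:
        # Name resolution failure, refused connection, firewall drop, timeout.
        return ("tcp_failed", str(exc))
    # No certificate checks: this probe only tests whether the handshake
    # completes, not whether the server certificate is trusted.
    context = ssl.SSLContext(ssl.PROTOCOL_TLS_CLIENT)
    context.check_hostname = False
    context.verify_mode = ssl.CERT_NONE
    try:
        tls = context.wrap_socket(sock, server_hostname=host)
    except (ssl.SSLError, OSError) as exc:
        # The port is reachable but the TLS handshake did not complete.
        sock.close()
        return ("handshake_failed", str(exc))
    version = tls.version()
    tls.close()
    return ("ok", version)
```

Calling probe_tls("nodemaster.localdomain", 8441) from each host returns "tcp_failed" when the port cannot be reached at all and "handshake_failed" when the connection opens but the handshake breaks, which is the situation the agent traceback describes.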

    Hope this helps.

    Thanks
    -Abdelrahman

    #20292

    This is the result on all nodes:
    [root@nodemaster hadoop]# wget https://nodemaster.localdomain:8441
    --2013-04-03 23:01:13-- https://nodemaster.localdomain:8441/
    Resolving nodemaster.localdomain... 192.168.1.100
    Connecting to nodemaster.localdomain|192.168.1.100|:8441... connected.
    Unable to establish SSL connection.
    [root@nodemaster hadoop]# wget https://nodemaster.localdomain:8441/agent/v1/register/node2.localdomain
    --2013-04-03 23:01:31-- https://nodemaster.localdomain:8441/agent/v1/register/node2.localdomain
    Resolving nodemaster.localdomain... 192.168.1.100
    Connecting to nodemaster.localdomain|192.168.1.100|:8441... connected.
    Unable to establish SSL connection.

    Paolo

    #20324

    I finally managed to install a 3-node cluster. To solve the problem, I restarted the entire installation from scratch.
    Thanks
    Paolo

    #20333
    tedr
    Member

    Hi Paolo,

    Thanks for letting us know that this is working for you now.

    Thanks,
    Ted.
