Home Forums HDP on Linux – Installation HDP 1.2 Installation Failed

Tagged: 

This topic contains 16 replies, has 6 voices, and was last updated by  tedr 1 year, 4 months ago.

  • Creator
    Topic
  • #18324

    Hello,

    I am running into some issues in registering steps of the Cluster Installation Wizard of HDP 1.2. I am creating 3 nodes cluster with Ambari-server running in 1 of the 3 nodes. OS is 64bits redhat linux.

    The following log is in ambari-agent.log of the master and the same for all boxes. For some reasons, it can’t connect and register over https. I see other posts similar to this, but didn’t see any resolution. Anyone knows how to resolve this issues. I can provide personal email and phone to contact offline if needed. Any helps you can provide is greatly appreciated.

    IINFO 2013-03-20 09:38:48,240 security.py:49 – SSL Connect being called.. connecting to the server
    INFO 2013-03-20 09:38:48,367 Controller.py:103 – Unable to connect to: https://dev01.myserver.com:8441/agent/v1/register/dev01.myserver.com
    Traceback (most recent call last):
    File “/usr/lib/python2.6/site-packages/ambari_agent/Controller.py”, line 88, in registerWithServer
    response = self.sendRequest(self.registerUrl, data)
    File “/usr/lib/python2.6/site-packages/ambari_agent/Controller.py”, line 237, in sendRequest
    self.cachedconnect = security.CachedHTTPSConnection(self.config)
    File “/usr/lib/python2.6/site-packages/ambari_agent/security.py”, line 77, in __init__
    self.connect()
    File “/usr/lib/python2.6/site-packages/ambari_agent/security.py”, line 82, in connect
    self.httpsconn.connect()
    File “/usr/lib/python2.6/site-packages/ambari_agent/security.py”, line 66, in connect
    ca_certs=server_crt)
    File “/usr/lib64/python2.6/ssl.py”, line 338, in wrap_socket

    suppress_ragged_eofs=suppress_ragged_eofs)
    File “/usr/lib64/python2.6/ssl.py”, line 120, in __init__
    self.do_handshake()
    File “/usr/lib64/python2.6/ssl.py”, line 279, in do_handshake
    self._sslobj.do_handshake()
    SSLError: [Errno 8] _ssl.c:491: EOF occurred in violation of protocol

    Cheers,

    Brandon

Viewing 16 replies - 1 through 16 (of 16 total)

You must be logged in to reply to this topic.

  • Author
    Replies
  • #20333

    tedr
    Member

    Hi Paolo,

    Thanks for letting us know that this is working for you now.

    Thanks,
    Ted.

    Collapse
    #20324

    Finally managed to install a 3 nodes cluster. To solve I restarted the entire installation from scratch.
    Thanks
    Paolo

    Collapse
    #20292

    This is the result on all nodes:
    [root@nodemaster hadoop]# wget https://nodemaster.localdomain:8441
    --2013-04-03 23:01:13-- https://nodemaster.localdomain:8441/
    Resolving nodemaster.localdomain... 192.168.1.100
    Connecting to nodemaster.localdomain|192.168.1.100|:8441... connected.
    Unable to establish SSL connection.
    [root@nodemaster hadoop]# wget https://nodemaster.localdomain:8441/agent/v1/register/node2.localdomain
    --2013-04-03 23:01:31-- https://nodemaster.localdomain:8441/agent/v1/register/node2.localdomain
    Resolving nodemaster.localdomain... 192.168.1.100
    Connecting to nodemaster.localdomain|192.168.1.100|:8441... connected.
    Unable to establish SSL connection.

    Paolo

    Collapse
    #20208

    abdelrahman
    Moderator

    Hi Paolo,

    How is your day so far? Can you please validate that all of the firewalls are off by running the following from command line on every host ?
    # /etc/init.d/iptables stop
    Once this step is completed please make an http requests from every host using the following commands:
    wget https://nodemaster.localdomain:8441
    wget https://nodemaster.localdomain:8441/agent/v1/register/node2.localdomain

    Hope this helps.

    Thanks
    -Abdelrahman

    Collapse
    #20147

    Hi Ted, seems the same also with the fully qualified name

    INFO 2013-04-03 20:28:53,032 security.py:49 – SSL Connect being called.. connecting to the server
    INFO 2013-04-03 20:28:53,144 Controller.py:103 – Unable to connect to: https://nodemaster.localdomain:8441/agent/v1/register/node2.localdomain

    Paolo

    Collapse
    #20145

    tedr
    Member

    Hi Paolo,

    When I look over the log snippet you just posted, I see it trying to connect to this URL https://nodemaster:8441/agent/v1/register/nodemaster.localdomain
    Which points out that you may have entered the short hostname (nodemaster) into the list for the hosts in your cluster, if that is so then that is where the problem lies. It is required that the fully qualified hostnames be entered in the list of hosts. In your case the fully qualified name would be nodemaster.localdomain.

    Thanks,
    Ted.

    Collapse
    #20144

    Hi Ted
    thanks for your reply.

    Finally managed to install version 1.2.2.4-1 but I’m getting the same error:
    INFO 2013-04-03 18:28:54,061 security.py:49 - SSL Connect being called.. connecting to the server
    INFO 2013-04-03 18:28:54,131 Controller.py:103 - Unable to connect to: https://nodemaster:8441/agent/v1/register/nodemaster.localdomain
    Traceback (most recent call last):
    File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 88, in registerWithServer
    response = self.sendRequest(self.registerUrl, data)
    File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 237, in sendRequest
    self.cachedconnect = security.CachedHTTPSConnection(self.config)
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 77, in __init__
    self.connect()
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 82, in connect
    self.httpsconn.connect()
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 66, in connect
    ca_certs=server_crt)
    File "/usr/lib64/python2.6/ssl.py", line 338, in wrap_socket
    suppress_ragged_eofs=suppress_ragged_eofs)
    File "/usr/lib64/python2.6/ssl.py", line 120, in __init__
    self.do_handshake()
    File "/usr/lib64/python2.6/ssl.py", line 279, in do_handshake
    self._sslobj.do_handshake()
    SSLError: [Errno 8] _ssl.c:490: EOF occurred in violation of protocol

    Is there anything I can do to move ahead with the installation?
    Thanks
    Paolo

    Collapse
    #20140

    tedr
    Member

    Hi Paolo,

    The newest version is at the URL that you used. It looks like you may have just done your install prior to the repo being updated.

    Thanks,
    Ted.

    Collapse
    #19972

    Hi Seth
    Please can you share the repository url to be used and the correct version to be installed?

    Thanks
    Paolo

    Collapse
    #19971

    Seth Lyubich
    Keymaster

    Hi Paolo,

    Looks like there is newer version. Can you please try to uninstall ambari agent and ambari-server with yum erase ambari-agent and yum erase ambari-server, remove Ambari repositories, clean repositories, and try installation again with newest repo?

    Please let us know if we can help with anything else.

    Thanks,
    Seth

    Collapse
    #19967

    This is the repo I used to install:
    rpm -Uvh http://public-repo-1.hortonworks.com/ambari/centos6/1.x/GA/ambari-1.x-1.el6.noarch.rpm
    from the documentation at:

    http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-1.2.2/bk_using_Ambari_book/content/index.html

    This is the result of: yum list installed | grep ambari
    ambari.noarch 1.x-1.el6 installed
    ambari-agent.x86_64 1.2.2.3-1 @Updates-ambari-1.x
    ambari-server.noarch 1.2.2.3-1 @Updates-ambari-1.x

    It should be the last, can you confirm please?

    Thanks,
    Paolo

    Collapse
    #19956

    Seth Lyubich
    Keymaster

    Hi Paolo,

    This issue should be resolved in 1.2.2. Can you please check that you are using latest version?

    Thanks,
    Seth

    Collapse
    #19882

    Hi Larry

    I’ve the same issue of Brandon:

    INFO 2013-04-02 12:29:54,299 security.py:49 - SSL Connect being called.. connecting to the server
    INFO 2013-04-02 12:29:54,386 Controller.py:103 - Unable to connect to: https://nodemaster.localdomain:8441/agent/v1/register/node2.localdomain
    Traceback (most recent call last):
    File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 88, in registerWithServer
    response = self.sendRequest(self.registerUrl, data)
    File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 237, in sendRequest
    self.cachedconnect = security.CachedHTTPSConnection(self.config)
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 77, in __init__
    self.connect()
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 82, in connect
    self.httpsconn.connect()
    File "/usr/lib/python2.6/site-packages/ambari_agent/security.py", line 66, in connect
    ca_certs=server_crt)
    File "/usr/lib64/python2.6/ssl.py", line 338, in wrap_socket
    suppress_ragged_eofs=suppress_ragged_eofs)
    File "/usr/lib64/python2.6/ssl.py", line 120, in __init__
    self.do_handshake()
    File "/usr/lib64/python2.6/ssl.py", line 279, in do_handshake
    self._sslobj.do_handshake()
    SSLError: [Errno 8] _ssl.c:490: EOF occurred in violation of protocol

    Do you know if the patch is out? If not, is it possible to apply it manually?
    Any workaround meanwhile?

    Thanks
    Paolo

    Collapse
    #18389

    Larry Liu
    Moderator

    Hi, Brandon,

    Here is the bug:

    https://issues.apache.org/jira/browse/AMBARI-1432

    You need to remove the following:

    1. ambari agent
    2. ambari server
    3. postgresql
    4. remove ambari and hdp repos from /etc/yum.repos.d

    Larry

    Collapse
    #18387

    Hi Larry,

    Thanks for your time.

    Do you know the details of the defect that I can take a look at? Also, do you have instruction on how to remove old installation safely?

    Thanks,

    -Phi

    Collapse
    #18384

    Larry Liu
    Moderator

    Hi, Brandon,

    This issue will be fixed in the next release, which should be out next week.

    Before install new release, please remove old installation and repos from the server.

    Thanks
    Larry

    Collapse
Viewing 16 replies - 1 through 16 (of 16 total)