We are using HDP 1.3. During the process of setting up the cluster we are facing couple of issues
(We have 3 machines, one among them itself runs ambari all on CentOS 5.4)
a.1)We chose auto registration of ambari-agent. The registration process passed for server3,
but for other two it failed (because of ssh ).
a.2) We corrected all machines, amabari-server reset, and tried again.
Till time server3 failed with the below exception.
NetUtil.py:41 – Connecting to the following url https://localhost.localdomain:8440/cert/ca
NetUtil.py:55 – Failed to connect to https://localhost.localdomain:8440/cert/ca due to [Errno 111] Connection refused
It was taking,localhost instead of ambari server. But /etc/ambari-agent/conf/ambari-agent.ini was pointing to proper server.
So, we tried the following
* Remove rpm,reset ambari -failed
* Remove the rpm,delete /etc/ambari-agent, delete /usr/lib/ambari* , retry – It worked
Would like to know any inputs on where we went wrong.
b)After this Installing and starting server failed , with below error
ERROR ServiceComponentHostImpl:721 – Can’t handle ServiceComponentHostEvent event at current state, serviceComponentName=GANGLIA_SERVER, hostName=server233.xxxxxx, currentState=INSTALL_FAILED, eventType=HOST_SVCCOMP_OP_
SUCCEEDED, event=EventType: HOST_SVCCOMP_OP_SUCCEEDED
15:17:12,934 WARN HeartBeatHandler:233 – State machine exception
org.apache.ambari.server.state.fsm.InvalidStateTransitionException: Invalid event: HOST_SVCCOMP_OP_SUCCEEDED at INSTALL_FAILED
We retired again,4th attempt it passed.Is there any extra conf that I have to take care in this case ?
c) After successfully completing everything , we were able to see the dashboard and all .
But, next time we I use the same url, it redirects me to the installation process.
But all the process is running and the cluster is healthy. We shortly discovered that only this url redirects me
If this is a known problem any proper workaround for this.