Home Forums Ambari Nagios Issues

This topic contains 6 replies, has 3 voices, and was last updated by  Ray Roberts 1 year, 7 months ago.

  • Creator
    Topic
  • #23973

    Ray Roberts
    Participant

    I’m noticing a lot of errors(red alerts in Ambari) in Nagios for my HDP nodes. Specifically, it says all my hosts are down, which they obviously are not. I assume there is an issue with the plugins or the type of check Nagios is performing. I’m fairly new to Nagios, so I apologize if this is trivial.

    Most of my statuses in Nagios UI show: (Return code of 127 is out of bounds – plugin may be missing)

    The rest of the statuses show: (null)

    I did some research on Nagios and I’m unable to locate the default installation directories that suppose to exist. For example, the etc dir for Nagios doesn’t exist on my Nagios node. I only have cgi-bin and plugins directories.

    Thanks in advance for your help, you guys have been awesome.

    -Ray

Viewing 6 replies - 1 through 6 (of 6 total)

You must be logged in to reply to this topic.

  • Author
    Replies
  • #25686

    Ray Roberts
    Participant

    Abdelrahman,

    Thanks.

    So, what about nagios-common? Should I downgrade that as well?

    After removing nagios-plugins, I’m pulling version 1.4.16-5 from the HDP-epel repo, not 1.4.9-1

    -Ray

    Collapse
    #25602

    abdelrahman
    Moderator

    Hi Ray,

    How is your day so far? This will be the cause of the issue since the Nagios plugins are not compatible. Please downgrade Nagios packages to the following rpms :

    nagios 3.2.3-2.el6
    nagios-plugins1.4.9-1

    Hope this helps
    Thanks
    -Abdelrahman

    Collapse
    #25593

    Ray Roberts
    Participant

    Seth,

    Sorry for the delayed response.

    No, I didn’t have a prior installation of Nagios.

    Here is my output from the below commands:

    #rpm -qa|grep nagios
    nagios-common-3.4.4-1.el6.x86_64
    hdp_mon_nagios_addons-1.2.2.5-1.el6.noarch
    nagios-plugins-1.4.16-5.el6.x86_64
    nagios-3.4.4-1.el6.x86_64

    #yum info nagios-plugins
    Installed Packages
    Name : nagios-plugins
    Arch : x86_64
    Version : 1.4.16
    Release : 5.el6
    Size : 939 k
    Repo : installed
    From repo : HDP-epel
    Summary : Host/service/network monitoring program plugins for Nagios
    URL : http://nagiosplug.sourceforge.net/

    It looks like I have a slightly newer version, however I assume that is due to it being updated via HDP. Either way, would this be the problem?

    -Ray

    Collapse
    #24136

    Seth Lyubich
    Keymaster

    Hi Ray,

    This is usually caused by incompatible version of Nagios plugins. Did you have Nagios installed prior to installation?

    this is what I have on my system:

    bash-4.1$ rpm -qa|grep nagios
    hdp_mon_nagios_addons-0.0.2.15-1.noarch
    nagios-plugins-1.4.9-1.x86_64
    nagios-3.2.3-2.el6.x86_64

    You can check prerequisites here:

    http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-1.2.3.1/bk_using_Ambari_book/content/ambari-chap1-5-1.html

    You can check your version(s) by running ‘yum info nagios-plugins’. Please make sure that 1.4.9 version is installed.

    Hope this helps,

    Seth

    Collapse
    #24088

    Ray Roberts
    Participant

    Seth,

    Thanks.

    I don’t recall having any issues during installation.

    Can I run a smoke test now, after installation?

    Nagios seems to be up and running fine, it just seems like something is wrong with the config or plugins its using. I haven’t touched anything with Nagios or its config. To be honest, it really hasn’t been a priority of mine until I cleared up getting all the main services (i.e. hdfs, zookeeker, etc) up and running. So, now, I’d like to clear up Nagios’s alerts and errors. I have noticed the alerts(red boxes by each node in Ambari) from Nagios since day 1.

    -Ray

    Collapse
    #24081

    Seth Lyubich
    Keymaster

    Hi Ray,

    You can try to run ps command and check similar to below:

    [root@ambari1 nagios]# ps -aef|grep nagios
    nagios 15065 1 0 Jan21 ? 00:06:22 /usr/bin/nagios -d /etc/nagios/nagios.cfg

    From here you should be able to find where Nagios runs from and configuration location.

    Also, did you receive any error during installation? If yes what was the issue? Did you pass Nagios smoke test?

    Hope this helps,

    Thanks,
    Seth

    Collapse
Viewing 6 replies - 1 through 6 (of 6 total)