Hi,
I’ve been asked to write my How-To on the CentOS Wiki. I don’t want to have to manage 2 versions, so here is the link.
Sorry for the inconvenience…
Hi,
I’ve been asked to write my How-To on the CentOS Wiki. I don’t want to have to manage 2 versions, so here is the link.
Sorry for the inconvenience…
I just started playing with Nagios, an open-source monitoring software package (GPL). I used to use monit instead, but there are two limitations of monit that made me switch:
On the other hand, Nagios has tools that allows a Nagios server to perform “local” checks on remote servers, via the network (check_snmp, check_nt, check_nrpe and check_ssh). It has as side effect that it can monitor Windows servers quite well. The web interface enough for my needs.
Note: I’m still using monit for process checks, as Nagios can’t do that as well as monit does (monit uses the information in the lockfile to see if the process is still in memory, and uses user-defined commands to restart the process if it is not in memory).
Here is how Nagios works, basically:
Nagios has also other very nice features… Here is how I configured my server:
Nagios’ configuration is a lot less painful than I thought. To make it easier, I created one file for each organization for which I monitor servers for.
Future plans: Failover
Using nsca, it is possible to have a Nagios server “standby”, that would detect if the “master” Nagios server is down, and perform the checks and notifications during this downeime. nsca allows this “standby” server to have all information about previous checks.
BTW, I used the rpm packages from Dag Wieers, and they work fine with my CentOS 4 system. I tried installing nagios on a Fedora Core 4 machine, but the rpm packages for nagios in Fedora-extras are confusing…