Nagios is sending critical false alerts about current users

 
# 1  
Old 01-23-2012

Hello All,

Nagios seems to be sending false alerts for a few hosts. For example, there were no users logged in on one host, yet Nagios reported a critical alert saying 6 users were logged in. How do I fix this?

Also, I have installed Nagios and added 12 hosts as a start, and I am monitoring a few details on them. How do I trigger test alerts and check that they are being reported through the Nagios server?


Thanks
Saikrishna
# 2  
Old 01-23-2012
Quote:
How do I fix this one?
There might be several thousand scripts for Nagios out there. Without knowing the details of the script or method you're using to monitor your hosts, it would be almost impossible to provide a useful answer.

What are you monitoring on those hosts, and how? Is it a custom script? Do you use NRPE? How do you connect to them?
# 3  
Old 01-24-2012
Hey Verdepollo,

Yes, NRPE is being used. Below are a few details from the Nagios master server; let me know if you need additional information.

[root@proof etc]# uname -a
Linux proof 2.6.18-274.el5 #1 SMP Fri Jul 22 04:43:29 EDT 2011 x86_64 x86_64 x86_64 GNU/Linux
[root@proof etc]# cat /etc/redhat-release
CentOS release 5.7 (Final)


[root@proof etc]# ls -ltr /usr/local/nagios/etc/
total 116
-rw-rw---- 1 nagios nagios 1340 Jan 13 18:51 resource.cfg
-rw-rw-r-- 1 nagios nagios 43774 Jan 13 18:51 nagios.cfg.01.16.2012
-rw-rw-r-- 1 nagios nagios 11408 Jan 13 18:51 cgi.cfg
-rw-r--r-- 1 root root 26 Jan 16 09:53 htpasswd.users
-rw-rw-r-- 1 nagios nagios 43830 Jan 16 17:37 nagios.cfg
drwxrwxr-x 2 nagios nagios 4096 Jan 17 09:04 objects
-rw-rw-r-- 1 nagios nagios 2813 Jan 18 14:15 services.cfg


[root@proof etc]# cat services.cfg
# Generic service definition template
define service{
name generic-service ;
active_checks_enabled 1 ;
passive_checks_enabled 1 ;
parallelize_check 1 ;
obsess_over_service 1 ;
check_freshness 0 ;
notifications_enabled 1 ;
event_handler_enabled 1 ;
flap_detection_enabled 1 ;
process_perf_data 1 ;
retain_status_information 1 ;
retain_nonstatus_information 1 ;

register 0 ; DONT REGISTER TEMPLATE!
}

# Service definition
define service{
use generic-service ; Name of service template

hostgroup_name admins
service_description HTTP
is_volatile 0
check_period 24x7
max_check_attempts 3
normal_check_interval 3
retry_check_interval 1
contact_groups admins,
notification_interval 30
notification_period 24x7
notification_options w,u,c,r
check_command check_http
}

# Service definition
define service{
use generic-service ; Name of service template

hostgroup_name admins
service_description LocalDisk
is_volatile 0
check_period 24x7
max_check_attempts 3

normal_check_interval 3
retry_check_interval 1
contact_groups admins
notification_interval 30
notification_period 24x7
notification_options w,u,c,r
check_command check_local_disk
}

# Service definition
define service{
use generic-service ; Name of service template

hostgroup_name admins
service_description HTTP
is_volatile 0
check_period 24x7
max_check_attempts 3
normal_check_interval 3
retry_check_interval 1
contact_groups admins
notification_interval 30
notification_period 24x7
notification_options w,u,c,r
check_command check_http
}
# 4  
Old 01-24-2012
Maybe your client configuration would also be useful. ;-)
# 5  
Old 01-24-2012
Here you go.

I've used the same config file for all servers, updating the hostname and IP address for each. Let me know if you need additional information.

define host{
use linux-server ; Inherit default values from a template
host_name clientnagios; The name we're giving to this server
alias CentOS 5; A longer name for the server
address 10.10.2.19; IP address of the server
}

#define service{
# use generic-service
# host_name clientnagios
# service_description HTTP
# check_command check_http!check_http
#}
define service{
use generic-service
host_name clientnagios
service_description PING
check_command check_ping!100.0,20%!500.0,60%
initial_state u
max_check_attempts 2
check_interval 5
retry_interval 1
check_period 24x7
notification_interval 15
notification_period 24x7
notifications_enabled 1
notification_options c,r
}

define service{
use generic-service
host_name clientnagios
service_description Root Partition
check_command check_local_disk!20%!10%!/
initial_state u
max_check_attempts 3
check_interval 10
retry_interval 2
check_period 24x7
notification_interval 30
first_notification_delay 15
notification_period 24x7
notifications_enabled 1
notification_options c,r
}


define service{
use generic-service ; Name of service template to use
host_name clientnagios
service_description Current Users
check_command check_local_users!3!5
initial_state u
max_check_attempts 3
check_interval 10
retry_interval 2
check_period 24x7
notification_interval 30
first_notification_delay 15
notification_period 24x7
notifications_enabled 1
notification_options c,r
}

define service{
use generic-service ; Name of service template to use
host_name clientnagios
service_description Total Processes
check_command check_local_procs!375!400!RSZDT
initial_state u
max_check_attempts 3
check_interval 10
retry_interval 2
check_period 24x7
notification_interval 30
first_notification_delay 15
notification_period 24x7
notifications_enabled 1
notification_options c,r
}

define service{
use generic-service ; Name of service template to use
host_name clientnagios
service_description Current Load
check_command check_local_load!5.0,4.0,3.0!10.0,6.0,4.0
initial_state u
max_check_attempts 3
check_interval 10
retry_interval 2
check_period 24x7

notification_interval 30
first_notification_delay 15
notification_period 24x7
notifications_enabled 1
notification_options c,r
}

define service{
use generic-service ; Name of service template to use
host_name clientnagios
service_description Swap Usage
check_command check_local_swap!20!10
initial_state u
max_check_attempts 3
check_interval 10
retry_interval 2
check_period 24x7
notification_interval 30
first_notification_delay 15
notification_period 24x7
notifications_enabled 1
notification_options c,r
}
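
One thing worth checking in the definitions above: in the stock Nagios sample configuration, check_local_users is defined to run the check_users plugin on the Nagios server itself ($USER1$/check_users -w $ARG1$ -c $ARG2$), not on the monitored host. If that stock definition is in use here, "Current Users" would report the users logged in to the master for every host, which could explain 6 users (above the critical threshold of 5) on a host with nobody logged in. The same would apply to the other check_local_* commands. A minimal sketch of an NRPE-based alternative follows. It assumes a check_nrpe command is defined on the master and that each client's nrpe.cfg contains a command named check_users; neither is shown in this thread, so both names are assumptions to verify against the actual setup.

```
# On the Nagios master (assumed definition; check your commands.cfg)
define command{
command_name check_nrpe
command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$
}

# Service that runs the user check on the remote host through NRPE
define service{
use generic-service
host_name clientnagios
service_description Current Users
check_command check_nrpe!check_users
}

# On each client, in nrpe.cfg (assumed plugin path; thresholds match the 3/5 used above)
# command[check_users]=/usr/local/nagios/libexec/check_users -w 3 -c 5
```

With this layout the warning and critical thresholds live in the client's nrpe.cfg rather than in the service definition, so they are changed per host on the client side.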