False alerts


 
# 1  
Old 02-21-2017

Hi,

I have written a script that sends email alerts when the load on my Linux server reaches its maximum.
I keep getting false emails even though the load is normal. It looks like the same email is generated again and again. The script is called from crontab.

I checked whether the temp file is present; it is not. It is cleaned up every time the script runs.

What could be the issue?

Please suggest.

Last edited by rbatte1; 02-22-2017 at 08:53 AM.. Reason: Spelling
# 2  
Old 02-21-2017
Hello,

In order for anyone to have a chance of helping you diagnose this, you'll have to provide more information. Without actually seeing the script, it isn't really possible to say what the problem might be. If you could also provide the full 'crontab' entry that is used to run the script, that would be good too. Without these things, anything anyone says is just going to be pure guesswork, which is best avoided if possible.
These 2 Users Gave Thanks to drysdalk For This Post:
# 3  
Old 02-21-2017
<joke>
"If there's an error, it's probably on line 42." [(c) Hitchhiker's Guide to the Galaxy]
</joke>
This User Gave Thanks to vgersh99 For This Post:
# 4  
Old 02-23-2017
Code:
#!/bin/sh
# Script to send email alerts to mail box if cpu is more than 90% utilization
LIMIT=90
ALERT="monitoringbox@abc.com"
TEMPFILE=/tmp/temp1
HOSTNAME=`hostname`
rm -f $TEMPFILE
CPU_LOAD=`sar -P ALL 10 1 |grep 'Average' |awk -F" " '{print 100.0 -$NF}' |cut -d \. -f1`
if [[ $CPU_LOAD -gt $LIMIT ]];
then
echo "CPU is high on $HOSTNAME " >> $TEMPFILE
fi
if [ -e $TEMPFILE ]
then
mail -s " CPU ALERT  " $ALERT < $TEMPFILE
fi
rm -f $TEMPFILE


The above script works as desired when run from crontab every 5 minutes.
However, I also get false alerts: when I check after an alert, the CPU is at 1%.

My Linux team thinks it is a Nagios issue.
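
One thing worth ruling out in the script above: it declares `#!/bin/sh`, where `[[ ... ]]` is a bash/ksh extension and is not guaranteed to exist, and a `-gt` comparison misbehaves if `$CPU_LOAD` is empty or holds more than one number. The following is only a minimal sketch of a defensive check. `is_over_limit` is a hypothetical helper name, not something from the original script.

```shell
#!/bin/sh
# is_over_limit VALUE LIMIT  (hypothetical helper, for illustration only)
# Succeeds (exit status 0) only when VALUE is a single non-negative integer
# greater than LIMIT. Empty, multi-line, or non-numeric input is rejected.
is_over_limit() {
    case $1 in
        ''|*[!0-9]*) return 1 ;;   # reject anything that is not all digits
    esac
    [ "$1" -gt "$2" ]
}

# Small demonstration with made-up values.
is_over_limit 95 90 && echo "95: alert"
is_over_limit 1 90 || echo "1: no alert"
is_over_limit "100
1" 90 || echo "multi-line value: rejected"
```

With a guard like this, a malformed reading is treated as "no alert" rather than being handed to the comparison as-is. Whether that is the right policy depends on the monitoring setup.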

Last edited by Don Cragun; 02-23-2017 at 08:42 PM.. Reason: Add CODE and ICODE tags.
# 5  
Old 02-23-2017
I don't have sar on the system I use, so it is hard to guess at what might be going on. Furthermore, since you don't preserve the value that is triggering your mail message, we have even less information.

You could reduce the load on your system a little bit, preserve the CPU load value, and get rid of the temp file by changing:
Code:
ALERT="monitoringbox@abc.com"
TEMPFILE=/tmp/temp1
HOSTNAME=`hostname`
rm -f $TEMPFILE
CPU_LOAD=`sar -P ALL 10 1 |grep 'Average' |awk -F" " '{print 100.0 -$NF}' |cut -d \. -f1`
if [[ $CPU_LOAD -gt $LIMIT ]];
then
echo "CPU is high on $HOSTNAME " >> $TEMPFILE
fi
if [ -e $TEMPFILE ]
then
mail -s " CPU ALERT  " $ALERT < $TEMPFILE
fi
rm -f $TEMPFILE

to:
Code:
ALERT="monitoringbox@abc.com"
HOSTNAME=`hostname`
CPU_LOAD=`sar -P ALL 10 1 | awk -F" " '/Average/{print int(100-$NF)}'`
if [[ $CPU_LOAD -gt $LIMIT ]]
then	echo "CPU is high ($CPU_LOAD) on $HOSTNAME" | mail -s " CPU ALERT" "$ALERT"
fi

Please try this and let us know what mail message you get when the CPU load is not high.
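
A further point to check, offered as a guess rather than a diagnosis: `sar -P ALL` reports every individual processor as well as the global figure, so on a multi-CPU host both pipelines above can produce several `Average` values instead of one. If the `Average` section also repeats the column header (an assumption about the output layout, not verified on the poster's system), awk would treat the text `%idle` as 0 and print 100 for that row. The sketch below picks only the `all` row. `cpu_busy_from_sar` is a hypothetical name, and the sample text is made up to stand in for real `sar` output.

```shell
#!/bin/sh
# Sketch: extract one integer "busy %" from sar-style output read on stdin.
# Assumes the summary row looks like "Average:  all  ...  <%idle>", with
# %idle as the last column.
cpu_busy_from_sar() {
    awk '$1 == "Average:" && $2 == "all" { print int(100 - $NF); exit }'
}

# Hypothetical sample standing in for `sar -P ALL 10 1` on a 2-CPU host.
sample='Average:        CPU     %user     %nice   %system   %iowait    %steal     %idle
Average:        all     80.00      0.00      7.50      0.00      0.00     12.50
Average:          0     85.00      0.00      5.00      0.00      0.00     10.00
Average:          1     75.00      0.00     10.00      0.00      0.00     15.00'

# Prints a single number for the "all" row.
printf '%s\n' "$sample" | cpu_busy_from_sar
```

In a real script the function would be fed from the command itself, for example `CPU_LOAD=$(sar -P ALL 10 1 | cpu_busy_from_sar)`.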
# 6  
Old 02-24-2017
Thank you, Don.

Yes, the script I actually use does include the CPU value, hostname, and date in the email.
Sorry, I posted a locally saved copy from when I first started writing it, which was missing the CPU usage value in the email.
The real script calculates and sends the current CPU value when the limit is breached.
I feel that a bad email is saved somewhere and gets triggered once in a while.
I checked /var/spool, root's emails, and other folders.
Is there any other spot to check and fix?
# 7  
Old 02-24-2017
Quote:
Originally Posted by anil529
Thank you, Don.

Yes, the script I actually use does include the CPU value, hostname, and date in the email.
Sorry, I posted a locally saved copy from when I first started writing it, which was missing the CPU usage value in the email.
The real script calculates and sends the current CPU value when the limit is breached.
I feel that a bad email is saved somewhere and gets triggered once in a while.
I checked /var/spool, root's emails, and other folders.
Is there any other spot to check and fix?
OK. So we know that something isn't working correctly. And, we know that the code you showed us is not the code you're using. So, we know that we have absolutely no idea what is going on. Sorry, but given these conditions, I have absolutely no idea what, if anything, needs to be fixed nor where to look -- other than at your actual code.
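
One way to test the "stale email being re-sent" theory from post #6 is to make every message unique, so that a replayed copy can be told apart from a freshly generated one. This is a minimal sketch that assumes nothing about the real script beyond what was posted. `alert_line` is a hypothetical helper, and the `mail` call is left commented out.

```shell
#!/bin/sh
# alert_line LOAD HOST  (hypothetical helper, for illustration only)
# Build a one-line alert carrying the measured value, a timestamp, and the
# PID of the shell that produced it.
alert_line() {
    printf 'CPU is high (%s%%) on %s at %s, pid %s\n' \
        "$1" "$2" "$(date '+%Y-%m-%d %H:%M:%S')" "$$"
}

# Demonstration with a made-up load value.
alert_line 95 "$(uname -n)"

# In the real script the line would be piped to mail, for example:
# alert_line "$CPU_LOAD" "$HOSTNAME" | mail -s "CPU ALERT" "$ALERT"
```

If two alerts arrive with the same timestamp and PID, that points to duplicate delivery. If they differ, the script ran and fired each time, and the value in the message shows what it measured.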
