In that case, if Nagios is already monitoring the load of your server...what is it you're hoping to achieve by running your own separate load monitoring script ? What does it do differently from what the Nagios check does, and would it not be possible to amend the Nagios check to do whatever you want so you only have one single check ?
In that case, if Nagios is already monitoring the load of your server...what is it you're hoping to achieve by running your own separate load monitoring script ? What does it do differently from what the Nagios check does, and would it not be possible to amend the Nagios check to do whatever you want so you only have one single check ?
Hi
we need to get email alerts and monitor our app with out any delay
I doubt a script this simple could cause any problems for Nagios. What monitoring is already configured in Nagios ? Are these load alerts that you regard as false coming as e-mails from your script, or as alerts from Nagios ? And can you please provide the crontab entry so it can be ruled out as a cause ?
Here's a version of your script that's as streamlined as I've been able to make it:
Again in my own local tests this worked fine, but then so did your original. You may need to amend paths to things like sar, awk, etc (it's always a good idea to use fully-qualified paths in scripts that will be run via crontab).
... ... ...
I'd suggest changing this:
to
Also I'd debug what the value of CPU_LOAD is actually at the time of the email being sent.
Another, question.... you're doing this and checking of the existing of a file later:
Sounds like after the FIRST triggered condition, the file will be APPENDED to and any subsequent run of the script will trigger an email.
Don't you want to remove the file AFTER the condition has been triggered?
Hi vgersh99,
Good catch on the -gt versus > test. Without that change, it could miss reporting that the CPU load was 100% (but it still shouldn't have caused any false high load reports). Note that the script we were shown in post #4 didn't have this bug.
Note the code marked in red above. I agree wholeheartedly that the temp file is not needed (and suggested removing it back in post #5 in this thread), but the temp file is removed before it is appended to in the code you're questioning, so that shouldn't have caused any false high load reports either (assuming the code shown to us in post #9 is the actual code being run).
Quote:
Originally Posted by anil529
you don't need a temp file. How about this for the trailing portion of your script:
Quote:
[/QUOTE]
I can change and test
Hi anil529,
Why don't you also make the changes I suggested in post #5 in this thread (where I also proposed getting rid of the temp file) and get rid of two unneeded processes that are adding unneeded load to the system you're trying to monitor?
If you decide to try drysdalk's suggestion instead, at least note that you must change the:
to:
as noted above by vgersh99 to keep from missing reports if the load reaches 100%.
This User Gave Thanks to Don Cragun For This Post:
Hi All
Below code is working as expected but creating too many child processes when the url is not up and every minute that process is sending false email alerts
any help with the logic not to generate child process and not to send duplicate alerts
app="https://url"
appresult=$(wget... (2 Replies)
Hello All,
Nagios seems to be sending false alerts about few hosts, (ex: There were no users on one host and still Nagios was reporting a critical alert and says 6 users are logged in. How do I fix this one?
Also, I have installed nagios and added 12 hosts as a start and monitoring few... (4 Replies)
Checking configuration access files for an AIX server, left me wondering about this :confused::
If a user is added to system group, it gets gid=0 with some security risks because it gets some root kind of file access level.
Is this insecure condition kept if the user has admin variable... (0 Replies)
Why this happens?
How to solve this?
$netstat -na |grep 9325
tcp 0 0 127.0.0.1:9325 127.0.0.1:9325 ESTABLISHED
When a client socket repeatedly tries to connect to an inactive(no server socket is listening on this port) local port,connect succeeds.
... (1 Reply)
Hi All,
I am using the below Script to enter a line in the File:
#!/bin/ksh
# To delete the last line if it contains the pattern Redirect permanent / Virgin Atlantic Airways - Popup
echo "Enter the URL that should point to the particular microsite"
read url
# To delete the last line if it... (0 Replies)
Hi Experts,
I am using Solaris-10, Sun-Fire-V445.
i got often the below message-
"Memory Usage – Critical, Memory usage (RAM) exceeding 90%
The memory utilization is exceeding 90%" in a application running on solaris.
I checked with Vmstat. Everything seems to be fine. Where i should... (5 Replies)
helo
helo I have create user for the group and entry for the user in /etc/passwd file is liek this
bhavin:x:2014:109:test:/home/pds_RBAC:/bin/false
I have keep here /bin/false
now i m accesing user through sftp
ow when i access that user using sftp from the another linux pc
for e.g... (1 Reply)
Hi,
I'm new to UNIX and am trying to learn shell scripting in order to work on an interface that I inherited when a co-worker left. I need to be able to check to see whether a file exists to determine whether the FTP has taken place, but in testing, the if statement always evaluates as false,... (3 Replies)
i want to delete every newline and every line which starts with "RECORD......." in a file.
FILE:
Record 61391 in base BROCKHAUS (Timestamp: 2008-04-09 11:38:38)
UNTERTITEL :
Gräfin (seit 1707 Reichsgräfin) von, * Schwerin 4. 2. 1686, + Berlin
21. 10. 1744; wurde
Record 61392 in base BROCKHAUS... (4 Replies)
We have requirments to not allow a userid login abilities but allow users to 'su' to it. In solaris I normally set the shell in /etc/passwd to bin/false. THis does not work on Linux, any suggestions would help. (1 Reply)