12-02-2009
Monitor server's health
Dear all,
There wasn't any monitoring on our server except on the filesystem.
Therefore, I was wondering anything i should do on a daily basis to check on the server's status, health, hardware, or any other thing as a disaster prevention? Also, what command i should use to do that?
Thanks in advance for reading or replying on my post.
8 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
hi all
i have a script which will write a log file depending upon output of TOP command ( i am on HP-UX ) and some rules to analyze processes if process falls under that rule then log entry in file. i am ready with this part i have perl script which does this for me but i have 3 HP servers and... (1 Reply)
Discussion started by: zedex
1 Replies
2. Infrastructure Monitoring
HI all,
I want to make a webpage showing the status of services running on a server. for example email, http , etc. That way users can see that server status for themselfs. Maybe even show the status of each virtual domain running on the server. Any way to do this without buying some product?
... (4 Replies)
Discussion started by: mcraul
4 Replies
3. Solaris
do anybody has a procedure for daily weekley monthly health check for SUN server with solaris OS?? (5 Replies)
Discussion started by: mm00123
5 Replies
4. Shell Programming and Scripting
I have written little script to check the CPU performance of the machine.
Request you to contribute your comments on the same.
Feel free to add your own scriptlet to make it better.
I have decided to call it as doctortux
I have decided to run the script in two mode
1)Interactive.(Not... (4 Replies)
Discussion started by: pinga123
4 Replies
5. Shell Programming and Scripting
hello there,
can someone please tell me the commands that makes sense, from a production point of view, to be used to make sure CPU, LOAD or IO usages on a Linux or Solaris server isn't too high?
I'm aware of vmstat, iostat, sar. But i seriously need real world advice as to what fields in... (1 Reply)
Discussion started by: SkySmart
1 Replies
6. Solaris
What is the best way to monitor the health of solaris zones? Through the global zone or through the individual zones itself ? (5 Replies)
Discussion started by: CuriousDev
5 Replies
7. Red Hat
Hi, Earlier we used to reboot servers based on adhoc request, never checked anything on it pre-reboot., But now i need to reboot regularly but most of the info is not available, I need to know want to make sure that server to be rebooted without any issues, so I want to do few prechecks which will... (5 Replies)
Discussion started by: nanz143
5 Replies
8. UNIX for Beginners Questions & Answers
I have two files to be compared to get the output of the differences.
File1 has a lot more lists than File2.
After searching a lot on this thread I'am unable to find the exact code that im willing to get.
This will be used as 'pre-check'/post-check utility (health check Tool) to compare... (1 Reply)
Discussion started by: GeekyJimmy
1 Replies
LEARN ABOUT DEBIAN
ocf_pacemaker_healthsmart
OCF_PACEMAKER_HEALTH(7) Pacemaker Configuration OCF_PACEMAKER_HEALTH(7)
NAME
ocf_pacemaker_HealthSMART - SMART health status
SYNOPSIS
OCF_RESKEY_state=string OCF_RESKEY_drives=string OCF_RESKEY_devices=string OCF_RESKEY_temp_lower_limit=string
OCF_RESKEY_temp_upper_limit=string OCF_RESKEY_temp_warning=string
HealthSMART [start | stop | monitor | meta-data | validate-all]
DESCRIPTION
Systhem health agent that checks the S.M.A.R.T. status of the given drives and updates the #health-smart attribute.
SUPPORTED PARAMETERS
OCF_RESKEY_state = string [/HealthSMART-{OCF_RESOURCE_INSTANCE}.state]
State file
Location to store the resource state in.
OCF_RESKEY_drives = string [/dev/sda]
Drives to check
The drive(s) to check as a SPACE separated list. Enter the full path to the device, e.g. "/dev/sda".
OCF_RESKEY_devices = string
Device types
The device type(s) to assume for the drive(s) being tested as a SPACE separated list.
OCF_RESKEY_temp_lower_limit = string [0]
Lower limit for the red smart attribute
Lower limit of the temperature in deg C of the drive(s). Below this limit the status will be red.
OCF_RESKEY_temp_upper_limit = string [60]
Upper limit for red smart attribute
Upper limit of the temperature if deg C of the drives(s). If the drive reports a temperature higher than this value the status of
#health-smart will be red.
OCF_RESKEY_temp_warning = string [5]
Deg C below/above the upper limits for yellow smart attribute
Number of deg C below/above the upper/lower temp limits at which point the status of #health-smart will change to yellow.
AUTHOR
Andrew Beekhof <andrew@beekhof.net>
Author.
Pacemaker Configuration 04/17/2012 OCF_PACEMAKER_HEALTH(7)