Prediction of failures


 
# 1  
Old 08-05-2009
Prediction of failures

Is there a diagnostic tool that can run predictive checks on Sun hard disks before they fail, as a preventive measure? In other words, is there a tool that can identify disks that are failing or will fail soon on Sun servers?
# 2  
Old 08-05-2009
SunVTS is the tool that's supposed to do this. It should do a stress test on the machine.
# 3  
Old 08-05-2009
SunVTS is a tool designed to validate hardware components against Solaris. It can be used to stress components, but that wouldn't be good practice.

A better-suited tool would be the Solaris Fault Manager (a.k.a. Predictive Self Healing), which is designed precisely to check components before they fail.

Have a look at this blog for an example related to disks.

Bob Netherton's Weblog
# 4  
Old 08-07-2009
It's not a new machine. It's a production server, so we probably can't run VTS, as it would put too much stress on the server. Are there any better options?
# 5  
Old 08-07-2009
@ Incredible

But I think stress tests are the only way to predict hardware failures.

Other than that, you need to monitor your error messages and hope that the hardware itself (preferably with up-to-date firmware) reports predictive failures early.
# 6  
Old 08-07-2009
It looks like both of you overlooked the second part of my previous reply. The tools you are looking for already exist and are included with Solaris.

Some more links:

Solaris Fault Manager (Solaris 10 What's New) - Sun Microsystems
Getting notified when hardware breaks
SCSI DISK FMA Project Part 1: SCSI Device Drivers as FMA Telemetry Detectors
# 7  
Old 08-07-2009
Commands: fmstat, fmadm
Logs: /var/fm/fmd

Solaris 10 now has a lot of background health monitoring, which reports to the above.

SBK
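
For readers who want to try the commands mentioned above, here is a minimal sketch of a shell function that prints a quick Fault Manager summary. The function name `fm_summary` and the section headings are made up for illustration. It assumes the tools live in /usr/sbin, as they do on a standard Solaris 10 install, and that it is run as root. `fmdump` is not named in the thread; it is the companion command that reads the logs kept under /var/fm/fmd.

```shell
# Sketch only: print a quick Fault Manager summary (hypothetical helper).
# Assumes Solaris 10 or later with the FMA tools in /usr/sbin; run as root.
fm_summary() {
    if [ ! -x /usr/sbin/fmadm ]; then
        echo "fmadm not found: Fault Manager tools are not available on this host"
        return 1
    fi

    echo "== resources currently diagnosed as faulty =="
    /usr/sbin/fmadm faulty

    echo "== fmd module statistics =="
    /usr/sbin/fmstat

    echo "== fault log (diagnosed problems) =="
    /usr/sbin/fmdump

    echo "== error log (raw telemetry events) =="
    /usr/sbin/fmdump -e
}
```

Calling `fm_summary` from cron and mailing the output is one low-impact way to watch a production server without running a stress test. On a healthy system the faulty-resource section should list nothing of concern.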