Visit Our UNIX and Linux User Community


Prediction of failures


 
Thread Tools Search this Thread
Operating Systems Solaris Prediction of failures
# 8  
Old 08-07-2009
Interesting...I'll be looking at these tools. Thanks guys !
# 9  
Old 08-07-2009
Quote:
Originally Posted by jlliagre
It looks like both of you overlook the second part of my previous reply. The tools you are looking for already exist and are included with Solaris.

Some more links:

Solaris Fault Manager (Solaris 10 What's New) - Sun Microsystems
Getting notified when hardware breaks
SCSI DISK FMA Project Part 1: SCSI Device Drivers as FMA Telemetry Detectors
You still dont get my point. I want prevention rather than reactive action after things happen.
# 10  
Old 08-07-2009
You are still missing mine. Unless you expect a crystal ball to predict what will happen in the future with currently healthy components, the only reasonable way to prevent their future faults is by monitoring events coming from them. This is what SMF is designed to do.

Alternatively, if your goal is really to react to something that hasn't happened yet, you can pro-actively replace each disk after a period of use significantly smaller than its MTBF.

If you just care about your data, use something like RAIDZ2 with hot spares. Your system will happily survive two disks crashing at the same time and will automatically replace them by the spares.
# 11  
Old 08-07-2009
yes I agree, there's no magic way to really predict each and every hardware failure.

If the data is so critical, then you should invest more in redundancy and HA, and RAS.
# 12  
Old 08-09-2009
Thanks for your valuable feedback.
# 13  
Old 08-10-2009
One more question:


After finishing installation, customer asked me to do "Network stress test" ...

any ideas ?

Previous Thread | Next Thread
Test Your Knowledge in Computers #577
Difficulty: Medium
If a language offers automated memory management, it is not possible to have memory leaks and other memory allocation issues.
True or False?

4 More Discussions You Might Find Interesting

1. Solaris

11.0 to 11.2 update failures

Attempting to update an 11.0 server with many non-global zones installed. pkg publisher is pkg.oracle.com/solaris/support. FMRI = pkg://solaris/entire@0.5.11,5.11-0.175.1.15.0.4.0:20131230T203500Z When we run pkg update --accept the server contacts oracle, checks packages, finds about 700... (4 Replies)
Discussion started by: CptCarrot
4 Replies

2. Post Here to Contact Site Administrators and Moderators

Event Prediction - Euro 2012

Please add this new "event". (10 Replies)
Discussion started by: ni2
10 Replies

3. Post Here to Contact Site Administrators and Moderators

Event Prediction - New Sports Events

Hi, Some sports predictions suggestions. Although the Celtics are still playing the Heat. Just being optimistic that they will win. Otherwise, please change to Heat. (9 Replies)
Discussion started by: ni2
9 Replies

4. HP-UX

Communication Failures

HI ALL, I have been trying to install a particular software using remote linux server. some thing like this: rsh <host ID> /usr/sbin/swinstall -x autoreboot=true -s /tmp/<software> <Product name>. The problem is whenever I try to install the product through a shell script the installation... (1 Reply)
Discussion started by: barun agarwal
1 Replies

Featured Tech Videos