09-05-2008
When I run into a hardware problem that I've not experienced before, I generally run SunVTS, then Explorer, then check SunSolve and Google. If the host is still under a service contract, I call Sun.
Install SunVTS and see what it says. Be prepared to let it run for a few hours. It impacts system performance, so don't serve anything during testing. Don't be surprised if it doesn't find a problem right away: it can run for a couple of days before it hits on anything.
If you have the ability to monitor the power coming into the host, use something that shows spikes (+/-) in power. That's a likely cause of this sort of error.
Then, on a separate host, capture the console output of the problem host:
- start a terminal session.
- type: script /somewhere/date.problem_hostname.capture
- telnet into the console on the problem host
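As a rough sketch of the capture steps above, a small helper can build the file name from the date and the problem host's name. The function name `capture_name`, the directory, and the host name are placeholders I made up for illustration; substitute your own.

```shell
#!/bin/sh
# Build a capture file name of the form <dir>/<date>.<hostname>.capture.
# capture_name, the directory, and the host name are illustrative
# assumptions, not values from the original post.
capture_name() {
    dir="$1"
    host="$2"
    echo "$dir/$(date '+%Y-%m-%d').$host.capture"
}

# Example, run by hand on the separate host:
#   script "$(capture_name /var/tmp problemhost)"
#   telnet <console address of the problem host>
#   ... leave the session open; type "exit" to end the capture ...
```

Everything the console prints while the session is open ends up in the capture file, including whatever the host says as it goes down.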
Ideally, you'll write a small shell script to run prtdiag -v and dmesg every 3 minutes or so. If you have utilities that you like, include them in the script. The next time the server crashes, let it complete its power cycle and then see whether any new and interesting errors show up. Compare your prtdiag outputs from the hours leading up to the crash and look for drastic changes in temperature, etc.
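A minimal sketch of such a script might look like the following. The log directory, the 180-second interval, the `RUN_LOOP` switch, and the function name `take_snapshot` are all assumptions of mine, so adjust them to taste. On some Solaris releases prtdiag lives under /usr/platform rather than on the default PATH, so you may need to call it by its full path.

```shell
#!/bin/sh
# Hypothetical monitoring loop: LOGDIR, INTERVAL, RUN_LOOP, and
# take_snapshot are illustrative names, not from the original post.
LOGDIR="${LOGDIR:-/var/tmp/hwmon}"   # assumed location for snapshots
INTERVAL="${INTERVAL:-180}"          # seconds, i.e. every 3 minutes or so

# Write one timestamped snapshot of "prtdiag -v" and "dmesg" into the
# directory given as $1, then print the name of the file written.
take_snapshot() {
    dir="$1"
    stamp=$(date '+%Y%m%d-%H%M%S')
    out="$dir/snapshot.$stamp"
    mkdir -p "$dir" || return 1
    {
        echo "=== $stamp prtdiag -v ==="
        # Record a non-zero exit status instead of aborting the snapshot.
        prtdiag -v 2>&1 || echo "(prtdiag exited with status $?)"
        echo "=== $stamp dmesg ==="
        dmesg 2>&1 || echo "(dmesg exited with status $?)"
        # Add any other utilities you like here.
    } > "$out"
    echo "$out"
}

# Loop only when explicitly asked to, so the function can also be
# sourced and tried by hand first.
if [ "${RUN_LOOP:-0}" = "1" ]; then
    while :; do
        take_snapshot "$LOGDIR" >/dev/null
        sleep "$INTERVAL"
    done
fi
```

Start it with something like `RUN_LOOP=1 sh hwmon.sh &` (the file name is yours to pick). After a crash, run `diff` on two snapshot files from before the event to see what changed between them.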
Good luck!