Need ideas what is wrong with v440


 
Thread Tools Search this Thread
Operating Systems Solaris Need ideas what is wrong with v440
# 1  
Old 03-08-2010
Need ideas what is wrong with v440

Hello everyone,

I'm here today looking for help...

I have a SunFire v440 running Solaris 9 at work that manages the SCSI tape drives(LTO3) in our L700 tape library (runs SAMFS).

I went into work today and found the machine was ping-able but I could not connect to it via SSH and could not get video on the KVM. I also noticed our other SAMFS server could not connect back to it anymore.

I attached my laptop to the serial ALOM and attached to the console.... NOTHING!

So, I exited the console (.#) and issued a 'poweroff' command; sure enough the server powered off.

However, when I issued 'poweron' and attached to the console again I noticed the machine is stuck in a reboot loop. It POSTS but when it attempts to boot then it just resets. I can see video on the KVM for a short period of time between the end of the post and the begining of the boot process.

I then turned the servers key to diagnostic mode and issued the poweron command again. When I attach to the console I can see that all the POST tests pass but once again when the system attempts to boot it hangs for a while then quickly stack dumps and resets. I was able to Stop-A and and get into the OpenBoot menu. The system has 4 internal drives. disk0 and disk1 are SVM RAID (mirror). I noticed the LED on disk0 is not illuminating but still no fault lights. When I check the boot device it is configured as "disk0 disk1". I changed the boot device to "disk1", committed the NVRAM changes, and issued a 'boot' command--SAME thing.... stacks dumps and resets.

Mind you this is all very preliminary diagnosis (from 3pm to 4:30pm today). Although this is a production box it can go a day or two being down without any serious problems.

At this point I'm thinking data corruption. I plan on booting off the Solaris 9 CD and fsck the volume.

Does this seem appropriate? Should I FLAR the drive before just to be safe (if it is readable)?

Any thing else I may not be thinking of?

Any help is greatly appreciated!

Thanks!
# 2  
Old 03-08-2010
What is the output:
Code:
boot -v

# 3  
Old 03-08-2010
The last time my Sun Fire v490 went to a loop without booting(post POST) ,i had to replace the System Board though there were no hardware issues posted during diagnostics mode...I hope the server is on support.

HG
# 4  
Old 03-09-2010
Plug out your disks, THEN power on, the system will drop to ok> prompt. setenv auto-boot? false then do a reset and do MAX POST test/obdiag to see what errors you get.
# 5  
Old 03-09-2010
@honglus: Ok, boot -v provided a more verbose boot up but still crash dumps and reboots. I'll try this again now that auto-boot? is false.

@Hari_Ganesh: As for coverage.... no! My company, against my suggestions, let the contract drop. And, I was just told that Oracle (according to my Sun FSE) will not do T&M anymore.

@incredible: I unplugged all drives.... got to the ok> prompt, tehn set auto-boot? to false, saved the nvram, reset the machine, and this time ran obdiag. Within obdiag I setenv diag-level max and then did a test-all. Everything passes!

---------- Post updated at 09:14 AM ---------- Previous update was at 08:55 AM ----------

I was able to see the error message before the stack dump and reset.

The error appears to be: Illegal major device number

I think the system is having a problem initializing the SCSI tape drive or the SCSI card itself. Although it doesn't list any errors it appears to be very slow during boot while initializing PCI/SCSI2. PCI/SCSI1 does not appear to have the same problem.

---------- Post updated at 09:25 AM ---------- Previous update was at 09:14 AM ----------

Ok, illegal major device number appears to be a SVM mirror issue so I'm back to square one.

Re: SVM mirror not working , illegal major device number...

I'll let you all know how this pans out. Thanks for the suggestions. As always, more suggestions are welcome--I am by no means a SVM expert
# 6  
Old 03-09-2010
Remove the suspected card, or with minimal configuration, reboot the system.
If possible,plug in the boot disk and reboot to single user (as a raw disk) after doing so. If it goes to single user without ANY errors, you may Ctrl+D to proceed to up it in multi user mode. If got errors, let us know
# 7  
Old 03-09-2010
@incredible: I'm sure it is a disk problem at this point. I booted to the Solaris 9 CD and ran format. Format cannot even stat disk0 (c0t0d0). I tried an fsck against /dev/dsk/c0t0d0s0 and it failed (no surprise there). So, I ran a fsck -m against /dev/dsk/c1t1d0s0 (SVM mirror) and it stated the drive needed to be fscked. So I've run fsck twice so far against /dev/dsk/c1t1d0s0 and it found a bunch of errors. Once fsck runs with no errors (probably after this run) I'll try mounting the volume to see what kind of carnage I'm dealing with.

With a little luck, I should be able to get it booting off disk1 and rebuild disk0 with a new disk.

I'll keep you guys updated
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Solaris

Solaris V440 keeps crashing

Hi, I have Sun Solaris V440, every two days or so, the system crashes and it's at OK prompt. After that I do the FSCK and clear all the bad sectors. Then system stays up for close to 48 hours. Then it crashes again. I checked the logs and I don't see anything under /var/adm/messages file. ... (1 Reply)
Discussion started by: samnyc
1 Replies

2. Shell Programming and Scripting

Why result is wrong here ? whether break statement is wrong ?

Hi ! all I am just trying to check range in my datafile pls tell me why its resulting wrong admin@IEEE:~/Desktop$ cat test.txt 0 28.4 5 28.4 10 28.4 15 28.5 20 28.5 25 28.6 30 28.6 35 28.7 40 28.7 45 28.7 50 28.8 55 28.8 60 28.8 65 28.1... (2 Replies)
Discussion started by: Akshay Hegde
2 Replies

3. UNIX and Linux Applications

Need ideas for graduation project based on unix or linux Need ideas for graduation project based on

Dear all, i am in last year of electronics department in engineering faculty i need suggestions for a graduation project based on unix or free bsd or linux and electronics "embedded linux " i think about embedded unix for example or device drivers please i need helps (1 Reply)
Discussion started by: MOHA-1
1 Replies

4. Solaris

available PCI slots on v440

How do I find any available PCI slots on a v440? When I run prtconf, I get the following output? But I am not able to make out whether all the PCI slots are used. ================================= IO Devices ================================= Bus Freq Brd Type MHz Slot ... (1 Reply)
Discussion started by: jtamminen
1 Replies

5. Solaris

Reset password and login SC at SF V440

Dear All, I Have server SF V440, OS solaris is down and I must check logs at Sc, but I forgot my login and password. How to reset login and password at SF V440? Thanks Best Regards Jiman (3 Replies)
Discussion started by: mbah_jiman
3 Replies

6. Solaris

V210 to V440

I'm currently trying to move a perfectly find harddrive from a V210 to a V440. From what I can tell, the disk labeling a bit different, (V210 is c1t0d0 and V440 is c0t0d0). My question is, what all do I have to change to get the V440 to boot off of this with very little complications. Right now, it... (21 Replies)
Discussion started by: adelsin
21 Replies

7. Solaris

V210 to V440

I recently acquired a server for home use. Currently, I'm running a V210. I was wondering of a way to basically swap hard drives into the V440. I would like to avoid reinstall on the V440 for many reasons. Currently on the V440, when I try to boot up it forces itself into System Maintenance... (1 Reply)
Discussion started by: adelsin
1 Replies

8. Solaris

SUN V440 Question

I have a SUN V440 and it is running Solaris 9 with two NICs in the box ce0 ce1. Is there any way to get full duplex 1000? I do I check to see what ce0 and ce1 is set at and if it is not set to full duplex 1000, how do I set it and do I have to reboot the system afterward? I am new to the SUN unix... (5 Replies)
Discussion started by: pmwayne01
5 Replies

9. Solaris

video card for a V440

I am looking into getting some cards for the SunFire V440. Not sure how most folks go about this but I am assuming we can find compatible cards for this server platform without going through SUN. Do you guys recommend getting this video card from SUN? Yeah we got these servers without the video... (5 Replies)
Discussion started by: bluridge
5 Replies

10. Solaris

Can't get ok prompt on V440

Hello all, This is probably a commonly asked question, but it has me stumped. I have a V440 which upon bootup gives me the option of getting into ALOM with #. I don't have the ALOM password so I want to re-install Solaris 10. When connected to the Management serial interface (on the PCI... (3 Replies)
Discussion started by: juanj
3 Replies
Login or Register to Ask a Question