How can we confirm that, the disk has failed in Solaris?


 
Thread Tools Search this Thread
Operating Systems Solaris How can we confirm that, the disk has failed in Solaris?
# 1  
Old 07-18-2009
How can we confirm that, the disk has failed in Solaris?

Hi All,

Seems to be one of the disk has failed on my Solaris server.

How do i confirm that disk has really failed or not?

Here are alert details.

-------
Code:
iostat -En out/put

c1t3d0          Soft Errors: 1884 Hard Errors: 153 Transport Errors: 54
Vendor: FUJITSU  Product: MAW3073NCSUN72G  Revision: 1703 Serial No: 0618B0DR30
Size: 73.40GB <73400057856 bytes>
Media Error: 144 Device Not Ready: 0 No Device: 9 Recoverable: 1884
Illegal Request: 0 Predictive Failure Analysis: 9


dmesg o/p


Jul 17 21:42:40 fiesnolsp03 md_stripe: [ID 641072 kern.warning] WARNING: md: d41: read error on /dev/dsk/c1t3d0s3
Jul 17 21:42:40 fiesnolsp03 md_mirror: [ID 104909 kern.warning] WARNING: md: d41: /dev/dsk/c1t3d0s3 needs maintenance


metastat o/p

Code:
# metastat
d4: Mirror
    Submirror 0: d40
      State: Okay
    Submirror 1: d41
      State: Needs maintenance
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 142606464 blocks (68 GB)

d40: Submirror of d4
    State: Okay
    Size: 142606464 blocks (68 GB)
    Stripe 0:
        Device     Start Block  Dbase        State Reloc Hot Spare
        c1t2d0s3          0     No            Okay   Yes


d41: Submirror of d4
    State: Needs maintenance
    Invoke: metareplace d4 c1t3d0s3 <new device>
    Size: 142606464 blocks (68 GB)
    Stripe 0:
        Device     Start Block  Dbase        State Reloc Hot Spare
        c1t3d0s3          0     No     Maintenance   Yes


d1: Mirror
    Submirror 0: d20
      State: Okay
    Submirror 1: d21
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 51205632 blocks (24 GB)

d20: Submirror of d1
    State: Okay
    Size: 51205632 blocks (24 GB)
    Stripe 0:
        Device     Start Block  Dbase        State Reloc Hot Spare
        c1t0d0s1          0     No            Okay   Yes


d21: Submirror of d1
    State: Okay
    Size: 51205632 blocks (24 GB)
    Stripe 0:
        Device     Start Block  Dbase        State Reloc Hot Spare
        c1t1d0s1          0     No            Okay   Yes


d0: Mirror
    Submirror 0: d10
      State: Okay
    Submirror 1: d11
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 91950336 blocks (43 GB)

d10: Submirror of d0
    State: Okay
    Size: 91950336 blocks (43 GB)
    Stripe 0:
        Device     Start Block  Dbase        State Reloc Hot Spare
        c1t0d0s0          0     No            Okay   Yes


d11: Submirror of d0
    State: Okay
    Size: 91950336 blocks (43 GB)
    Stripe 0:
        Device     Start Block  Dbase        State Reloc Hot Spare
        c1t1d0s0          0     No            Okay   Yes


Device Relocation Information:
Device   Reloc  Device ID
c1t3d0   Yes    id1,sd@SFUJITSU_MAW3073NCSUN72G_000618B0DR30____DAN0P650DR30
c1t2d0   Yes    id1,sd@SFUJITSU_MAW3073NCSUN72G_000618B0DS30____DAN0P650DS30
c1t1d0   Yes    id1,sd@SFUJITSU_MAW3073NCSUN72G_000617B0DJBG____DAN0P640DJBG
c1t0d0   Yes    id1,sd@SFUJITSU_MAW3073NCSUN72G_000617B0DJBT____DAN0P640DJBT
#



Code:
# format
Searching for disks...done


AVAILABLE DISK SELECTIONS:
       0. c1t0d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@1f,700000/scsi@2/sd@0,0
       1. c1t1d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@1f,700000/scsi@2/sd@1,0
       2. c1t2d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@1f,700000/scsi@2/sd@2,0
       3. c1t3d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@1f,700000/scsi@2/sd@3,0
Specify disk (enter its number):

------------------------------------

Please help me on this case. Thanks in advance.

Regards,
Naresh

Last edited by DukeNuke2; 07-18-2009 at 02:03 PM.. Reason: added code tags
# 2  
Old 07-18-2009
Does the drive leave you disappointed in its performance? Are you constantly trying to hide your drive from your friends because you are embarrassed to be seen with it? Do you go to sleep every night crying because of the empty feeling your drive leaves you inside? If so then yes, your drive has failed you.

Also, according to the errors that drive really is failing so go replace it. We'll all feel better.
# 3  
Old 07-18-2009
To keep the forums high quality for all users, please take the time to format your posts correctly.

First of all, use Code Tags when you post any code or data samples so others can easily read your code. You can easily do this by highlighting your code and then clicking on the # in the editing menu. (You can also type code tags [code] and [/code] by hand.)

Second, avoid adding color or different fonts and font size to your posts. Selective use of color to highlight a single word or phrase can be useful at times, but using color, in general, makes the forums harder to read, especially bright colors like red.

Third, be careful when you cut-and-paste, edit any odd characters and make sure all links are working property.

Thank You.

The UNIX and Linux Forums
# 4  
Old 07-18-2009
hi,
i dont think its good to make quick conclusion.
of course its terrible that you have hard errors, but anyway try first
repair deice in SVM, i mean force it for resync to the same device.
metareplace -e dx cxtxdxsx .. see man metareplace.
if it doesnt help try to repair, analyze at least in #format tool of solaris.

Good luck.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Solaris

Replacing a failed disk using SVM

Hi Please can you help me on replacing or removing a faulty disk drive on a SUN NETRA X4250server with 4 internal drives only. the format comand show me the following: format Searching for disks...done AVAILABLE DISK SELECTIONS: 0. c0t0d0 <drive type unknown> ... (9 Replies)
Discussion started by: fretagi
9 Replies

2. AIX

Failed Disk,unable to varyonvg

Iam new to AIX I am unable varyonvg on the vios,some of the disk are reported as harware failure. I am going to replace the disk,Please tell me how to remove the disk without any corruption in the VG lspv -l hdiskx also didn't work (0 Replies)
Discussion started by: roayaix
0 Replies

3. Solaris

Recover failed system disk

I have an oldish Solaris 10 system (SunFire x4240), which due to a recent heating event in the server room, lost it's system disk. I have rsync backups of all the other (data) disks, but apparently I do not have a backup of /. :-( I can start the machine up in failsafe mode, but running fsck on... (4 Replies)
Discussion started by: jj5406
4 Replies

4. Red Hat

Find out the failed disk

Hi Masters, How to find out the failed disk in redhat linux (OR) How to decalare whether our hard disk goes bad ? My option is: dmesg | grep "failed" check in /var/log/messages with the kernel error. Please corrrect me if im wrong. And also i heard "smartctl" command. But it's... (2 Replies)
Discussion started by: abhay1983
2 Replies

5. UNIX for Advanced & Expert Users

read sectors from disk failed with timeout

i'm writing some code to simulate the boot progress after power on but when i try to read the 2nd sector from a floppy disk, this operation always fail with ah=0x80h which means timeout, how can i get over this problem? my code would be like this: $ cat boot.S .code16 #define SETUPLEN 4... (0 Replies)
Discussion started by: wljackhero
0 Replies

6. Linux

read sectors from disk failed with timeout

i'm writing some code to simulate the boot progress after power on but when i try to read the 2nd sector from a floppy disk, this operation always fail with ah=0x80h which means timeout, how can i get over this problem? my code would be like this: $ cat boot.S .code16 #define SETUPLEN 4... (0 Replies)
Discussion started by: wljackhero
0 Replies

7. Boot Loaders

read sectors from disk failed with timeout

i'm writing some code to simulate the boot progress after power on but when i try to read the 2nd sector from a floppy disk, this operation always fail with ah=0x80h which means timeout, how can i get over this problem? my code would be like this: $ cat boot.S .code16 #define SETUPLEN 4... (0 Replies)
Discussion started by: wljackhero
0 Replies

8. Solaris

Cannot remove and replace failed disk

-bash-3.00# zpool detach zonepool c1t1d0 cannot detach c1t1d0: only applicable to mirror and replacing vdevs -bash-3.00# zpool remove zonepool c1t1d0 cannot remove c1t1d0: only inactive hot spares, cache, top-level, or log devices can be removed -bash-3.00# zpool offline zonepool c1t1d0... (3 Replies)
Discussion started by: LittleLebowski
3 Replies

9. Solaris

How to replace failed disk?

Dear all Please can any one explain me how to replace failed disk in Solaris 10. Please tell me the step by step procedure. (9 Replies)
Discussion started by: suneelieg
9 Replies

10. UNIX for Advanced & Expert Users

Scsi Disk Failed

My WS boot disk has failed so when i want to boot system (by OK boot -s or Ok boot disk0 ) i get these mesgs: disk read error boot :can not find misc/sparc9v/krtld boot:error loading interperetor (misc/sparc9v/krtld) Elf64 read error. boot failed. although it's probed ,finally i try... (1 Reply)
Discussion started by: nikk
1 Replies
Login or Register to Ask a Question