What should I do with this metastat result? metareplace everything?


 
# 1  
Old 12-11-2008

I have to rescue the volumes on a Solaris 7 machine. metastat returns the following result:


Quote:
d0: Mirror
Submirror 0: d10
State: Needs maintenance
Submirror 1: d20
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 4198392 blocks

d10: Submirror of d0
State: Needs maintenance
Invoke: metareplace d0 c0t0d0s0 <new device>
Size: 4198392 blocks
Stripe 0:
Device Start Block Dbase State Hot Spare
c0t0d0s0 0 No Maintenance

d20: Submirror of d0
State: Okay
Size: 4198392 blocks
Stripe 0:
Device Start Block Dbase State Hot Spare
c0t1d0s0 0 No Okay

d1: Mirror
Submirror 0: d11
State: Needs maintenance
Submirror 1: d21
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 2101552 blocks

d11: Submirror of d1
State: Needs maintenance
Invoke: metareplace d1 c0t0d0s1 <new device>
Size: 2101552 blocks
Stripe 0:
Device Start Block Dbase State Hot Spare
c0t0d0s1 0 No Maintenance

d21: Submirror of d1
State: Okay
Size: 2101552 blocks
Stripe 0:
Device Start Block Dbase State Hot Spare
c0t1d0s1 0 No Okay

d6: Mirror
Submirror 0: d16
State: Needs maintenance
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 12288896 blocks

d16: Submirror of d6
State: Needs maintenance
Invoke: after replacing "Maintenance" components:
metareplace d6 c0t0d0s6 <new device>
Size: 12288896 blocks
Stripe 0:
Device Start Block Dbase State Hot Spare
c0t0d0s6 0 No Last Erred

d7: Mirror
Submirror 0: d17
State: Needs maintenance
Submirror 1: d27
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 16765296 blocks

d17: Submirror of d7
State: Needs maintenance
Invoke: metareplace d7 c0t0d0s7 <new device>
Size: 16765296 blocks
Stripe 0:
Device Start Block Dbase State Hot Spare
c0t0d0s7 0 No Maintenance

d27: Submirror of d7
State: Okay
Size: 16765296 blocks
Stripe 0:
Device Start Block Dbase State Hot Spare
c0t1d0s7 0 No Okay

d26: Concat/Stripe
Size: 12288896 blocks
Stripe 0:
Device Start Block Dbase
c0t1d0s6 0 No
d6 is dying, and iostat -nE reports 3 hard errors and 9 transport errors for c0t0d0.

Can someone offer me some help? Thank you very much.

Last edited by nickychung; 12-11-2008 at 12:28 AM..
# 2  
Old 12-11-2008
Do you know what triggered these problems in the first place? It looks like c0t0d0 had some kind of failure... is it working again now?

Try using the non-destructive read tests in format to see if you can read the entire c0t0d0 device.

If it seems to be working okay again, you should be able to use metareplace -e d0 c0t0d0s0, for example, to replace the disk with itself (and repeat for the other mirrors of course). Otherwise you may need to replace that disk first... probably a good idea to have a spare disk handy anyway if it is indeed beginning to fail.
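
To save typing the command once per mirror, here is a rough sketch that reads metastat output on stdin and prints the matching metareplace -e lines. It only prints commands and runs nothing. The function name suggest_reenable is made up for illustration (it is not a DiskSuite tool), and the sketch assumes the component rows look like the ones quoted in post #1 and that a POSIX awk is available (nawk on Solaris). Maintenance components are listed before Last Erred ones, because the metastat output itself asks for that order.

```shell
# Sketch: read `metastat` output on stdin and print one
# `metareplace -e` command per component that is not "Okay".
# suggest_reenable is a hypothetical helper name, not a DiskSuite tool.
suggest_reenable() {
    awk '
        # "d10: Submirror of d0" -> remember the parent mirror (d0)
        /: Submirror of / { mirror = $NF; next }
        # any other "dNN: ..." header ends the submirror context
        /^[ \t]*d[0-9]+: / { mirror = ""; next }
        # component rows start with a cXtYdZsN slice name
        mirror != "" && $1 ~ /^c[0-9]+t[0-9]+d[0-9]+s[0-9]+$/ {
            if ($0 ~ /Maintenance/)
                maint[++m] = "metareplace -e " mirror " " $1
            else if ($0 ~ /Last Erred/)
                last[++l] = "metareplace -e " mirror " " $1
        }
        END {
            # Maintenance components first, Last Erred ones afterwards
            for (i = 1; i <= m; i++) print maint[i]
            for (i = 1; i <= l; i++) print last[i]
        }
    '
}
```

Something like `metastat | suggest_reenable` would then list the candidates. Review them, and check that the disk is actually readable, before running any of them.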
# 3  
Old 12-16-2008
Keep an eye on the hard errors. If these increase, you'll most probably need to replace the disk.

Check out the format command and see if you can write a label to your disk (you won't lose any data). If this fails, it's a good indicator that you've got a disk which is failing.

If the hard errors are not increasing, you could just run metareplace against the existing slice and see if this brings the mirrors back online (I have run into this issue before), e.g. metareplace -e d0 c0t0d0s0.
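
One way to keep an eye on the counters is to boil the iostat -nE output down to one line per device and compare snapshots over time. This is only a sketch: hard_errors is a made-up function name, and it assumes the header rows have the "Soft Errors: N Hard Errors: N Transport Errors: N" layout and that a POSIX awk is available.

```shell
# Sketch: summarise per-device error counters from `iostat -nE` on stdin.
# hard_errors is a hypothetical helper name.
hard_errors() {
    awk '
        # Header rows look like:
        #   c0t0d0 Soft Errors: 0 Hard Errors: 3 Transport Errors: 9
        $2 == "Soft" && $3 == "Errors:" {
            print $1, "hard=" $7, "transport=" $10
        }
    '
}
```

Running `iostat -nE | hard_errors` now and again a few hours later, then comparing the two outputs, shows whether any count has gone up.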
# 4  
Old 12-16-2008
The hard error count has not increased.
However, format cannot recognize c0t0d0.

And metareplace -e d0 c0t0d0s0 returns I/O errors.

Is it possible to bring mirror d0 back online with c0t1d0s0?

metastat -p returns

Quote:
d0 -m d10 d20 1
d10 1 1 c0t0d0s0
d20 1 1 c0t1d0s0
d1 -m d11 d21 1
d11 1 1 c0t0d0s1
d21 1 1 c0t1d0s1
d6 -m d16 1
d16 1 1 c0t0d0s6
d7 -m d17 d27 1
d17 1 1 c0t0d0s7
d27 1 1 c0t1d0s7
d26 1 1 c0t1d0s6
# 5  
Old 12-16-2008
Just because it "needs maintenance" doesn't mean it is offline. If c0t1d0s0 is still working fine, then you should be able to access d0 fine... is it not working?
# 6  
Old 12-17-2008
Yes, I am still able to access d0 but not d6. d0 is online. d6 is dead.

But metareplace just can't help me to fix the "needs maintenance" problem.

iostat -nE returns:

Quote:
c0t0d0 Soft Errors: 0 Hard Errors: 3 Transport Errors: 9
Vendor: SEAGATE Product: ST318203LSUN18G Revision: 034A Serial No: LRD9164600001037
Size: 18.11GB <18110967808 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 3 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
c0t1d0 Soft Errors: 0 Hard Errors: 4 Transport Errors: 2
Vendor: FUJITSU Product: MAG3182L SUN18G Revision: 1111 Serial No: 02542399
Size: 18.11GB <18110967808 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 4 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
c0t6d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: TOSHIBA Product: XM6201TASUN32XCD Revision: 1103 Serial No: 12/12/97
Size: 18446744073.71GB <-1 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
# 7  
Old 12-17-2008
That makes sense, because for some reason d26 is no longer associated with d6. I can't advise you on a safe method to reattach it to d6 because I don't have access to a Solaris system using SVM/DiskSuite any more.

If the disk is genuinely faulty (which it appears to be judging by the number of hard errors and the fact that metareplace fails) then you should physically replace it before attempting to rejoin it to the mirrors.

If you need to recover the data from d26 one option would be to simply mount that device instead of d6.
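
For what it's worth, the detached submirror can be spotted mechanically from the metastat -p listing in post #4. The sketch below is an illustration only: find_unattached is a made-up name, it assumes the simple "dN -m ..." and "dN 1 1 slice" line shapes shown in that listing, and it needs a POSIX awk (nawk on Solaris) for the `in` test.

```shell
# Sketch: read `metastat -p` on stdin; report one-way mirrors and
# metadevices that no mirror references.
# find_unattached is a hypothetical helper name.
find_unattached() {
    awk '
        $2 == "-m" {
            # fields after -m are submirror names; the bare number is the pass count
            n = 0
            for (i = 3; i <= NF; i++)
                if ($i ~ /^d[0-9]+$/) { used[$i] = 1; n++ }
            if (n == 1) oneway[++o] = $1
            next
        }
        # every other non-empty line defines a concat/stripe
        NF > 0 { dev[++d] = $1 }
        END {
            for (i = 1; i <= o; i++) print "one-way mirror: " oneway[i]
            for (i = 1; i <= d; i++)
                if (!(dev[i] in used)) print "not in any mirror: " dev[i]
        }
    '
}
```

Fed the listing from post #4 via `metastat -p | find_unattached`, this should flag d6 as a one-way mirror and d26 as not belonging to any mirror, which is the situation described above.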