Solaris 8 disk/mirroring issue


 
Thread Tools Search this Thread
Top Forums UNIX for Advanced & Expert Users Solaris 8 disk/mirroring issue
# 1  
Old 11-08-2005
Solaris 8 disk/mirroring issue

Hello!

I recently inherited system administration duties for a SUN v880 box. The system has 6 physical hard disks.. In doing some basic maintenance, I found they're configured for mirroring. I ran the metastat and metadb commands, and many of the mirrors are showing they are in need of maintenance -> See the attachments.

Question is -- How do get all the mirrors back online and functional? I was considering running the metareplace utility to enable the metabases, but it seems as if one of the hard disks is no longer being recongized by the system. (c1t3d0 -> see disks.txt) I've checked the /var/adm/message files but I don't see anything that would indicate the disk has failed, and the front status panel of the server itself (Including the error condition lights for each drive) is showing no problems... I was thinking of running touch /reconfigure, seeing the disk is redetected, if it is, then replacing the failed metabases using the metareplace and rebooting. Do you guys feel this will fix the problem?

Any advice would be greatly appreciated!
# 2  
Old 11-08-2005
M p unknown unknown /dev/dsk/c1t1d0s4 << bad disk
M p unknown unknown /dev/dsk/c1t3d0s4 << bad disk

Note in your metadb - looks like someone added to the metadb but never rebooted - that's usually why you get the unknown in the status although it's possible you are also getting it because the disks are bad. The good thing is that there are so many copies of the metadb - so you can replace the bad disks without no worry about losing anything.

As far as the bad disk, get the replacement disk drives and you can use the metareplace command. Or you can use metadetach and metaclear to remove them from the metastat and replace the drives, format, and readd them. You can hot swap the drives so you should not need any down time.

Also, insure that the boot device is c1t0d0s0 and not c1t1d0s0. You don't want to hot swap or remove the drive you are booted off of (inital boot will be off one drive and then the mirrors come into play). Just check the output of eeprom and look for boot-device - hopefully it isn't simply "disk:a" but gives you the output that you would be able to tell which drive it really is. If not, then you can play it safe and reboot into single user to do the drive replacements. More info can be found in this thread
# 3  
Old 11-08-2005
Quote:
Originally Posted by RTM
M p unknown unknown /dev/dsk/c1t1d0s4 << bad disk
M p unknown unknown /dev/dsk/c1t3d0s4 << bad disk

Note in your metadb - looks like someone added to the metadb but never rebooted - that's usually why you get the unknown in the status although it's possible you are also getting it because the disks are bad. The good thing is that there are so many copies of the metadb - so you can replace the bad disks without no worry about losing anything.

As far as the bad disk, get the replacement disk drives and you can use the metareplace command. Or you can use metadetach and metaclear to remove them from the metastat and replace the drives, format, and readd them. You can hot swap the drives so you should not need any down time.

Also, insure that the boot device is c1t0d0s0 and not c1t1d0s0. You don't want to hot swap or remove the drive you are booted off of (inital boot will be off one drive and then the mirrors come into play). Just check the output of eeprom and look for boot-device - hopefully it isn't simply "disk:a" but gives you the output that you would be able to tell which drive it really is. If not, then you can play it safe and reboot into single user to do the drive replacements. More info can be found in this thread
The boot disk looks like it refers to the c1t0d0 disk according to eeprom. It doesn't indicate the slice.

I think that error occured because of an incorrect powerdown... But I'm not sure because I just got a hold of this box. As I said the drive light indicators on the outside of the box aren't indicating any faults. On the negative side, format does not list c1t3d0, but it does list c1t1d0 as well as all other drives.

Is there any way to tell if the disk is definetely bad? The box was recently moved, so I am concerned the drives may have been rattled around a bit, maybe even come unseated. I was thinking of powering down, reseating all the drives, then booting and running touch /reconfigure. If the drive was previously detected in the system would I even need to do that for it to come back online?

Thanks!
# 4  
Old 11-08-2005
Re-seating the 'bad' drives can be done without powering down - they are hot-swappable so it would be like removing the old and putting in the new (just that it's the same drive). So try that first with c1t3 - if it spins up the system may see it. If not, you haven't lost anything.

You can determine the slice by looking at /etc/vfstab and finding the md device for / partiton. Then look at your metastat output for that device.

And the only way to tell if the drive is bad is to hit it - ls -Rla from top of the partition should create some errors at some point - or go into format and run an analyze (read, refresh, or test - the ones that do not harm data).

Double check that syslogd is running and configured to pop warning messages into your /var/adm/messages file (or what ever you put it in).
# 5  
Old 11-08-2005
Quote:
Originally Posted by RTM
Re-seating the 'bad' drives can be done without powering down - they are hot-swappable so it would be like removing the old and putting in the new (just that it's the same drive). So try that first with c1t3 - if it spins up the system may see it. If not, you haven't lost anything.

You can determine the slice by looking at /etc/vfstab and finding the md device for / partiton. Then look at your metastat output for that device.

And the only way to tell if the drive is bad is to hit it - ls -Rla from top of the partition should create some errors at some point - or go into format and run an analyze (read, refresh, or test - the ones that do not harm data).

Double check that syslogd is running and configured to pop warning messages into your /var/adm/messages file (or what ever you put it in).
I tried removing and reseating the drive, but no go. The box still isn't seeing it... I ran an iostat -En and found c1t1 has a ton of hard errors and c1t3 is not listed... So I've requested two replacement drives from Sun.

Once I get them, I can hot swap them for a the new drives, right? Is there a command a need to run before yanking out the drive and putting in the new one? C1t1 seems to be a mirror of the slices of the system disk. If so I can then just run these command to setup slices automatically, right?

# prtvtoc /dev/rdsk/c1t0d0s2 > /tmp/format.out
# fmthard -s /tmp/format.out /dev/rdsk/c1t1d0s2

# prtvtoc /dev/rdsk/c1t2d0s2 > /tmp/format.out
# fmthard -s /tmp/format.out /dev/rdsk/c1t3d0s2

Then once that's done, I enable the metabases on the new disks as metastat indicates:

# /usr/opt/SUNWmd/metareplace -e d60 c1t3d0s6
# /usr/opt/SUNWmd/metareplace -e d60 c1t1d0s0
# /usr/opt/SUNWmd/metareplace -e d60 c1t1d0s1
# /usr/opt/SUNWmd/metareplace -e d60 c1t1d0s3
# /usr/opt/SUNWmd/metareplace -e d60 c1t1d0s6

The mirrors will start resyncing once metareplace is invoked, right?

Thanks!
# 6  
Old 11-08-2005
That should work as far as I see.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Solaris

Solaris 11 disk issue

I have 2 disks in my system.I recently added a zpool to the disk, but today I changed my mind and deleted the zpool , zpool destroy -f extra The zpool is now deleted and I want to partition the disk, so I delete the only partition on the disk. Now when I run format again, format... (13 Replies)
Discussion started by: cbtshare
13 Replies

2. Solaris

Root disk mirroring in Solaris 10

I would like to perform root disk mirroring task. Can someone please help me out on this. Thanks !! Regards, Rama (2 Replies)
Discussion started by: ramagore85
2 Replies

3. Solaris

Solaris Mirroring/Hotspare Issue

Hi Guys, I have an issue with my Hotspares/Metastat on an x4270m2. I have no hotspares available, when i printed out my metastat i got d21: Concat/Stripe Size: 33559785 blocks (16 GB) Stripe 0: Device Start Block Dbase Reloc ... (0 Replies)
Discussion started by: A-Train
0 Replies

4. Solaris

root disk mirroring in solaris volume manager for solaris 10

Need a procedure document to do "root disk mirroring in solaris volume manager for solaris 10". I hope some one will help me asap. I need to do it production environment. Let me know if you need any deatils on this. Thanks, Rama (1 Reply)
Discussion started by: ramareddi16
1 Replies

5. Solaris

Disk Mirroring on solaris 5.8

Hi Friends, I am having Sun Solaris 5.8 OS installed having 2 different size hard disk, sizes are c0t0d0s0(160 GB) and c0t2d0s0 (40GB). I have installed Sun Solaris 5.8 OS in c0t0d0s0 (160GB) harddisk. I have configured all the parameters required for disk mirroring. But when executing... (4 Replies)
Discussion started by: Vijayakumarpc
4 Replies

6. Solaris

Solaris 10 Disk Mirroring

Has anyone managed to set up disk mirroring in Solaris 10 yet? If so can you point me in the direction of some useful documentation please. Cheers (25 Replies)
Discussion started by: korfnz
25 Replies

7. Solaris

Disk Mirroring for Solaris 10 x86

I am having an issue with setting up disk mirroring for Solaris 10 on an x86 server. My main problem is that the volumes and slices have already been setup and our proprietary software has already been installed and configured. The entire drive has been allocated in this configuration and the... (2 Replies)
Discussion started by: chuck43
2 Replies

8. Solaris

Help !! disk Mirroring

Hi I have a Sunfire X4100 box with a 4 disk Chassis (although I only have 2 disks in it). I have been asked to add two more disks into the chassis so that I can mirror the original two using SVM .....Ive read through a couple of SVM docs but am finding it a little confusing, and if any of you... (1 Reply)
Discussion started by: hcclnoodles
1 Replies

9. Solaris

[HELP] mirroring disk at solaris 8 sparc v890

hi expert, sorry i'm new in solaris system, need more advice and help :) i'm confusing with mirroring disk at my sparc v890 solaris 8, i have 6 harddisk which the structure shown below : format : AVAILABLE DISK SELECTIONS: 0. c1t0d0 <SUN146G cyl 14087 alt 2 hd 24 sec 848> ... (4 Replies)
Discussion started by: bucci
4 Replies

10. Solaris

Solaris mirroring / non-root disk

Hi Guys, Need to add 2 disks into a JBOD array (3310). Does anyone see anything wrong with my Procedure / Doco below? 1> Logon to system, check system logs for abnormal entries. 2> Make backups of related system files: A>cp -p /etc/system /etc/system.backup.081505 B>cp -p /etc/vfstab... (3 Replies)
Discussion started by: BG_JrAdmin
3 Replies
Login or Register to Ask a Question