mdadm unable to fail a resyncing drive?


 
# 1  
Old 08-09-2011

Hi All

I have a four-drive RAID 5 array in which one of the drives has partially failed.

After a reboot, the faulty drive shows as background rebuilding, and mdadm /dev/ARRAYID shows three drives as in sync with the fourth drive as "spare rebuilding".

However, the array won't come online; instead it reports itself as "active, degraded, not started".

I want to take the faultyesque drive out of the array, as smartctl shows it as just about to fail with a reallocated sectors problem.
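
For anyone wanting to check the same SMART attribute, here is a minimal sketch. It assumes the usual ten-column attribute table printed by `smartctl -A`, with the raw value in the last column; `reallocated_count` is an illustrative helper name and `/dev/sdX` is a placeholder, neither comes from this thread.

```shell
#!/bin/sh
# Read `smartctl -A` output on stdin and print the raw value (10th column)
# of the Reallocated_Sector_Ct attribute, if the drive reports one.
# Assumption: the standard ATA attribute table layout.
reallocated_count() {
    awk '$2 == "Reallocated_Sector_Ct" { print $10 }'
}
```

Usage would look like `smartctl -A /dev/sdX | reallocated_count`, run as root. A raw value that is non-zero and still climbing is generally taken as a sign that the drive should be replaced.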

I've executed a stop on the array, but I can't fail the faulty drive: it just reports "No such device". However, mdadm -E /dev/thatdrive reports it as belonging to the array, and mdadm -D /dev/thatarray shows the drive in the array, but with thatdrive as "spare rebuilding".

I've read assorted stuff on this for the last 7-8 hours and, short of physically pulling the drive out, can't see a way to get the drive out of the array. I've even done a reboot to see if that helped, with no effect.

What am I missing?

Thanks in advance.

---------- Post updated at 09:13 PM ---------- Previous update was at 03:25 AM ----------

No one has any ideas?

Wow, it's scary how little depth of knowledge of mdadm there is out there.
# 2  
Old 08-09-2011
The man page seems to say you can pull them all offline, but is otherwise terse, so maybe they assume that if you want to limp along without the bad drive, you just pull it? Maybe the facility you want is lower down, in the system device layer, not the md virtual device layer.
# 3  
Old 08-09-2011
Quote:
Originally Posted by DGPickett
The man page seems to say you can pull them all offline, but is otherwise terse, so maybe they assume that if you want to limp along without the bad drive, you just pull it? Maybe the facility you want is lower down, in the system device layer, not the md virtual device layer.
According to what I've read elsewhere, you should be able to run:

mdadm --stop /dev/thatarray

mdadm --fail /dev/thatdrive

and then the drive should no longer show as in the array. Well, that's the theory, but it ain't working in practice.
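
For reference, mdadm's manage-mode operations take the array device first and the member device second, and they act on a running array. Once the array has been stopped or has not started, there may be nothing for `--fail` to act on, which is one plausible source of "No such device". The following is a minimal dry-run sketch, not a verified fix for this array. The helper names (`run`, `fail_and_remove`, `assemble_without`) and the device names in the usage note are hypothetical.

```shell
#!/bin/sh
# Dry-run by default: commands are printed, not executed.
# Set DRY_RUN=0 to actually run them (requires root).
run() {
    echo "+ $*"
    [ "${DRY_RUN:-1}" = "1" ] || "$@"
}

# Mark a member faulty, then remove it, while the array is running.
# $1 = array device, $2 = member device.
fail_and_remove() {
    array=$1
    member=$2
    run mdadm --manage "$array" --fail "$member" &&
    run mdadm --manage "$array" --remove "$member"
}

# If the array is stopped, try to assemble it degraded from the
# good members only; --run asks mdadm to start it even though
# fewer devices are present than last time.
# $1 = array device, remaining args = good member devices.
assemble_without() {
    array=$1
    shift
    run mdadm --assemble --run "$array" "$@"
}
```

For example, `fail_and_remove /dev/md0 /dev/sdd1` prints the two manage-mode commands, and `assemble_without /dev/md0 /dev/sda1 /dev/sdb1 /dev/sdc1` prints an assemble command that leaves the suspect drive out. Reviewing the printed commands before setting `DRY_RUN=0` is the point of the dry-run default.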
# 4  
Old 08-10-2011
Try --force?
# 5  
Old 08-11-2011
I'd already tried with the --force option. No go, I'm afraid.

I should get the replacement disks today, so I will be physically pulling the drive and crossing my fingers.
# 6  
Old 08-11-2011
You did not give us very much information, so it is hard to give precise answers.

What flavor and version of Linux are you on?

What is the output of:
Code:
cat /proc/mdstat
mdadm --detail /dev/thatarray
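
The information asked for above can be gathered in one pass. This is a minimal sketch, assuming the array and member device names are passed in by the caller; `collect_md_info` and `run` are illustrative helper names, and adding `uname -a` and `mdadm --examine` goes beyond the two commands listed.

```shell
#!/bin/sh
# Print (and optionally run) read-only diagnostic commands.
# Dry-run by default; set DRY_RUN=0 to execute (mdadm usually needs root).
run() {
    echo "+ $*"
    [ "${DRY_RUN:-1}" = "1" ] || "$@"
}

# $1 = array device, remaining args = member devices to examine.
collect_md_info() {
    array=$1
    shift
    run uname -a                       # kernel version
    run cat /proc/mdstat               # kernel's view of all md arrays
    run mdadm --detail "$array"        # array-level state
    for member in "$@"; do
        run mdadm --examine "$member"  # per-device md superblock
    done
}
```

With `DRY_RUN=0`, redirecting the output of `collect_md_info` (given the array and each member device) to a file gives something that can be pasted into a reply.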

# 7  
Old 08-12-2011
Though I always found md to be stable, the Linux world seems to have fixated on the much more flexible (and thoroughly documented) Logical Volume Manager (LVM) tools. Should you recover, consider a rebuild with LVM. There are more steps and a learning curve involved, but these are outweighed by the ability to get support.