RAID 10 Failed Drive Swap


 
# 1  
Old 01-17-2014
[Solved] RAID 10 Failed Drive Swap

I am new to the AIX operating system and am looking for some advice. A drive recently went bad on our AIX server; it is part of a RAID 10 array, and a replacement is on the way. What are the correct steps to swap out this drive? Does the server need to be powered off, or can I hot-swap it?

I found some instructions through the diag command where I can perform a hot plug task and remove the drive from the array.


Attached is a screenshot from the "diag" command showing the Disk Array Configuration. pdisk6 is the affected disk.
# 2  
Old 01-17-2014
First off, welcome to the AIX board.

Having said this, it would help if you described your hardware in a bit more detail. The more detail you give, the better the offered solutions will be.

In general (but this will depend on your hardware, so take this cum grano salis) it will not be necessary to power off or even unmount the filesystems involved. AIX's LVM and IBM's RAID driver can handle practically all the necessary tasks while the storage is in use. I wouldn't start the biggest database import available while recovering from a disk failure, but that's about it.

I have not used RAID arrays for probably 10 years now, so I can only draw on some remote memory, but IBM's arrays always included hot-standby disks. A failed disk is immediately replaced by the standby; when recovering the array, you take the failed disk out and bring in a new disk to serve as the new standby.
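
To make the hot-standby idea concrete, here is a minimal sketch in Python. It is a toy model, not anything the RAID adapter actually runs: the class name `Raid10Array`, its methods, and the disk names are all made up for illustration, and it ignores the time a real rebuild onto the standby would take.

```python
from dataclasses import dataclass, field


@dataclass
class Raid10Array:
    """Toy model of RAID 10: a stripe across mirrored pairs, plus hot spares."""
    mirrors: list                                  # each entry is a [disk, disk] pair
    spares: list = field(default_factory=list)     # idle hot-standby disks
    failed: set = field(default_factory=set)       # names of failed disks

    def fail(self, disk):
        """Mark a disk failed; if a spare is available it takes over the slot."""
        self.failed.add(disk)
        for pair in self.mirrors:
            if disk in pair and self.spares:
                # Assumption: the rebuild onto the spare completes instantly.
                pair[pair.index(disk)] = self.spares.pop(0)
                return

    def state(self):
        """Return 'optimal', 'degraded', or 'failed' for the whole array."""
        healthy = [sum(d not in self.failed for d in pair) for pair in self.mirrors]
        if min(healthy) == 0:
            return "failed"        # one mirror lost both members: data is gone
        return "degraded" if min(healthy) == 1 else "optimal"


# Hypothetical four-disk array with one hot spare.
array = Raid10Array(
    mirrors=[["pdisk0", "pdisk1"], ["pdisk2", "pdisk3"]],
    spares=["pdisk4"],
)
array.fail("pdisk2")
print(array.state())   # spare took over
array.fail("pdisk0")
print(array.state())   # no spare left, one mirror runs on a single disk
```

The point of the sketch is only that the array stays usable as long as every mirrored pair keeps at least one healthy member, which is why the swap can usually be done with the system running.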

I hope this helps.

bakunin
# 3  
Old 01-17-2014
Thank you for the welcome. Yes, your post does shed light on the hot swap and confirms what I was thinking.

I am out of the office and forgot to grab the exact model number, but the system is an older IBM TotalStorage unit. Attached is a picture I found that looks somewhat like the unit we have in place, minus the model number.

I do appreciate the help and response and I am sorry for the limited information I have; I am new to the field but that is no excuse.

Also attached is a low-quality photo I took a while ago and found on my phone. The arrow is pointing to the affected SCSI drive. The drives are all IBM Ultra 320 36GB at 10K RPM.
# 4  
Old 01-21-2014
Just following up with what I did, in the hope that it helps someone else who runs into this issue.

Use the diag command to check the array and find the failed disk(s).

# diag
---> Task Selection
---> RAID Array Manager
---> PCI-X SCSI Disk Array Manager
---> List PCI-X SCSI Disk Array Configuration
---> sisioa1 Available 06-08 PCI-X Dual Channel U320 SCSI RAID



Activate the LED indicator of the physical disk to locate it on the rack.

# diag
---> Task Selection
---> Hot Plug Task
---> SCSI and SCSI RAID Hot Plug Manager
---> Replace/Remove a Device Attached to an SCSI Hot Swap Enclosure
---> Select the failed disk here (pdisk#)


A message about the LED and the "remove" state will appear. Find the physical drive whose LED is now flashing amber and remove it, then put the new drive in its place.
Press Enter on that message screen to take the slot out of the "remove" state.

Configure the replacement drive.

# diag
---> Task Selection
---> Hot Plug Task
---> SCSI and SCSI RAID Hot Plug Manager
---> Configure Added/Replaced Devices


Log the repair action for the affected disk.

# diag
---> Task Selection
---> Log Repair Action (select the affected disk)



Rebuild the array

# diag
---> Task Selection
---> RAID Array Manager
---> PCI-X SCSI Disk Array Manager
---> Reconstruct a PCI-X SCSI Disk Array
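
For anyone who wants to keep this as a runbook, the sequence above can be written down as data. This is only a sketch in Python: the menu entries are copied from the steps above, while the names `STEPS` and `render` are my own and nothing here talks to diag itself.

```python
# Each step is (purpose, diag menu path). Menu entries are taken from the
# steps above; adapter and disk names will differ on other systems.
STEPS = [
    ("Find the failed disk", [
        "Task Selection",
        "RAID Array Manager",
        "PCI-X SCSI Disk Array Manager",
        "List PCI-X SCSI Disk Array Configuration",
    ]),
    ("Identify and replace the physical drive", [
        "Task Selection",
        "Hot Plug Task",
        "SCSI and SCSI RAID Hot Plug Manager",
        "Replace/Remove a Device Attached to an SCSI Hot Swap Enclosure",
    ]),
    ("Configure the replacement drive", [
        "Task Selection",
        "Hot Plug Task",
        "SCSI and SCSI RAID Hot Plug Manager",
        "Configure Added/Replaced Devices",
    ]),
    ("Log the repair action", [
        "Task Selection",
        "Log Repair Action",
    ]),
    ("Rebuild the array", [
        "Task Selection",
        "RAID Array Manager",
        "PCI-X SCSI Disk Array Manager",
        "Reconstruct a PCI-X SCSI Disk Array",
    ]),
]


def render(steps):
    """Return the checklist as numbered plain text, one menu entry per line."""
    lines = []
    for number, (purpose, path) in enumerate(steps, start=1):
        lines.append(f"{number}. {purpose}")
        lines.append("   # diag")
        lines.extend(f"   ---> {entry}" for entry in path)
    return "\n".join(lines)


print(render(STEPS))
```

Keeping the order in one list makes it harder to skip the "Log Repair Action" step, which is easy to forget once the new drive is in.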
# 5  
Old 01-21-2014
Thanks for your contribution, appreciated
# 6  
Old 01-21-2014
Thank you for sharing the solution. This is the spirit!

Moderator's Comments:
I changed the status of the thread to "Solved".


bakunin