Replace a Failed Root Disk in AIX


 
Thread Tools Search this Thread
Operating Systems AIX Replace a Failed Root Disk in AIX
# 1  
Old 08-10-2006
MySQL Replace a Failed Root Disk in AIX

My root disk is failed and how to replace the root disk in AIX.

Can u give a detailed explanation in step wise.

Pls give the answer taking different scenarios.



Regards
Praveen
# 2  
Old 08-14-2006
Here's some notes that I created awhile ago:

Code:
Disk Replacement

How to replace a bad drive? (mirrored disks)

# lsvg -p rootvg

rootvg:

PV_NAME           PV STATE          TOTAL PPs   FREE PPs    FREE DISTRIBUTION

hdisk0            active            542         264         109..00..00..46..109

hdisk1            active            542         278         88..00..00..81..109

 

*Here your making sure that both disks are assigned to the volume group

 

# lsvg -l rootvg

rootvg:

LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT

hd5                 boot       1     2     2    closed/syncd  N/A

hd6                 paging     64    128   2    open/syncd    N/A

hd8                 jfslog     1     2     2    open/syncd    N/A

hd4                 jfs        6     12    2    open/syncd    /

hd2                 jfs        127   254   2    open/syncd    /usr

hd9var              jfs        4     8     2    open/syncd    /var

hd3                 jfs        37    74    2    open/syncd    /tmp

hd1                 jfs        3     6     2    open/syncd    /home

hd10opt             jfs        21    42    2    open/syncd    /opt

dumplv              sysdump    14    14    1    open/syncd    N/A

 

*Here your checking to make sure there's a 1:2 relationship, meaning that there are copies.  Notice dumplv.  It's not copied so we need to make sure dumplv data isn't on the failing disk.  To check, run;

 

$ lslv -l dumplv

dumplv:N/A

PV                COPIES        IN BAND       DISTRIBUTION

hdisk0            014:000:000   100%          000:014:000:000:000

This is telling us that the logical volume dumplv is on hdisk0.  If hdisk1 is the failing disk, then we are okay.  Otherwise, we would have to migrate the data over to the good drive and proceed.

 

# unmirrorvg rootvg hdisk1

# reducevg rootvg hdisk1

# rmdev -l hdisk1 -d

 

Before you power down, it's a good idea to check the system to make sure it will boot from the good drive.  Do that by performing;

 

# bootinfo -b

hdisk1

 

This tells you what drive it was last booted up.  We want to change this to boot to the new drive, so;

 

# bosboot -ad /dev/hdisk0

 

And check bootlist

 

# bootlist -m normal -o

 

NOW WE CAN POWER DOWN THE BOX AND REPLACE THE DRIVE

 

Once disk has been replaced, power up the server.  Once at command prompt, run;

 

# cfgmgr

 

This will install the new device and allow the OS to see it.

 

# lsdev -Cc disk

hdisk0  Available 40-60-00-4,0 16 Bit LVD SCSI Disk Drive

hdisk1  Available 40-60-00-8,0 16 Bit LVD SCSI Disk Drive

 

Make sure that the OS says it's available.  If it is, we can assign it to a volume group.

 

# extendvg rootvg hdisk1

 

This will assign it a PVID and assign it to the volumegroup rootvg to make it available for use.  Now we can mirror;

 

# mirrovg rootvg

 

This will take a little while as it's taking all data now on hdisk0 and making a copy to hdisk1.

 

# lsvg -p rootvg

rootvg:

PV_NAME           PV STATE          TOTAL PPs   FREE PPs    FREE DISTRIBUTION

hdisk0            active            542         264         109..00..00..46..109

hdisk1            active            542         278         88..00..00..81..109

 

Once it's mirroring, we can make sure it's assigned to rootvg by doing the above.  We can also check to make sure there's copies;

 

# lsvg -l rootvg

rootvg:

LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT

hd5                 boot       1     2     2    closed/syncd  N/A

hd6                 paging     64    128   2    open/syncd    N/A

hd8                 jfslog     1     2     2    open/syncd    N/A

hd4                 jfs        6     12    2    open/syncd    /

hd2                 jfs        127   254   2    open/syncd    /usr

hd9var              jfs        4     8     2    open/syncd    /var

hd3                 jfs        37    74    2    open/syncd    /tmp

hd1                 jfs        3     6     2    open/syncd    /home

hd10opt             jfs        21    42    2    open/syncd    /opt

dumplv              sysdump    14    14    1    open/syncd    N/A

 

Now we need to modify the bosboot to recreate the boot image;

 

# bosboot -a

 

Double check your bootlist to make sure hdisk1 is in there;

 

# bootlist -m normal -o

 

YOUR DONE

.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. AIX

Stale PPs in AIX, failed disks.. how to replace?

I have a AIX 7.1 system that has 3 failed disks, 1 in rootvg and 2 in vg_usr1. Here is the output of lspv. # lspv hdisk0 00044d4dfbb11575 vg_usr1 active hdisk1 0000150179158027 vg_usr1 active hdisk2 ... (11 Replies)
Discussion started by: c3rb3rus
11 Replies

2. Solaris

Need assistance to replace root disk on Netra X4200

Good day. I have a SUN Netra X4200 running on Solaris 10, with two disks in raid 1 configuration. HDD0 (c2t0d0) seems to have a problem but HDD1 (c2t1d0) is ok: # for a in c2t0d0 c2t1d0 ; do raidctl -l $a; done Volume Size Stripe Status Cache RAID Sub ... (5 Replies)
Discussion started by: BRH
5 Replies

3. Red Hat

Find out the failed disk

Hi Masters, How to find out the failed disk in redhat linux (OR) How to decalare whether our hard disk goes bad ? My option is: dmesg | grep "failed" check in /var/log/messages with the kernel error. Please corrrect me if im wrong. And also i heard "smartctl" command. But it's... (2 Replies)
Discussion started by: abhay1983
2 Replies

4. Solaris

Lost Root Password on VXVM Encapsulated Root Disk

Hi All Hope it's okay to post on this sub-forum, couldn't find a better place I've got a 480R running solaris 8 with veritas volume manager managing all filesystems, including an encapsulated root disk (I believe the root disk is encapsulated as one of the root mirror disks has an entry under... (1 Reply)
Discussion started by: sunnyd76
1 Replies

5. Solaris

Cannot remove and replace failed disk

-bash-3.00# zpool detach zonepool c1t1d0 cannot detach c1t1d0: only applicable to mirror and replacing vdevs -bash-3.00# zpool remove zonepool c1t1d0 cannot remove c1t1d0: only inactive hot spares, cache, top-level, or log devices can be removed -bash-3.00# zpool offline zonepool c1t1d0... (3 Replies)
Discussion started by: LittleLebowski
3 Replies

6. Solaris

Step by step to replace Failed disk

Hi all. This may be a stupid question but here goes: I lost a disk on my Sun-Fire 480-r and want to replace it. I don't care about the data, I just need to get a new disk in. Can anyone supply me with step by step procedure please. I am not an expert by any means and it is a dev box so I... (1 Reply)
Discussion started by: jamie_collins
1 Replies

7. Solaris

How to replace failed disk?

Dear all Please can any one explain me how to replace failed disk in Solaris 10. Please tell me the step by step procedure. (9 Replies)
Discussion started by: suneelieg
9 Replies

8. Solaris

Help: root partition crashed, fsck failed. Disk problem?

recently this sunfire 280R (solaris 10) is having problme, it was able to boot up, then all suddent, it failed, and I can't even mount the first disk which contains root partition when I boot up from cdrom. 2 X 36G hard drive, I scan first hard drive by "read", there is no bad block. The... (2 Replies)
Discussion started by: steeles
2 Replies

9. UNIX for Advanced & Expert Users

broadcasting failed 'su root'

hi, if i try to 'su' to root and enter the wrong password, the failed su is broadcastet to all user logged in on the machine. Anyone know where i can stop that broadcast? thanks Lazzar (3 Replies)
Discussion started by: Lazzar
3 Replies

10. UNIX for Advanced & Expert Users

Scsi Disk Failed

My WS boot disk has failed so when i want to boot system (by OK boot -s or Ok boot disk0 ) i get these mesgs: disk read error boot :can not find misc/sparc9v/krtld boot:error loading interperetor (misc/sparc9v/krtld) Elf64 read error. boot failed. although it's probed ,finally i try... (1 Reply)
Discussion started by: nikk
1 Replies
Login or Register to Ask a Question