I'm new to this forum and, as you will see from my question, I'm new to UNIX as well.
One of our customers has an HP rx4640 running UNIX with two mirrored, hot-swappable 300GB disks. They reported to us that one of the disks is faulty, and they want us to take care of it. Below is the only log they sent us.
Code:
Fri May 18 17:50:11 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol1 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:50:12 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol3 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:50:12 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol4 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:50:12 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol5 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:50:12 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol6 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:50:12 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol7 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:50:12 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol8 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:50:12 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/SwapVol2 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:56:06 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol1 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:56:06 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol3 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:56:06 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol4 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:56:06 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol5 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:56:06 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol6 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:56:06 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol7 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:56:06 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/lvol8 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Fri May 18 17:56:06 2012 STCHK 122 sd_procchk sd_procchk 1 Logical volume
/dev/vg00/SwapVol2 is mirrored but has some stale blocks. Data loss on
hardware failure could occur.
Code:
# pvdisplay -v /dev/disk/disk13_p2 | grep stale
00000 stale /dev/vg00/lvol1 00000
00089 stale /dev/vg00/lvol3 00000
00090 stale /dev/vg00/lvol3 00001
00094 stale /dev/vg00/lvol3 00005
00096 stale /dev/vg00/lvol3 00007
00121 stale /dev/vg00/lvol4 00000
00122 stale /dev/vg00/lvol5 00000
00171 stale /dev/vg00/lvol5 00049
00176 stale /dev/vg00/lvol5 00054
00177 stale /dev/vg00/lvol5 00055
00183 stale /dev/vg00/lvol5 00061
00184 stale /dev/vg00/lvol5 00062
00186 stale /dev/vg00/lvol5 00064
00215 stale /dev/vg00/lvol5 00093
00219 stale /dev/vg00/lvol5 00097
00221 stale /dev/vg00/lvol5 00099
00237 stale /dev/vg00/lvol5 00115
00242 stale /dev/vg00/lvol5 00120
00279 stale /dev/vg00/lvol6 00000
00296 stale /dev/vg00/lvol7 00000
00298 stale /dev/vg00/lvol7 00002
00299 stale /dev/vg00/lvol7 00003
00306 stale /dev/vg00/lvol7 00010
00309 stale /dev/vg00/lvol7 00013
00314 stale /dev/vg00/lvol7 00018
00318 stale /dev/vg00/lvol7 00022
00326 stale /dev/vg00/lvol7 00030
00327 stale /dev/vg00/lvol7 00031
00337 stale /dev/vg00/lvol7 00041
00338 stale /dev/vg00/lvol7 00042
00340 stale /dev/vg00/lvol7 00044
00344 stale /dev/vg00/lvol7 00048
00415 stale /dev/vg00/lvol8 00000
00416 stale /dev/vg00/lvol8 00001
00417 stale /dev/vg00/lvol8 00002
00422 stale /dev/vg00/lvol8 00007
00429 stale /dev/vg00/lvol8 00014
00434 stale /dev/vg00/lvol8 00019
00437 stale /dev/vg00/lvol8 00022
00438 stale /dev/vg00/lvol8 00023
00439 stale /dev/vg00/lvol8 00024
00441 stale /dev/vg00/lvol8 00026
00445 stale /dev/vg00/lvol8 00030
00446 stale /dev/vg00/lvol8 00031
00447 stale /dev/vg00/lvol8 00032
00448 stale /dev/vg00/lvol8 00033
00449 stale /dev/vg00/lvol8 00034
00459 stale /dev/vg00/lvol8 00044
00460 stale /dev/vg00/lvol8 00045
00461 stale /dev/vg00/lvol8 00046
00462 stale /dev/vg00/lvol8 00047
00497 stale /dev/vg00/SwapVol2 00000
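The listing above can be condensed into a count of stale extents per logical volume. This is only a minimal sketch: `count_stale` is a hypothetical helper name (not an HP-UX command), and it assumes the four-column layout shown above, with the status in the second column and the logical volume in the third.

```shell
# Count stale physical extents per logical volume.
# Reads `pvdisplay -v <pv>` output on stdin (assumed layout: PE, status, LV, LE).
count_stale() {
    awk '$2 == "stale" { n[$3]++ }
         END { for (lv in n) print lv, n[lv] }' | sort
}

# Example (on the HP-UX host):
#   pvdisplay -v /dev/disk/disk13_p2 | count_stale
```

Lines whose status is anything other than `stale` are ignored, so the full `pvdisplay -v` output can be piped in without a prior `grep`.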
With my limited knowledge of UNIX, I assumed from this that the disk ID is 13. If so, how do I find which of the two physical disks should be replaced?
And once I identify the problematic disk, are the steps below correct?
1) Check that the disk is not in the root volume group with the lvlnboot -v command
2) Continue with the disk replacement:
Code:
# pvchange -a N /dev/dsk/-
# <replace the hot-swappable disk>
# vgcfgrestore -n vg01 /dev/rdsk/-
# vgchange -a y vg01
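The check in step 1 could be scripted as a plain text match. This is a rough sketch, assuming `lvlnboot -v` lists the root volume group's physical volumes by device path; `pv_in_root_vg` is a hypothetical helper name, and the disk path is the one taken from the pvdisplay command above.

```shell
# Succeeds (exit status 0) if the given device path appears in the
# `lvlnboot -v` output supplied on stdin. Hypothetical helper, not an HP-UX command.
pv_in_root_vg() {
    grep -qF -- "$1"
}

# Example (on the HP-UX host):
#   lvlnboot -v | pv_in_root_vg /dev/disk/disk13_p2 && echo "disk is in the root VG"
```

This is a fixed-string substring match, so a shorter path such as /dev/disk/disk1 would also match a line containing /dev/disk/disk13_p2; pass the full device path to avoid that.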
If I'm way off, please let me know. I got all this from "When Good Disks Go Bad" and, as I mentioned, I have very little experience with UNIX.
Any help is appreciated.
Thanks Gjk