11-17-2011
corrupt disk
Hello friends,
I have application X running on HP-UX 11.11 with Oracle 9i Release 2. I recently had a hardware failure on disk /dev/dsk/c2t0d0.
Below are the event details from the system log, followed by the ioscan, bdf and vgdisplay output:
Code :
root@a7dmc:/var/adm/syslog [139] > /opt/resmon/bin/resdata -R 155713541 -r /storage/events/enclosures/gazemon/0_1_1_0.0.0 -n 155713537 -a
CURRENT MONITOR DATA:
Event Time..........: Wed Aug 17 23:55:47 2011
Severity............: CRITICAL
Monitor.............: gazemon
Event #.............: 100337
System..............: a7dmc
Summary:
     Disk at hardware path 0/1/1/0.0.0 : Media failure
Description of Error:
     The device was unsuccessful in reading data for the current I/O request
     due to an error on the medium. The maximum number of retries were
     attempted and the data could not be read. The request was likely processed
     in a way which could cause damage to or loss of data.
Probable Cause / Recommended Action:
     Reformatting the medium may fix the problem.
     Alternatively, the medium in the device is flawed. If the medium is
     removable, replace the medium with a fresh one.
     Alternatively, if the medium is not removable, the device has experienced
     a hardware failure. Contact your HP support representative to have the
     device checked.
Additional Event Data:
     System IP Address...: 192.168.0.17
     Event Id............: 0x4e4c38e300000000
     Monitor Version.....: B.01.00
     Event Class.........: I/O
     Client Configuration File...........:
     /var/stm/config/tools/monitor/default_gazemon.clcfg
     Client Configuration File Version...: A.01.01
          Qualification criteria met.
               Number of events..: 1
     Associated OS error log entry id(s):
          0x4e4c38e100000000
     Additional System Data:
          System Model Number.............: 9000/800/rp3440
          EMS Version.....................: A.04.20
          STM Version.....................: A.53.00
     Latest information on this event:
          http://docs.hp.com/hpux/content/hard...csi.htm#100337
v-v-v-v-v-v-v-v-v-v-v-v-v    D  E  T  A  I  L  S    v-v-v-v-v-v-v-v-v-v-v-v-v
Product/Device Identification Information:
     Logger ID.........: sdisk
     Product Identifier: SCSI Disk
     Product Qualifier.: HP146
     SCSI Target ID....: (not available/applicable)
     SCSI LUN..........: (not available/applicable)
I/O Log Event Data:
     Driver Status Code..................: 0x0000007C
     Length of Logged Hardware Status....: 36 bytes.
     Offset to Logged Manager Information: 40 bytes.
     Length of Logged Manager Information: 34 bytes.
Hardware Status:
     Raw H/W Status:
          0x0000: 00 00 00 02   F0 00 03 01   4B F6 66 28   00 00 00 00
          0x0010: 11 01 00 80   00 3F 00 28   00 67 00 01   AF 03 00 00
          0x0020: 0E E8 03 51
     SCSI Status...: CHECK CONDITION (0x02)
          Indicates that a contingent allegiance condition has occurred.  Any
          error, exception, or abnormal condition that causes sense data to be
          set will produce the CHECK CONDITION status.
SCSI Sense Data:
     Undecoded Sense Data:
          0x0000: F0 00 03 01   4B F6 66 28   00 00 00 00   11 01 00 80
          0x0010: 00 3F 00 28   00 67 00 01   AF 03 00 00   0E E8 03 51
     SCSI Sense Data Fields:
          Error Code                      : 0x70
          Segment Number                  : 0x00
          Bit Fields:
               Filemark                   : 0
               End-of-Medium              : 0
               Incorrect Length Indicator : 0
          Sense Key                       : 0x03
          Information Field Valid         : TRUE
          Information Field               : 0x014BF666
          Additional Sense Length         : 40
          Command Specific                : 0x00000000
          Additional Sense Code           : 0x11
          Additional Sense Qualifier      : 0x01
          Field Replaceable Unit          : 0x00
          Sense Key Specific Data Valid   : TRUE
          Sense Key Specific Data         : 0x80 0x00 0x3F
          Sense Key 0x03, MEDIUM ERROR, indicates that the command terminated
          with a nonrecovered error condition that was probably caused by a
          flaw in the medium or an error in the recorded data.  This sense key
          may also be returned if the device is unable to distinguish between a
          flaw in the medium and a specific hardware failure (sense key 0x04).
          For the RECOVERED ERROR, HARDWARE ERROR, or MEDIUM ERROR Sense Key,
          the Sense Key Specific data indicates that 63 retries were attempted.
          The combination of Additional Sense Code and Sense Qualifier (0x1101)
          indicates: Read retries exhausted.
SCSI Command Data Block:  (not present in log record)
Manager-Specific Information:
     Raw Manager Data:
          0x0000: 02 08 B5 B9   00 00 34 00   00 00 00 02   00 00 00 00
          0x0010: 02 00 00 20   7A 00 09 0A   28 00 01 4B   F6 40 00 00
          0x0020: 40 00
root@a7dmc:/var/adm/syslog [140] >
root@a7dmc:/ [131] > ioscan -funC disk
Class     I  H/W Path       Driver  S/W State   H/W Type     Description
=========================================================================
disk      0  0/0/2/0.0.0.0  sdisk   CLAIMED     DEVICE       TEAC DV-28E-C
                           /dev/dsk/c0t0d0   /dev/rdsk/c0t0d0
disk      1  0/1/1/0.0.0    sdisk   CLAIMED     DEVICE       HP 146 GMAT3147NC
                           /dev/dsk/c2t0d0   /dev/rdsk/c2t0d0
disk      2  0/1/1/0.1.0    sdisk   CLAIMED     DEVICE       HP 146 GMAT3147NC
                           /dev/dsk/c2t1d0   /dev/rdsk/c2t1d0
root@a7dmc:/ [132] >
root@a7dmc:/ [133] > bdf
Filesystem            kbytes     used    avail %used Mounted on
/dev/vg00/lvol3       229376   141008    87696   62% /
/dev/vg00/lvol1       314736    59528   223728   21% /stand
/dev/vg00/lvol8      8192000  2631048  5517560   32% /var
/dev/vg00/lvfeedData
                    24117248 21653916  2424856   90% /var/opt/dmc/feedData
/dev/vg00/lvSORT      393216     1197   367525    0% /var/opt/dmc/SORT
/dev/vg00/lvASCII   60555264 53112744  7384400   88% /var/opt/dmc/ASCII
/dev/vg00/lvol7      5144576  1298368  3816216   25% /usr
/dev/vg00/lvol6     10256384  3694136  6511800   36% /tmp
/dev/vg00/lvol5      5144576  2002544  3117520   39% /opt
/dev/vg00/lvoracle  10256384  3627982  6421300   36% /opt/oracle/product/9.2.0
/dev/vg00/lvol4      5144576    20976  5083632    0% /home
root@a7dmc:/ [134] > vgdisplay -v vg00
--- Volume groups ---
VG Name /dev/vg00
VG Write Access read/write
VG Status available
Max LV 255
Cur LV 12
Open LV 12
Max PV 16
Cur PV 1
Act PV 1
Max PE per PV 4384
VGDA 2
PE Size (Mbytes) 32
Total PE 4374
Alloc PE 4088
Free PE 286
Total PVG 0
Total Spare PVs 0
Total Spare PVs in use 0
--- Logical volumes ---
LV Name /dev/vg00/lvol1
LV Status available/syncd
LV Size (Mbytes) 320
Current LE 10
Allocated PE 10
Used PV 1
LV Name /dev/vg00/lvol2
LV Status available/syncd
LV Size (Mbytes) 4096
Current LE 128
Allocated PE 128
Used PV 1
LV Name /dev/vg00/lvol3
LV Status available/syncd
LV Size (Mbytes) 224
Current LE 7
Allocated PE 7
Used PV 1
LV Name /dev/vg00/lvol4
LV Status available/syncd
LV Size (Mbytes) 5024
Current LE 157
Allocated PE 157
Used PV 1
LV Name /dev/vg00/lvol5
LV Status available/syncd
LV Size (Mbytes) 5024
Current LE 157
Allocated PE 157
Used PV 1
LV Name /dev/vg00/lvol6
LV Status available/syncd
LV Size (Mbytes) 10016
Current LE 313
Allocated PE 313
Used PV 1
LV Name /dev/vg00/lvol7
LV Status available/syncd
LV Size (Mbytes) 5024
Current LE 157
Allocated PE 157
Used PV 1
LV Name /dev/vg00/lvol8
LV Status available/syncd
LV Size (Mbytes) 8000
Current LE 250
Allocated PE 250
Used PV 1
LV Name /dev/vg00/lvfeedData
LV Status available/syncd
LV Size (Mbytes) 23552
Current LE 736
Allocated PE 736
Used PV 1
LV Name /dev/vg00/lvSORT
LV Status available/syncd
LV Size (Mbytes) 384
Current LE 12
Allocated PE 12
Used PV 1
LV Name /dev/vg00/lvASCII
LV Status available/syncd
LV Size (Mbytes) 59136
Current LE 1848
Allocated PE 1848
Used PV 1
LV Name /dev/vg00/lvoracle
LV Status available/syncd
LV Size (Mbytes) 10016
Current LE 313
Allocated PE 313
Used PV 1
--- Physical volumes ---
PV Name /dev/dsk/c2t0d0
PV Status available
Total PE 4374
Free PE 286
Autoswitch On
root@a7dmc:/ [135] >
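The decoded fields in the event detail above can be cross-checked against the raw sense bytes. The following is a minimal sketch in Python, assuming the standard SCSI fixed-format sense data layout (response code 0x70). The function name `decode_fixed_sense` is hypothetical and not part of any HP tool; only the first 18 bytes of the "Undecoded Sense Data" dump are used.

```python
# First 18 bytes of the "Undecoded Sense Data" dump from the event above.
SENSE_HEX = "F0 00 03 01 4B F6 66 28 00 00 00 00 11 01 00 80 00 3F"


def decode_fixed_sense(raw: bytes) -> dict:
    """Decode the leading fields of SCSI fixed-format sense data.

    Assumes the fixed format (response code 0x70/0x71); this is an
    illustrative helper, not an HP-UX or STM API.
    """
    if len(raw) < 18:
        raise ValueError("need at least 18 bytes of sense data")
    return {
        "valid": bool(raw[0] & 0x80),            # Information field valid bit
        "response_code": raw[0] & 0x7F,          # 0x70 = current error, fixed format
        "sense_key": raw[2] & 0x0F,              # 0x03 = MEDIUM ERROR
        "information": int.from_bytes(raw[3:7], "big"),
        "additional_length": raw[7],
        "asc": raw[12],                          # Additional Sense Code
        "ascq": raw[13],                         # Additional Sense Code Qualifier
        "sksv": bool(raw[15] & 0x80),            # Sense Key Specific data valid
        "retry_count": int.from_bytes(raw[16:18], "big"),
    }


fields = decode_fixed_sense(bytes.fromhex(SENSE_HEX))
for name, value in fields.items():
    shown = value if isinstance(value, bool) else hex(value)
    print(f"{name:18}: {shown}")
```

The values agree with the decoded section of the log: sense key 0x03, ASC/ASCQ 0x11/0x01, and a retry count of 0x3F (63). For a disk, the Information field (0x014BF666 here) normally holds the logical block address associated with the error, which is an assumption worth confirming against the drive documentation.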
I have sourced a new disk in the meantime. Is there a way I can avoid reinstalling the OS, database and application?
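For sizing the replacement disk, the vgdisplay figures above can be sanity-checked with a little arithmetic. This is only a sketch in Python: the numbers are copied from the output in this post, the helper name `to_mb` is made up for illustration, and the script only checks the reported totals. It does not perform or describe any recovery step.

```python
# Values copied from the "vgdisplay -v vg00" output above.
PE_SIZE_MB = 32
TOTAL_PE = 4374
FREE_PE = 286

# "Current LE" per logical volume, as listed by vgdisplay.
LV_EXTENTS = {
    "lvol1": 10,
    "lvol2": 128,
    "lvol3": 7,
    "lvol4": 157,
    "lvol5": 157,
    "lvol6": 313,
    "lvol7": 157,
    "lvol8": 250,
    "lvfeedData": 736,
    "lvSORT": 12,
    "lvASCII": 1848,
    "lvoracle": 313,
}


def to_mb(extents: int) -> int:
    """Convert a number of physical extents to megabytes."""
    return extents * PE_SIZE_MB


allocated = sum(LV_EXTENTS.values())

print("Allocated PE:", allocated)
print("Allocated MB:", to_mb(allocated))
print("Total VG MB :", to_mb(TOTAL_PE))
print("Consistent with Total PE:", allocated + FREE_PE == TOTAL_PE)
```

The sum of the logical extents matches the reported Alloc PE, and since the volume group shows Cur PV 1, all of those extents sit on /dev/dsk/c2t0d0.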