VCS Crashing due to inconsistency in opt (managed by VxvM)


 
# 1  
Old 03-02-2012

We have a Sun server running Solaris 10 and Veritas Cluster Server. The RAID volumes in the server (/, swap, opt, var, usr) are managed by VxVM, and a UFS file system is built on each of these volumes.

Lately the system has been crashing due to an inconsistency in the opt filesystem. After each reboot we ran fsck on opt multiple times and then booted the system to multiuser mode, but the system crashes again once the cluster comes up. The following is the panic message:



panic[cpu1]/thread=3000d19a6c0: alloccgblk: can't find blk in cyl, pos:0, i:377, fs:/opt bno: 300


000002a102c50fb0 ufs:real_panic_v+60 (0, 19017f8, 2a102c51250, 30003bea000, 0, 600080e7d40)
%l0-3: 000006000832a000 0000000000090000 000006000e5d64c0 0000000000000300
%l4-7: 0000000000000180 0000000000000000 0000000000000064 0000000001826c00
000002a102c51060 ufs:ufs_fault_v+c8 (600085c7180, 19017f8, 2a102c51250, 6000d957648, 60006b2a2a8, 0)
%l0-3: 000006000832a000 0000000000090000 000006000e5d64c0 0000000000000300
%l4-7: 0000000000000180 0000000000000000 0000060006b2a200 0000000000000000
000002a102c51110 ufs:ufs_fault+1c (600085c7180, 19017f8, 0, 179, 6000832a0d4, 300)
%l0-3: 000006000832a000 0000000000090000 000006000e5d64c0 0000000000000300
%l4-7: 0000000000000180 0000000000000479 0000000000000179 000006000832a560
000002a102c511c0 ufs:alloccgblk+4c8 (1901400, 6000e5d6000, 0, 6000d957648, 2188, 0)
%l0-3: 000006000832a000 0000000000090000 000006000e5d64c0 0000000000000300
%l4-7: 0000000000000180 0000000000000479 0000000000000179 000006000832a560
000002a102c51270 ufs:alloccg+144 (90000, 60006b2a2a8, 662188, 2000, 90255, 6000e5d64c0)
%l0-3: 000006000e5d6000 000006000d957648 000006000832a2d8 0000000000000880
%l4-7: 0000000000000088 0000060006b2a200 000006000832a000 0000000000090255
000002a102c51320 ufs:hashalloc+24 (6000e8db878, 88, 662188, 2000, 122e8d0, 2a102c51480)
%l0-3: 0000060006b2a200 000006000e8db878 000006000832a000 0000000000000003
%l4-7: 0000060006b2a200 0000000000002000 0000000000000088 0000000000000088
000002a102c513d0 ufs:alloc+128 (0, 662188, 34d4f00, 2a102c51690, 600004040b8, 6000832a000)
%l0-3: 0000060006b2a200 000006000e8db878 0000000001e6a130 0000000000000003
%l4-7: 0000000000000000 0000000000002000 0000000000002000 0000000000000010
000002a102c51490 ufs:bmap_write+c40 (0, 2000, 2a102c515e8, 10, 0, 6000e8db878)
%l0-3: 0000060006b2a200 0000000000000000 0000000000662188 000002a102c515e8
%l4-7: 000000000000001c 0000000000661f48 0000000000000007 000006000d237d10
000002a102c516a0 ufs:wrip+448 (0, 2a102c51a98, ffffffffff, 2000, 6000e8db878, 8000)
%l0-3: 0000000000026000 0000000000000001 0000000000000000 0000000000000000
%l4-7: 0000060006b2a2a8 0000000000028000 0000000000000000 0000000000002000
000002a102c51810 ufs:ufs_write+580 (6000e8cdb80, 2a102c51a98, 8, 60006b2a248, 1, 6000e8db878)
%l0-3: 000006000e8db898 000006000e8db958 000006000e8db960 0000000000000001
%l4-7: 00000000019004f4 000006000e8db9b8 0000060006b2a200 0000000000000000
000002a102c51930 genunix:fop_write+20 (6000e8cdb80, 2a102c51a98, 8, 600004040b8, 0, 123ed74)
%l0-3: 0000000000002000 000006000e8cdb80 0000000000000000 000000000104db10
%l4-7: 0000000000002000 0000000000026000 0000000000000008 000000000000210a
000002a102c519e0 genunix:write+268 (1, 8058, 600155cb008, 2000, 210a, 1)
%l0-3: 0000000000000000 000006000e8cdb80 0000000000000000 000000000104db10
%l4-7: 0000000000002000 0000000000026000 0000000000000008 000000000000210a

syncing file systems... [1] 34 [1] 28 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 [1] 5 done (not all i/o completed)

In the vxprint -th output I see that some parameters of the two plexes of opt have different values, as shown below:

dm rootdisk c0t0d0s2 auto 20351 143328960 -
dm rootmirror c0t1d0s2 auto 9919 143328960 -

v opt - ENABLED SYNC 110796288 ROUND - fsgen
pl opt-01 opt ENABLED ACTIVE 110796288 CONCAT - RW
sd rootdisk-03 opt-01 rootdisk 32532672 110796288 0 c0t0d0 ENA
pl opt-02 opt ENABLED ACTIVE 110796288 CONCAT - RW
sd rootmirror-05 opt-02 rootmirror 32512320 110796288 0 c0t1d0 ENA


Please help me with a fix.
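
The vxprint -th records above can be read field by field to see exactly which values differ between the two plexes. The following is a minimal Python sketch, not a VxVM tool: the column names in FIELDS are an assumption taken from the header rows that vxprint -th normally prints for each record type (the poster did not include those rows), and parse_vxprint and differing_fields are hypothetical helper names.

```python
# Column names are assumed from the header rows that `vxprint -th`
# normally prints per record type; they are not shown in the post.
FIELDS = {
    "dm": ["name", "device", "type", "privlen", "publen", "state"],
    "v":  ["name", "rvg", "kstate", "state", "length", "readpol", "prefplex", "utype"],
    "pl": ["name", "volume", "kstate", "state", "length", "layout", "ncol_wid", "mode"],
    "sd": ["name", "plex", "disk", "diskoffs", "length", "offset", "device", "mode"],
}

# The records exactly as posted in the thread.
SAMPLE = """\
dm rootdisk c0t0d0s2 auto 20351 143328960 -
dm rootmirror c0t1d0s2 auto 9919 143328960 -
v opt - ENABLED SYNC 110796288 ROUND - fsgen
pl opt-01 opt ENABLED ACTIVE 110796288 CONCAT - RW
sd rootdisk-03 opt-01 rootdisk 32532672 110796288 0 c0t0d0 ENA
pl opt-02 opt ENABLED ACTIVE 110796288 CONCAT - RW
sd rootmirror-05 opt-02 rootmirror 32512320 110796288 0 c0t1d0 ENA
"""


def parse_vxprint(text):
    """Turn each recognised record line into a dict keyed by column name."""
    records = []
    for line in text.splitlines():
        parts = line.split()
        if not parts or parts[0] not in FIELDS:
            continue  # skip blank lines and unknown record types
        rec = dict(zip(FIELDS[parts[0]], parts[1:]))
        rec["rectype"] = parts[0]
        records.append(rec)
    return records


def differing_fields(a, b, ignore=("name",)):
    """Return {field: (value_in_a, value_in_b)} for fields that differ."""
    return {k: (a[k], b[k]) for k in a if k not in ignore and a[k] != b.get(k)}


records = parse_vxprint(SAMPLE)
volume = next(r for r in records if r["rectype"] == "v")
plexes = [r for r in records if r["rectype"] == "pl"]
subdisks = [r for r in records if r["rectype"] == "sd"]

print("volume state:", volume["state"])
print("plex differences:", differing_fields(plexes[0], plexes[1]))
print("subdisk differences:", differing_fields(subdisks[0], subdisks[1]))
```

Under those assumed column names, the two plex records are identical apart from their names, and the subdisks differ only in the plex, disk, and device they belong to and in their disk offset (32532672 versus 32512320). A subdisk's offset is its position on its own disk, so the two halves of a mirror do not have to match there. The volume state of SYNC may be the more relevant detail, since it usually means the mirror is resynchronizing or still needs to.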
# 2  
Old 05-10-2012
This is probably too late for this issue, assuming you have found a way to fix it already.

For any panic in Solaris 8, 9, or 10 that involves "alloccgblk: can't find blk in cyl, pos",
check one thing before running fsck -o f /disk-in-question:
make sure you have the fix for CR# 6660301.

SunSolve: Bug details for 6660301