Serial Split Brain detected in solaris10


 
# 8  
Old 04-02-2010
Actually, what you have already provided looks good enough. It looks like you lost contact with these disks:

Code:
myapp-dg02     c1t3d0s2     1267475616.65.servername        /dev/vx/rdmp/c1t3d0s2    
myapp-dg01     c1t2d0s2     1267475608.63.servername        /dev/vx/rdmp/c1t2d0s2

Judging by the naming, I am guessing these are internal disks. I would also guess that something was still ongoing and that you didn't reboot with 'init 6'.

This should work for you:

Code:
/usr/sbin/vxdg -Cf -o selectcp=1267475633.69.servername import myapp-dg

# 9  
Old 04-02-2010
Actually, I did use init 6, but while booting the system showed a lot of SCSI disconnect errors or similar SCSI messages. I decided to send a break, but that did not work, so I powered off from the SC, reinserted the SCSI cable, and rebooted again.

Your command worked:

Code:
/usr/sbin/vxdg -Cf -o selectcp=1267475633.69.servername import myapp-dg

Now,
-bash-3.00# vxdisk list
Code:
DEVICE       TYPE            DISK         GROUP        STATUS
c1t0d0s2     auto:none       -            -            online invalid
c1t1d0s2     auto:cdsdisk    -            -            online
c1t2d0s2     auto:cdsdisk    myapp-dg01  myapp-dg    online spare
c1t3d0s2     auto:cdsdisk    myapp-dg02  myapp-dg    online
c2t0d0s2     auto:cdsdisk    myapp-dg03  myapp-dg    online
c2t1d0s2     auto:cdsdisk    myapp-dg04  myapp-dg    online
c2t2d0s2     auto:cdsdisk    myapp-dg05  myapp-dg    online
c2t8d0s2     auto:cdsdisk    myapp-dg06  myapp-dg    online
c2t9d0s2     auto:cdsdisk    myapp-dg07  myapp-dg    online failing
c2t10d0s2    auto:cdsdisk    myapp-dg08  myapp-dg    online failing

-bash-3.00# vxprint -htr
Code:
Disk group: myapp-dg

DG NAME         NCONFIG      NLOG     MINORS   GROUP-ID
ST NAME         STATE        DM_CNT   SPARE_CNT         APPVOL_CNT
DM NAME         DEVICE       TYPE     PRIVLEN  PUBLEN   STATE
RV NAME         RLINK_CNT    KSTATE   STATE    PRIMARY  DATAVOLS  SRL
RL NAME         RVG          KSTATE   STATE    REM_HOST REM_DG    REM_RLNK
CO NAME         CACHEVOL     KSTATE   STATE
VT NAME         RVG          KSTATE   STATE    NVOLUME
V  NAME         RVG/VSET/CO  KSTATE   STATE    LENGTH   READPOL   PREFPLEX UTYPE
PL NAME         VOLUME       KSTATE   STATE    LENGTH   LAYOUT    NCOL/WID MODE
SD NAME         PLEX         DISK     DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
SV NAME         PLEX         VOLNAME  NVOLLAYR LENGTH   [COL/]OFF AM/NM    MODE
SC NAME         PLEX         CACHE    DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
DC NAME         PARENTVOL    LOGVOL
SP NAME         SNAPVOL      DCO
EX NAME         ASSOC        VC                       PERMS    MODE     STATE
SR NAME         KSTATE

dg myapp-dg    default      default  95000    1267475668.79.servername

dm myapp-dg01  c1t2d0s2     auto     65536    143283520 SPARE
dm myapp-dg02  c1t3d0s2     auto     65536    143283520 -
dm myapp-dg03  c2t0d0s2     auto     65536    71061376 -
dm myapp-dg04  c2t1d0s2     auto     65536    71061376 -
dm myapp-dg05  c2t2d0s2     auto     65536    71061376 -
dm myapp-dg06  c2t8d0s2     auto     65536    71061376 -
dm myapp-dg07  c2t9d0s2     auto     65536    71061376 FAILING
dm myapp-dg08  c2t10d0s2    auto     65536    71061376 FAILING

v  myapp-vxvol -            DISABLED ACTIVE   355305600 RAID     -        raid5
pl myapp-vxvol-01 myapp-vxvol DISABLED ACTIVE 355305600 RAID    6/32     RW
sd myapp-dg02-01 myapp-vxvol-01 myapp-dg02 0 71061120 0/0      c1t3d0   ENA
sd myapp-dg03-01 myapp-vxvol-01 myapp-dg03 0 71061120 1/0      c2t0d0   FAIL
sd myapp-dg04-01 myapp-vxvol-01 myapp-dg04 0 71061120 2/0      c2t1d0   FAIL
sd myapp-dg05-01 myapp-vxvol-01 myapp-dg05 0 71061120 3/0      c2t2d0   FAIL
sd myapp-dg06-01 myapp-vxvol-01 myapp-dg06 0 71061120 4/0      c2t8d0   FAIL
sd myapp-dg07-01 myapp-vxvol-01 myapp-dg07 0 71061120 5/0      c2t9d0   ENA
pl myapp-vxvol-03 myapp-vxvol DISABLED LOG   5760     CONCAT    -        RW
sd myapp-dg01-01 myapp-vxvol-03 myapp-dg01 0 5760    0         c1t2d0   ENA

# 10  
Old 04-02-2010
Ok, that makes sense for the failure, but it looks like your internal disk controller could be on the way out.

You now need to clear the failing state on these disks, since you know what caused it.

Code:
vxedit -g myapp-dg set failing=off myapp-dg07
vxedit -g myapp-dg set failing=off myapp-dg08

then start the volumes:
Code:
vxvol -g myapp-dg startall
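
If more disks are flagged, the vxedit lines above can be generated from the vxdisk list output instead of typed by hand. This is only a minimal sketch: clear_failing_cmds is a hypothetical helper name, it assumes the column layout shown earlier in this thread (DEVICE TYPE DISK GROUP STATUS), and it only prints the commands so they can be reviewed before anything is run.

```shell
# Print (do not run) one vxedit command per disk that `vxdisk list`
# reports with the "failing" flag in its STATUS columns.
# Input: `vxdisk list` output on stdin.
# Assumed columns: DEVICE TYPE DISK GROUP STATUS...
clear_failing_cmds() {
    awk '$1 != "DEVICE" && $3 != "-" {
        # STATUS can span several words ("online failing"), so scan from field 5 on
        for (i = 5; i <= NF; i++)
            if ($i == "failing")
                print "vxedit -g " $4 " set failing=off " $3
    }'
}
```

Typical use would be `vxdisk list | clear_failing_cmds`, then checking the printed lines and running them by hand. Only clear the flag once the cause of the I/O errors is understood, as in this case.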

# 11  
Old 04-02-2010
Quote:
Originally Posted by reborg
Ok, that makes sense for the failure, but it looks like your internal disk controller could be on the way out.

You now need to clear the failing state on these disks, since you know what caused it.

Code:
vxedit -g myapp-dg set failing=off myapp-dg07
vxedit -g myapp-dg set failing=off myapp-dg08

then start the volumes:
Code:
vxvol -g myapp-dg startall


Thanks for helping me out here. I appreciate it.

The failing flag on the disks is cleared.

I got the message below after the vxvol start command:

Code:
VxVM vxvol ERROR V-5-1-10127 changing volume myapp-vxvol:
        Raid volume would be unusable



vxprint -htr

Code:
Disk group: myapp-dg

DG NAME         NCONFIG      NLOG     MINORS   GROUP-ID
ST NAME         STATE        DM_CNT   SPARE_CNT         APPVOL_CNT
DM NAME         DEVICE       TYPE     PRIVLEN  PUBLEN   STATE
RV NAME         RLINK_CNT    KSTATE   STATE    PRIMARY  DATAVOLS  SRL
RL NAME         RVG          KSTATE   STATE    REM_HOST REM_DG    REM_RLNK
CO NAME         CACHEVOL     KSTATE   STATE
VT NAME         RVG          KSTATE   STATE    NVOLUME
V  NAME         RVG/VSET/CO  KSTATE   STATE    LENGTH   READPOL   PREFPLEX UTYPE
PL NAME         VOLUME       KSTATE   STATE    LENGTH   LAYOUT    NCOL/WID MODE
SD NAME         PLEX         DISK     DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
SV NAME         PLEX         VOLNAME  NVOLLAYR LENGTH   [COL/]OFF AM/NM    MODE
SC NAME         PLEX         CACHE    DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
DC NAME         PARENTVOL    LOGVOL
SP NAME         SNAPVOL      DCO
EX NAME         ASSOC        VC                       PERMS    MODE     STATE
SR NAME         KSTATE

dg myapp-dg    default      default  95000    1267475668.79.servername

dm myapp-dg01  c1t2d0s2     auto     65536    143283520 SPARE
dm myapp-dg02  c1t3d0s2     auto     65536    143283520 -
dm myapp-dg03  c2t0d0s2     auto     65536    71061376 -
dm myapp-dg04  c2t1d0s2     auto     65536    71061376 -
dm myapp-dg05  c2t2d0s2     auto     65536    71061376 -
dm myapp-dg06  c2t8d0s2     auto     65536    71061376 -
dm myapp-dg07  c2t9d0s2     auto     65536    71061376 -
dm myapp-dg08  c2t10d0s2    auto     65536    71061376 -

v  myapp-vxvol -            DETACHED CLEAN    355305600 RAID     -        raid5
pl myapp-vxvol-01 myapp-vxvol ENABLED(SPARSE) ACTIVE 355305600 RAID 6/32 RW
sd myapp-dg02-01 myapp-vxvol-01 myapp-dg02 0 71061120 0/0      c1t3d0   ENA
sd myapp-dg03-01 myapp-vxvol-01 myapp-dg03 0 71061120 1/0      c2t0d0   RCOV
sd myapp-dg04-01 myapp-vxvol-01 myapp-dg04 0 71061120 2/0      c2t1d0   RCOV
sd myapp-dg05-01 myapp-vxvol-01 myapp-dg05 0 71061120 3/0      c2t2d0   RCOV
sd myapp-dg06-01 myapp-vxvol-01 myapp-dg06 0 71061120 4/0      c2t8d0   RCOV
sd myapp-dg07-01 myapp-vxvol-01 myapp-dg07 0 71061120 5/0      c2t9d0   ENA
pl myapp-vxvol-03 myapp-vxvol ENABLED LOG    5760     CONCAT    -        RW
sd myapp-dg01-01 myapp-vxvol-03 myapp-dg01 0 5760    0         c1t2d0   ENA

And you could be right about the SCSI controller, but I think both the StorEdge and the internal disks use the same onboard controller?
# 12  
Old 04-02-2010
Looks like vxrecover kicked in correctly.

What does
Code:
vxtask list

show?
# 13  
Old 04-02-2010
vxtask list
Code:
TASKID  PTID TYPE/STATE    PCT   PROGRESS
-bash-3.00#

# 14  
Old 04-02-2010
OK, that shows nothing, but all the subdisks are recoverable, so that is good.

First try the safest option:

Code:
vxrecover -g myapp-dg myapp-vxvol
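
To see whether the recovery has taken effect, the subdisk modes in the vxprint -htr output can be filtered before and after running vxrecover. This is a rough sketch, not a VxVM tool: sd_not_enabled is a hypothetical helper name, and it assumes the sd record layout shown in this thread, with the mode (ENA, FAIL, RCOV) in the last column.

```shell
# List subdisks whose MODE (last column of an "sd" record) is not ENA.
# Input: `vxprint -htr` output on stdin.
# Returns non-zero if at least one such subdisk is found.
sd_not_enabled() {
    awk '$1 == "sd" && $NF != "ENA" { print $2, $NF; bad = 1 }
         END { exit (bad ? 1 : 0) }'
}
```

For example, `vxprint -g myapp-dg -htr | sd_not_enabled` would print one line per subdisk that is still in FAIL or RCOV, and print nothing once every subdisk is back to ENA.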
