Hello All,
Here is a snipet from our cluster.log, I was wondering if anyone could shed some light on what may have caused the failover.
The first two lines indicate a possible memory issue which I am currently looking into.
Quote:
Nov 7 16:30:21 server_01 grpsvcs[16000]: (Recorded using libct_ffdc.a cv 2):::Error ID: 6xYcC4/BO8I3/c2C/4Im5t....................:::Reference ID: :::Template ID: 463a893d:::Details File: :::Location: RSCT,pgsd.C,1.51,195 :::GS_ERROR_ER Internal logic error in Group Services daemon DIAGNOSTIC EXPLANATION Memory allocation failed. Please check the memory availability.
Nov 7 16:30:21 server_01 grpsvcs[16000]: (Recorded using libct_ffdc.a cv 2):::Error ID: 6xYcC4/BO8I3/Ysc/4Im5t....................:::Reference ID: :::Template ID: 463a893d:::Details File: :::Location: RSCT,pgsd.C,1.51,195 :::GS_ERROR_ER Internal logic error in Group Services daemon DIAGNOSTIC EXPLANATION Memory allocation failed. Please check the memory availability.
Nov 7 16:32:10 server_01 clstrmgrES[17318]: Tue Nov 7 16:32:10 SendInfoBcast: ha_gs_send_message() failed rc=1
Nov 7 16:32:10 server_01 clstrmgrES[17318]: Tue Nov 7 16:32:10 clstrmgr on node 1 is exiting with code 4
Nov 7 16:32:10 server_01 haemd[16528]: LPP=PSSP,Fn=emd_gsi.c,SID=1.4.1.33,L#=1361, haemd: 2521-032 Cannot dispatch group services (1).
Nov 7 16:32:11 server_01 clsmuxpdES[17574]: clRGInfoGetRGHandle() failed, error: : The system call does not exist on this system.
Nov 7 16:32:11 server_01 clsmuxpdES[17574]: Error from ha_em_receive_response(): EMAPI error number 10 EMAPI error message 2521-649 An attempt to receive a command response was unsuccessful; read() detected end-of-file; connection with Event Manager lost. : The system call does not exist on this system.
Nov 7 16:32:11 server_01 clsmuxpdES[17574]: Event Manager API Disconnected:: The system call does not exist on this system.
Nov 7 16:32:11 server_01 snmpd[14998]: NOTICE: SMUX packet from (127.0.0.1+32771+1)
Nov 7 16:32:11 server_01 snmpd[14998]: NOTICE: SMUX trap: (6 10) (127.0.0.1+32771+1)
Nov 7 16:32:11 server_01 snmpd[14998]: NOTICE: SMUX packet from (127.0.0.1+32771+1)
Nov 7 16:32:11 server_01 snmpd[14998]: NOTICE: SMUX trap: (6 11) (127.0.0.1+32771+1)
Nov 7 16:32:12 server_01 snmpd[14998]: NOTICE: SMUX packet from (127.0.0.1+32771+1)
Nov 7 16:32:12 server_01 snmpd[14998]: NOTICE: SMUX trap: (6 15) (127.0.0.1+32771+1)
Nov 7 16:32:12 server_01 HACMP for AIX: clexit.rc : Unexpected termination of clstrmgrES.
Nov 7 16:32:12 server_01 HACMP for AIX: clexit.rc : Halting system immediately!!!
Nov 7 17:29:19 server_01 RMCdaemon[11610]: (Recorded using libct_ffdc.a cv 2):::Error ID: 6eKora0TF9I3/6V2/4Im5t....................:::Reference ID: :::Template ID: a6df45aa:::Details File: :::Location: RSCT,rmcd.c,1.34,196 :::RMCD_INFO_0_ST The daemon is started.
Nov 7 17:29:19 server_01 ctcasd[11870]: (Recorded using libct_ffdc.a cv 2):::Error ID: 6YzeY.1TF9I3/UeV/4Im5t....................:::Reference ID: :::Template ID: c092afe4:::Details File: :::Location: rsct.core.sec,ctcas_main.c,1.13,295 :::ctcasd Daemon Started
Thanks.