When I try to move an RG or take it offline, I get this error for every RG:
Then the RG (RG_XXX) goes into ERROR state and the cluster becomes UNSTABLE.
I have tried to unmount everything, vary off the VGs, and run smit hacmp -> Problem Determination Tools -> Recover From HACMP Script Failure, but it didn't help.
I had to reboot both nodes to get the cluster back to STABLE state.
Then I started up all RGs successfully.
Then I manually ran the stop script for RG_XXX and all applications were stopped successfully. After that I unmounted the filesystems, ran varyoffvg, and tried to take RG_XXX offline via HACMP, but I got the same error as above.
Could you tell me what the problem could be, given that none of the RGs can be taken offline or moved to another node?
Could you give me some hints on where to look or what to check, please?
Did you check whether /tmp/hacmp.out (if that is still the current path) contains more detailed information about the actions you issued?
I think that "node_down_local 1" is just an intermediate status while the RG is being brought down, i.e. not up yet. It does not look like the problem to me, since that event completed with a parameter or status of 0 in the very next lines.
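To see the event sequence without the surrounding noise, something like the following might help. It is only a sketch: the function name hacmp_events is made up for illustration, and it assumes the log marks events with "EVENT START", "EVENT COMPLETED" or "EVENT FAILED", which may differ on your version.

```shell
#!/bin/sh
# hacmp_events is a hypothetical helper, not part of HACMP.
# Assumption: event lines contain "EVENT START", "EVENT COMPLETED"
# or "EVENT FAILED"; adjust the pattern if your log looks different.
hacmp_events() {
    log="${1:-/tmp/hacmp.out}"   # default path as mentioned in this thread
    if [ ! -r "$log" ]; then
        echo "cannot read $log" >&2
        return 1
    fi
    # Keep only the event summary lines, in their original order
    grep -E 'EVENT (START|COMPLETED|FAILED)' "$log"
}
```

Calling hacmp_events with no argument reads /tmp/hacmp.out; pass another path if your logs were redirected.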
While the cluster is in an undefined state, I would check which VGs are still active and which adapters are still up (you should know which network, disk and other resources belong to RG_XXX), to get a clue about which of them might have a problem. Also check whether the scripts you ran for that RG wrote a log somewhere (if they log at all).
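A rough sketch of such a check, run on each node. The helper names run_if_present and check_rg_leftovers are made up for illustration; lsvg -o (varied-on VGs), mount and netstat -in are standard AIX commands, but which ones matter depends on what RG_XXX actually contains.

```shell
#!/bin/sh
# Hypothetical helpers, not part of HACMP or AIX.

# Run a command only if it exists on this host, otherwise say so.
run_if_present() {
    if command -v "$1" >/dev/null 2>&1; then
        "$@"
    else
        echo "skipped: $1 not found"
    fi
}

# Show what is still active on this node after the failed RG stop.
check_rg_leftovers() {
    echo "== varied-on volume groups =="
    run_if_present lsvg -o
    echo "== mounted filesystems =="
    run_if_present mount
    echo "== network interfaces =="
    run_if_present netstat -in
}
```

Run check_rg_leftovers on both nodes and compare the output against the VGs, filesystems and service addresses you expect RG_XXX to own.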
There are also 2 warnings pointing to FFDC event log files under /tmp/ibmsupt/hacmp, which could be investigated with one of the fc* commands in /usr/sbin/rsct/bin, such as fcreport. You might have to check some documentation for that; google for "aix ffdc" and you'll find IBM documentation sites.
Is there anything related in the errpt that might be helpful?
Did this cluster work after it was tested? Have there been changes to the hardware or software that were not followed by cluster tests?
What version of HA/CMP do you use?
Sadly, hacmp.out does not show the detailed information I hoped to see. You can check and set its verbosity via
smitty hacmp -> Problem Determination -> HACMP Log Viewing and Management -> Change/Show HACMP Log File Parameters -> Select your node ...
This is the path on an HA/CMP 5.3 cluster node. Later versions should have a similar path, from what I have seen.