PowerHa 7.1 Migration check failed


 
Thread Tools Search this Thread
Operating Systems AIX PowerHa 7.1 Migration check failed
# 1  
Old 04-02-2015
PowerHa 7.1 Migration check failed

Hello Guys,
I was trying to upgrade the PowerHA from 6.1.0.9 to 7.1.3 in prod cluster. Have snapshot migration and run climgcheck.

1. Run climgcheck on node1(kul1pqcmur01) and it success
2. Upgrade the cluster sw
3. Run climgcheck on node2(kul1pqcmur02) and it failed with below errors

HTML Code:
Clmigchek ran successfully on node1 and upgrade the cluster filesets
	Climigcheck throws the below error on the node2 and unable to proceed the upgrade
			PowerHA System Mirror migration check program.
			Run date: Tue Mar 31 16:05:25 MYT 2015 on machine kul1pqcmur02
			ck_user: Called
			ck_version: Called
			ck_version: Version check is OK
			write_rhosts: Called
			write_rhosts: ERROR - Unable to get hostname from node kul1pqcmur01 using cl_rsh
			Error termination on: Tue Mar 31 16:05:25 MYT 2015 
Action taken:
1. Ensure the rhosts file has the proper entry on both nodes
2. Ensure the rhosts permission has set to 600

Due to time constraint, have rollback powerHA to 6.1.0.9 but i wonder how the rsh work in 6.1.0.9 with the same rhosts entry.

Did anyone of you faced this kind of issue?

Appreciate your help and reply.

CheersSmilie
# 2  
Old 04-03-2015
Try running the script on the second node with verbose logging at maximum to see if it reveals more information about the failure. Also, do the regular checks, ping the host by name and by ip
Code:
export VERBOSE_LOGGING=high
clmigcheck

# 3  
Old 04-15-2015
Couldn't find what is the problem

HTML Code:
# export VERBOSE_LOGGING=high
# clmigcheck
clmigcheck[4011]: /usr/es/sbin/cluster/utilities/cl_get_path -S
clmigcheck[4011]: 2> /dev/null
clmigcheck[4011]: OP_SEP='~'
clmigcheck[4012]: OP_SEP='~'
clmigcheck[4013]: MPING_PARAMS4='-v -c 5 -a 228.168.101.43'
clmigcheck[4014]: MPING_PARAMS6='-v -c 5 -6'
clmigcheck[4015]: MPING_PARAMS='-v -c 5 -a 228.168.101.43'
clmigcheck[4021]: trap cleanup 0 11
clmigcheck[4022]: trap user_exit 1 2 15
clmigcheck[4028]: ERR=/tmp/5308538.err
clmigcheck[4034]: [ -f /tmp/clmigcheck ]
clmigcheck[4039]: [[ -d /tmp/clmigcheck ]]
clmigcheck[4041]: [ -f /tmp/clmigcheck/clmigcheck.log ]
clmigcheck[4043]: print 'Saving existing /tmp/clmigcheck/clmigcheck.log to /tmp/clmigcheck/clmigcheck.log.bak'
Saving existing /tmp/clmigcheck/clmigcheck.log to /tmp/clmigcheck/clmigcheck.log.bak
clmigcheck[4044]: mv -f /tmp/clmigcheck/clmigcheck.log /tmp/clmigcheck/clmigcheck.log.bak
clmigcheck[4044]: 2> /dev/null
clmigcheck[4062]: date
clmigcheck[4062]: hostname
clmigcheck[4062]: echo 'PowerHA System Mirror migration check program.\nRun date: Wed Apr 15 15:53:29 MYT 2015 on machine kul1pqcmur02\n'
clmigcheck[4062]: 1> /tmp/clmigcheck/clmigcheck.log
clmigcheck[4071]: [ '' = -h ]
clmigcheck[4071]: [ '' = '-?' ]
clmigcheck[4080]: [ 0 != 0 ]
clmigcheck[4101]: echo
clmigcheck[4101]: sed 's/\-//g'
clmigcheck[4101]: cmdline_flags=''
clmigcheck[4107]: ck_user
ck_user[971]: log 'ck_user: Called\n'
log[896]: echo ck_user: 'Called\n'
log[896]: 2> /dev/null log[897]: return 0
ck_user[973]: whoami
ck_user[973]: [ root != root ]
clmigcheck[4108]: ck_version
ck_version[1010]: log 'ck_version: Called\n'
log[896]: echo ck_version: 'Called\n'
log[896]: 2> /dev/null log[897]: return 0
ck_version[1011]: rm -f /tmp/5308538.err
ck_version[1011]: 1> /dev/null 2>& 1
ck_version[1014]: lslpp -lcqOr cluster.es.server.rte
ck_version[1014]: 2> /dev/null
ck_version[1014]: cut -f3 -d:
ck_version[1014]: read version release mod fix
ck_version[1014]: IFS=.
ck_version[1021]: (( 6 <= 6 ))
ck_version[1023]: log 'ck_version: Version check is OK\n'
log[896]: echo ck_version: Version check is 'OK\n'
log[896]: 2> /dev/null log[897]: return 0
ck_version[1031]: return 0
clmigcheck[4114]: write_rhosts
write_rhosts[1172]: log 'write_rhosts: Called\n'
log[896]: echo write_rhosts: 'Called\n'
log[896]: 2> /dev/null log[897]: return 0
write_rhosts[1177]: 1> /tmp/rhosts
write_rhosts[1182]: odmget -q 'object = VERBOSE_LOGGING' HACMPnode
write_rhosts[1182]: ODMDIR=/etc/objrepos
write_rhosts[1183]: grep name
write_rhosts[1184]: cut -d '"' -f2
write_rhosts[1182]: nlist=$'kul1pqcmur01\nkul1pqcmur02'
write_rhosts[1188]: /usr/es/sbin/cluster/utilities/cl_rsh -n kul1pqcmur01 hostname
rshexec: cannot connect to node kul1pqcmur01
write_rhosts[1188]: hname=''
write_rhosts[1190]: (( 1 == 0 ))
write_rhosts[1196]: log 'write_rhosts: ERROR - Unable to get hostname from node kul1pqcmur01 using cl_rsh\n'
log[896]: echo write_rhosts: ERROR - Unable to get hostname from node kul1pqcmur01 using 'cl_rsh\n'
log[896]: 2> /dev/null log[897]: return 0
write_rhosts[1197]: error $'Unable to get hostname from node kul1pqcmur01\nusing cl_rsh\\n'
error[529]: syntax=$'Unable to get hostname from node kul1pqcmur01\nusing cl_rsh\\n'
error[529]: typeset syntax
error[531]: [[ $'Unable to get hostname from node kul1pqcmur01\nusing cl_rsh\\n' == no ]]
error[531]: [[ $'Unable to get hostname from node kul1pqcmur01\nusing cl_rsh\\n' == yes ]]
error[536]: print $'\\nERROR:  Unable to get hostname from node kul1pqcmur01\nusing cl_rsh\\n'
error[536]: 1>& 2

ERROR:  Unable to get hostname from node kul1pqcmur01
using cl_rsh

error[539]: [[ -s /tmp/5308538.err ]]
error[548]: print
error[548]: 1>& 2

error[550]: [[ $'Unable to get hostname from node kul1pqcmur01\nusing cl_rsh\\n' == yes ]]
error[553]: date
error[553]: log 'Error termination on: Wed Apr 15 15:53:29 MYT 2015 \n'
log[896]: echo Error termination on: Wed Apr 15 15:53:29 MYT 2015 '\n'
log[896]: 2> /dev/null log[897]: return 0
error[554]: exit 1
clmigcheck[1]: cleanup
# 4  
Old 04-15-2015
Moved to AIX board.
# 5  
Old 04-15-2015
This is the problem:
Quote:
write_rhosts[1188]: /usr/es/sbin/cluster/utilities/cl_rsh -n kul1pqcmur01 hostname
rshexec: cannot connect to node kul1pqcmur01
Check /etc/cluster/rhosts on both sides and restart clcomd
# 6  
Old 04-16-2015
Both the node having the below entry
cat /etc/cluster/rhosts
kul1pqcmur01
kul1pqcmur02


Restarted the clcomd in kul1pqcmur01 node & restarted clcomd & clcomdES in kul1pqcmur02, tried to run the date command and it fails with the same error
# 7  
Old 04-20-2015
there is no magic in clcomd. If cl_rsh fails, your /etc/cluster/rhosts configuration is wrong. It may be a problem with your /etc/hosts and/or DNS resolution. Try to enter all possible host names and IP addresses of both your hosts into rhosts.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Solaris

Command failed for target 'check-recursive' error

Sorry if this is the wrong place for this. I haven't done UNIX Admin in a long time and am trying to update the utilities on a Solaris server so that I can get Apache 2.4.25 installed. I am finding a lot of utilities that are just too far out of date and some versions are no longer supported. So... (0 Replies)
Discussion started by: PJ_Namias
0 Replies

2. AIX

AIX - FC Switch migration, SAN Migration question!

I'm New to AIX / VIOS We're doing a FC switch cutover on an ibm device, connected via SAN. How do I tell if one path to my remote disk is lost? (aix lvm) How do I tell when my link is down on my HBA port? Appreciate your help, very much! (4 Replies)
Discussion started by: BG_JrAdmin
4 Replies

3. AIX

Powerha on AIX 6.1.9

hello ive installed powerha 7.1.3 on two servers aix 6.1.9 6100-09-03-1415 work with dmx4 EMC storgae. after sync cluster ( terminate with OK ) ive see that the repository disk upper only in one machine : hdiskpower60 00c7f6b59fc60d9d caavg_private active... (1 Reply)
Discussion started by: ariec
1 Replies

4. Solaris

How to check FTP service failed in between on Solaris 10?

Hi Guys, I need to check why FTP service on Solaris 10 machine did not work in between. What does Jun-29 means here? Does it mean that it has been RUNNING/UP since Jun 29th ? -bash-3.2$ sudo /usr/bin/svcs |grep -i ftp legacy_run Jun_29 lrc:/etc/rc3_d/S90proftpd online ... (3 Replies)
Discussion started by: manalisharmabe
3 Replies

5. Linux

Ping check failed from Nagios master server on windows hosts in the same subnet

Hello All, We have added a windows host and its config files to Nagios master server and wanted to do a ping check alone at the moment however, the nagios master server identifies the host in its GUI and immediately disappears can anyone let me know the right approach to this one, We want to... (2 Replies)
Discussion started by: lovesaikrishna
2 Replies

6. High Performance Computing

Non global zone check failed error

Hi All, When i was installing patches in the Solaris 10 machine. It does not install properly and gave the below message =========================================== Approved patches will be installed in this order: 126868-02 Preparing checklist for non-global zone check... Checking... (1 Reply)
Discussion started by: jegaraman
1 Replies

7. Solaris

Solaris Zone : Non global Zone check failed

Hi All , I try to install some packages in my global zone... On the execution of the installion of the script it quits by saying the error "Non global zone check failed" Kindly help me in this regard Thanks in advance, jeganr (7 Replies)
Discussion started by: jegaraman
7 Replies

8. UNIX for Advanced & Expert Users

*** [Gentoo] sanity check failed! ***

I faced the following error while configuring the spine for cacti. Can any one help me to sort out this problem: hecking how to run the C++ preprocessor... g++ -E checking for g77... g77 checking whether we are using the GNU Fortran 77 compiler... yes checking whether g77 accepts -g... yes... (1 Reply)
Discussion started by: praveen_b744
1 Replies

9. AIX

ftp check for failed attempts

Hi, I have created the below ftp script to put files over to our capacity server, the check at the end works if ftp fails to run however if the script cannot login or the transfer itself failed there is no warnings. Does anyone know the syntax to trap the erorr codes or to put a check within... (3 Replies)
Discussion started by: chlawren
3 Replies

10. Shell Programming and Scripting

Failed to check status code in "rsh" command

Hi folks, I wrote a ksh program which run scripts from remote server. To check the status code I wrote the following function: check_remote_status() { status_code=`tail -1 $installLog` if ] ; then echo $errMsg | tee -a $installLog exit 1 else echo $validMsg >> $installLog fi... (9 Replies)
Discussion started by: nir_s
9 Replies
Login or Register to Ask a Question