clstat X connection ... broken ?


 
Thread Tools Search this Thread
Special Forums UNIX and Linux Applications Infrastructure Monitoring clstat X connection ... broken ?
# 1  
Old 11-12-2010
clstat X connection ... broken ?

Hi.
Could you help me with this problem ?

When I run clstat then I got this error message:

Code:
1:root@test1:/root # clstat
X connection to localhost:10.0 broken (explicit kill or server shutdown).

I got the same error for both nodes.
If I would run xclstat I would understand that there is something wrong with X server but why clstat ?

Thank you.
# 2  
Old 11-12-2010
Quote:
Originally Posted by phobus
[...]If I would run xclstat I would understand that there is something wrong with X server but why clstat ?[...]
If clstat detects a running X-Server it starts with the X11 GUI automatically.
To make clstat start with the ASCII GUI regardless use clstat -a.
# 3  
Old 11-12-2010
thanks a lot Smilie
# 4  
Old 11-15-2010
Hi again. I was busy.

I would like to ask you what would you recommend me to check if :

Code:
9:root@xxxxxxx:/root # clstat -a
Failed retrieving cluster information.

There are a number of possible causes:
clinfoES or snmpd subsystems are not active.
snmp is unresponsive.
snmp is not configured correctly.
Cluster services are not active on any nodes.

Refer to the HACMP Administration Guide for more information.
Additional information for verifying the SNMP configuration on AIX 6
can be found in /usr/es/sbin/cluster/README5.4.1.UPDATE
9:root@xxxxxxx:/root # cldump

cldump: Waiting for the Cluster SMUX peer (clstrmgrES)
to stabilize.............
Failed retrieving cluster information.

There are a number of possible causes:
clinfoES or snmpd subsystems are not active.
snmp is unresponsive.
snmp is not configured correctly.
Cluster services are not active on any nodes.

Refer to the HACMP Administration Guide for more information.
Additional information for verifying the SNMP configuration on AIX 6
can be found in /usr/es/sbin/cluster/README5.4.1.UPDATE

The OS is :

Code:
5300-11-05-1036

The hacmp version is :

Code:
9:root@xxxxxxx:/root # lslpp -l | grep cluster
  cluster.adt.es.client.include
  cluster.adt.es.client.samples.clinfo
  cluster.adt.es.client.samples.clstat
  cluster.adt.es.client.samples.libcl
  cluster.adt.es.java.demo.monitor
  cluster.es.cfs.rte         5.4.1.5  COMMITTED  ES Cluster File System Support
  cluster.es.client.lib      5.4.1.6  COMMITTED  ES Client Libraries
  cluster.es.client.rte      5.4.1.7  COMMITTED  ES Client Runtime
  cluster.es.client.utils    5.4.1.7  COMMITTED  ES Client Utilities
  cluster.es.client.wsm      5.4.1.5  COMMITTED  Web based Smit
  cluster.es.cspoc.cmds      5.4.1.7  COMMITTED  ES CSPOC Commands
  cluster.es.cspoc.dsh       5.4.1.0  COMMITTED  ES CSPOC dsh
  cluster.es.cspoc.rte       5.4.1.4  COMMITTED  ES CSPOC Runtime Commands
  cluster.es.ercmf.cmds      5.4.1.1  COMMITTED  ES HACMP - ERCMF Commands
  cluster.es.ercmf.rte       5.4.1.0  COMMITTED  ES HACMP - ERCMF PPRC Runtime
  cluster.es.plugins.dhcp    5.4.1.0  COMMITTED  ES Plugins - dhcp
  cluster.es.plugins.dns     5.4.1.0  COMMITTED  ES Plugins - Name Server
  cluster.es.plugins.printserver
  cluster.es.server.cfgast   5.4.1.0  COMMITTED  ES Two-Node Configuration
  cluster.es.server.diag     5.4.1.7  COMMITTED  ES Server Diags
  cluster.es.server.events   5.4.1.7  COMMITTED  ES Server Events
  cluster.es.server.rte      5.4.1.7  COMMITTED  ES Base Server Runtime
  cluster.es.server.simulator
  cluster.es.server.testtool
  cluster.es.server.utils    5.4.1.7  COMMITTED  ES Server Utilities
  cluster.license            5.4.1.1  COMMITTED  HACMP Electronic License
  cluster.msg.En_US.cspoc    5.4.1.0  COMMITTED  HACMP CSPOC Messages - U.S.
  cluster.msg.En_US.es.client
  cluster.msg.En_US.es.server
  cluster.msg.en_US.cspoc    5.4.1.0  COMMITTED  HACMP CSPOC Messages - U.S.
  cluster.msg.en_US.ercmf    5.4.0.0  COMMITTED  HACMP ERCMF Messages - U.S.
  cluster.msg.en_US.es.client
  cluster.msg.en_US.es.server
  cluster.sgn                5.4.0.0  COMMITTED  Signature File for Hacmp
  cluster.sng                5.4.0.0  COMMITTED  Signature File for Hacmp
  cluster.xd.license         5.4.1.0  COMMITTED  HACMP XD Feature License
  cluster.es.client.lib      5.4.1.6  COMMITTED  ES Client Libraries
  cluster.es.client.rte      5.4.1.7  COMMITTED  ES Client Runtime
  cluster.es.cspoc.rte       5.4.0.0  COMMITTED  ES CSPOC Runtime Commands
  cluster.es.ercmf.rte       5.4.1.0  COMMITTED  ES HACMP - ERCMF PPRC Runtime
  cluster.es.server.diag     5.4.1.0  COMMITTED  ES Server Diags
  cluster.es.server.events   5.4.1.0  COMMITTED  ES Server Events
  cluster.es.server.rte      5.4.1.7  COMMITTED  ES Base Server Runtime
  cluster.es.server.simulator
  cluster.es.server.utils    5.4.1.7  COMMITTED  ES Server Utilities
  cluster.man.en_US.es.data  5.4.1.3  COMMITTED  ES Man Pages - U.S. English

Snmpd is vesion 3.

I have tried this:

stop/start/restart clcomd,clinfoES,snmpd.

Check /etc/services for ports :

Code:
  [1] clinfo_deadman 6176/tcp
  [1] clm_lkm 6150/tcp
  [1] clm_smux 6175/tcp
  [1] godm 6177/tcp
  [1] topsvcs 6178/udp
  [1] grpsvcs 6179/udp
  [1] emsvcs 6180/udp
  [1] clcomd 6191/tcp
  [1] clinfo_client 6174/tcp
  [1] rpv 6192/tcp (if its PowerHA/XD for GLVM)

everything looks ok there.
Then I have checked /usr/es/sbin/cluster/etc/clhosts for 127.0.0.1 # HACMP/ES for AIX
and it is there.
This all I have checked on both nodes.

Do you have any idea what I could check/fix next ? Please.

Thank you.

---------- Post updated at 04:27 PM ---------- Previous update was at 04:20 PM ----------

I have noticed that in /usr/tmp/snmpdv3.log I am getting this error while starting:

Code:
 11/15/10 03:38:16 PM -  Dynamic configuration initialized.
 11/15/10 03:38:16 PM -  Using file /etc/snmpdv3.conf for configuration.
 11/15/10 03:38:16 PM -  Using existing file /etc/snmpd.boots for boots initialization.
 11/15/10 03:38:16 PM -  Tracing is set to 0.
 11/15/10 03:38:16 PM -  snmpd: 1473-148 Line number 175 contains an unsupported keyword : snmpd.
 11/15/10 03:38:16 PM -  SNMP agent: initialization complete.

I have checked that line and is this line from snmpdv3.conf:

Code:
snmpd smuxtimeout=200 #muxatmd

Do you think this could that be problem ? I can not do any testing until I am sure that it will be resolved.

Thank you.
# 5  
Old 11-16-2010
Quote:
Originally Posted by phobus
[...]
Code:
 11/15/10 03:38:16 PM -  snmpd: 1473-148 Line number 175 contains an unsupported keyword : snmpd.
 11/15/10 03:38:16 PM -  SNMP agent: initialization complete.

I have checked that line and is this line from snmpdv3.conf:
Code:
snmpd smuxtimeout=200 #muxatmd

[...]
You cannot do any harm by disabling this particular entry. So just insert a # at the beginning of this line and try again. Does the cluster verification work?
This User Gave Thanks to shockneck For This Post:
# 6  
Old 11-16-2010
I have tried to comment it out , but it didn't help.
Then I did verification on node1 and it started to work on node1.
Then I did verification on node2 and it started to work on node2 as well.

During verification I didn't see any errors.

Thanks for help shockneck .

Problem solved.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. AIX

Clstat not working in a HACMP 7.1.3 cluster

I have troubles making clstat work. All the "usual suspects" have been covered but still no luck. The topology is a two-node active/passive with only one network-interface (it is a test-setup). The application running is SAP with DB/2 as database. We do not use SmartAssists or other gadgets. ... (8 Replies)
Discussion started by: bakunin
8 Replies

2. SuSE

Connection is broken because of underlying network issues

Hi Guys, I have two databases where I copy archivelog files from server A to server B frequently...yesterday we did the VM upgrade now...I am struggling to copy/ship the files from server A to server B... I manually tested the file transfer but the big files, e.g 46M, stall. I can copy small... (2 Replies)
Discussion started by: Phuti
2 Replies

3. UNIX for Advanced & Expert Users

How keep running a program n an another computer via a connection ssh when the connection is closed?

Hi everybody, I am running a program on a supercomputer via my personal computer through a ssh connection. My program take more than a day to run, so when I left work with my PC I stop the connection with the supercomputer and the program stop. I am wondering if someone know how I can manage... (2 Replies)
Discussion started by: TomTomGre
2 Replies

4. AIX

X connection to localhost:10.0 broken (explicit kill or server shutdown)

I want to run applet on AIX 6 machine. I already have setup $DISPLAY variable for putty session by selecting X11 option. I got below error for any X related commands (xclock, X, applet viewer ) X connection to localhost:10.0 broken (explicit kill or server shutdown). Please can anyone... (0 Replies)
Discussion started by: kailas.girase
0 Replies

5. AIX

clstat problem

It is a long time since I have used HACMP so I suppose I could be classed as a newbie! The cluster pair is running and I am trying to get the passive node to give me sensible information from the commmand above. However all I can get back is: Can not get cluster information. Check if clinfo... (4 Replies)
Discussion started by: johnf
4 Replies

6. Solaris

Solaris 10 ftp connection problem (connection refused, connection timed out)

Hi everyone, I am hoping anyone of you could help me in this weird problem we have in 1 of our Solaris 10 servers. Lately, we have been having some ftp problems in this server. Though it can ping any server within the network, it seems that it can only ftp to a select few. For most servers, the... (4 Replies)
Discussion started by: labdakos
4 Replies

7. UNIX for Advanced & Expert Users

Connection reset by peer..closing connection

Hello I'm facing the above problem while doing a performance run. I've a script which I'm launching from my windows desktop using mozilla. The script will invoke backend action on a Solaris host which in turn feeds the records to a driver located on a linux box(Cent OS). What's happening is... (1 Reply)
Discussion started by: subramanyab
1 Replies

8. UNIX for Advanced & Expert Users

connection to localhost:10.0 host broken (explicit kill or server shutdown)

Hi All, We use tomcat web server and it will get terminated with below error: connection to localhost:10.0 host broken (explicit kill or server shutdown) Please let me know how to fix this error. (5 Replies)
Discussion started by: bache_gowda
5 Replies

9. UNIX for Advanced & Expert Users

Recover X Forwarding after broken connection

Hello, today I logged in on a remote machine via "ssh -X ..." and started a screen session. In this i started OpenOffice.org writer by writing "oowriter". After some time the connection got lost (The network here is quite unstable). Now I logged in again via "ssh -X ...". The screen session is... (1 Reply)
Discussion started by: schola
1 Replies

10. UNIX for Advanced & Expert Users

Broken

Ok i am running Linux, or rather was. I can not longer do anything. This was a dns server amoungst other things. It will no longer boot. I have used a startup disk, but how can i recover the OS? I need help and urgently. Please someone thanks (3 Replies)
Discussion started by: ollyparkhouse
3 Replies
Login or Register to Ask a Question