Thoughts on HACMP: Automatic start of cluster services


 
# 1  
Old 07-26-2013

Hi all,

I remember, way back in some old environment, that the HA cluster services were not started automatically at boot, i.e. there was no entry for them in /etc/inittab.
As I recall, the reason (taking a two-node active/passive cluster as an example) was to keep a freshly booted backup node from automatically being able to receive RGs from the active node. If the backup node was booted because of some error, failure, maintenance etc. and a failover from the active node happened just then, data loss could occur, since the node could be in an undefined state (taking a paranoid view of things).
A boot that was not issued by an administrator, but happened for any other reason, needs to be investigated. Until that is done, I would not consider the node ready to be put back into the cluster. I think that is absolutely reasonable.

Yet I often see environments, and several good official and unofficial documents on the net, where the /etc/inittab entry that starts HACMP at boot is created automatically (I didn't remember that), with no note about disabling it as a precaution for the reasons described above.

How do you handle this? Do you leave the entry in there or comment it out?
What's your reason for one or the other?
Is my approach too paranoid or unrealistic?
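
To make the question concrete, here is a minimal sketch of how one might check which of the two states a node is in. It assumes the entry's identifier starts with "hacmp" (the exact label may differ between releases, so adjust the pattern) and relies on the AIX convention that an inittab line beginning with ":" is a comment. The function name is only illustrative.

```shell
#!/bin/sh
# Report whether an inittab-style file contains an HACMP start entry and
# whether that entry is active or commented out.
# Assumption: the identifier of the entry starts with "hacmp".
# On AIX, a line beginning with ":" is treated as a comment.

check_hacmp_inittab() {
    inittab="${1:-/etc/inittab}"
    awk '
        /^:+[ \t]*hacmp/ { print "disabled: " $0; found = 1; next }
        /^hacmp/         { print "active: " $0;   found = 1 }
        END              { if (!found) print "no hacmp entry found" }
    ' "$inittab"
}
```

Calling `check_hacmp_inittab` without an argument reads /etc/inittab; passing a path lets you try it on a copy first. For actually changing entries, AIX ships lsitab, chitab and rmitab, which are safer than editing the file by hand.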

Please share your thoughts, thanks :)

Last edited by zaxxon; 07-26-2013 at 06:35 AM.. Reason: typo
# 2  
Old 07-26-2013
I think a system - any system, not only HACMP nodes - that has gone through a power cycle should not be started automatically, because there was surely a reason for it having gone down in the first place. A simple power cycle will most certainly not correct that problem, and the machine should stay down until an admin can verify that the system is OK and initiate the application startup.

If a system is important enough that a downtime (usually until the next morning) is not feasible, then a HACMP system should replace the single system. If it is important enough that not even the downtime of a standby node can be tolerated, then you need an admin available 24/7. It can't be stressed often enough: uninterrupted service costs money. To place a system somewhere and then blame the admin when the hardware/software turns out not to run non-stop without any maintenance is idiotic (and nevertheless seen often).

Admins, btw., are not without fault at all. Not because they cannot make the impossible possible, but because they didn't object from the first minute a plan for such a system was hatched.

Back to the original question: I think it is better to start the cluster manager manually, and I always configure my systems to work that way.

bakunin

Last edited by bakunin; 07-27-2013 at 07:36 AM.. Reason: grammar, typos
# 3  
Old 07-28-2013
Hi Wolf, thanks a lot for sharing your point of view - good to have this confirmation :)
# 4  
Old 08-08-2013
Hi Zaxxon,

somewhere in the HACMP resource group configuration you can define how the RG behaves when the original node rejoins the cluster. You can configure it not to fall back when the original node rejoins.
That way you can start the HA services automatically and be able to take over in case of a failure, without risking an unwanted takeover when the node rejoins.
# 5  
Old 08-13-2013
Since HACMP 5.X the cluster manager gets started automatically - that is why it is in the inittab.

There are three kinds of resource groups, and each has its own startup (node just coming up) and recovery policy.

So, to get to the point: in HACMP v4 and earlier, having HACMP in inittab was not best practice, because the cluster manager was not always active. Starting with HACMP v5 the cluster manager is always active, so the daemons (not the resource groups) are always (meant to be) activated when the node/server starts.
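
The difference between "daemon is up" and "cluster services are started" can be checked on a node. The following is only a sketch under assumptions: it expects the long status of the cluster manager subsystem (`lssrc -ls clstrmgrES`) on stdin, assumes that output contains a "Current state:" line, and assumes ST_INIT means the daemon is running without cluster services while ST_STABLE means cluster services are running. The function name is made up for illustration.

```shell
#!/bin/sh
# Classify a node from the "Current state:" line of the cluster manager's
# long status output, read from stdin.
# Assumptions: the input comes from `lssrc -ls clstrmgrES`; ST_INIT means
# the daemon is up but cluster services are not started; ST_STABLE means
# cluster services are running. Any other state is reported unchanged.

cluster_state() {
    state=$(sed -n 's/^Current state:[ ]*\([A-Z_]*\).*/\1/p' | head -n 1)
    case "$state" in
        ST_STABLE) echo "cluster services running" ;;
        ST_INIT)   echo "daemon running, cluster services not started" ;;
        "")        echo "no state reported" ;;
        *)         echo "other state: $state" ;;
    esac
}
```

On a node it would be used as `lssrc -ls clstrmgrES | cluster_state`. A node that reports the daemon running but cluster services not started is one that came up without rejoining the cluster, which is the situation the manual-start approach aims for.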

Hope this helps.