I was away for a while ... enjoying nice autumn weather here...
Well, you got to the reboot stage. Good.
Strange that the cluster does not come back after the reboot...
I think you no longer receive the first error regarding communication problems do you? Can you see any error message now? The log provided looks fairly OK, what happens after reboot? Suncluster is rather sensitive and talkative, when it boots.
If you can't see exact error message I will try to provide you with some basic guidance.
You sound like someone familiar with Solaris, but you said you're newbie, maybe you're doing some simple error like I did several times setting up my sun clusters
1/ Are you sure the quorum device is picked up?
Does cldevice list -v / cldevice refresh / clquorum list commands return anything other then error?
2/ Chances are your cluster is still in "installmode", scinstall gives a very clear statement after completing all initial config and leaving "installmode".
3/ Are your /etc/hosts, nsswitch.conf, resolv.conf, /etc/domainname files identical? Make them identical dot after dot on all nodes, once I solved a problem by re-ordering lines although I wasn't able to reproduce it and it was in early days of myself playing with Suncluster.
4/ Are you sure that the underlaying storage is connected correctly?
Perhaps have a chat about basic cluster concepts (shared storage) with someone experienced?
Verify that both nodes can see the same disks, (play with cfgadm -al, luxadm display, cfgadm -al -o show_FCP_dev, format, probe-scsi-all commands on both nodes to ensure they see the same storage).
5/ Did you labeled the disks?
6/ Did you sliced and then mounted filesystems identically ?
Disks have to be sliced identically but SVM names (/dev/md/dsk/d??) have to be uniqe within cluster!
7/ last resort: Do you work for a Sun Service Partner or have a close relationship to any?
I am thinking of EIS (Enterprise Installation Standards) DVD - this would greatly help you setting this up.
8/ (maybe this should go first): are you suing fairly new Solaris10 update?
Forget about all the early first, second releases, have somethig fresh and patched, Oooh I mentioned patches.. large topic, install lots of patchesm recommended, security, and finally SunCluster's patches (they're not for free).
When I install cluster I just install it alone in the first place, and after that I set up application-related things quorum, IPMP, IP, HAStoragePlus, various agents etc...).
I'll give you one of my install-logs below:
I don't know what application are you installing, but you're not even at half the way. So don't get frustrated too early, try to get interested in it, and treat it as a valuable challenge. It really is. I'll try to help you if I will be able to, and other here willl do the same. Please share some feedback with us, as we do it in hope for our own development. Looking forwared to hear from you!
One more thing came to my mind overnight: it is important to note, that SC uses regular IP for inter-node communication, the addresses and subnets are pre-defined (may be changed) and are: 172.16.0.0/21.
As you can see the subnet is rather short resulting in large subnet beign used, the subnets behave like any other IP addresses on the system: they pop up in system's routing table and may ,,hide'' other routes. Please refer to: Private Network (Sun Cluster Software Installation Guide for Solaris OS) - Sun Microsystems
Sorry for replying so so late. I have gone through all your check list and trying figure out what is blocking me in this. Also I faced some resource(One of the node went to other team) problem at my test lab after last communication.
Anyways, I've got both servers now and will try to configure from scratch.
Thanks for your all guidance. Will present you my results soon.
Hi Gurus
I am not able to find the patching procedure for solaris 10 ( sol10 u11) to latest patchset with sun cluster having failover zones so that same I should follow.
Take an instance, there are sol1 and sol2 nodes and having two failover zones like sozone1-rg and sozone2-rg and currently... (1 Reply)
I build up two node cluster (node1, node2) in virtualbox. For these two nodes I add 5 shared disk. (Also each node have own OS disk).
1 shared disk for vtoc
2 shared disk for NFS resource group
2 shared disk for WEB resource group
When I finished my work; two nodes was ok and shared disk... (4 Replies)
Hi,
Is it possible to have a Solaris cluster of 2 nodes at SITE-A using SVM and creating metaset using say 2 LUNs (on SAN). Then replicating these 2 LUNs to remote site SITE-B via storage based replication and then using these LUNs by importing them as a metaset on a server at SITE-B which is... (0 Replies)
Hi All,
i have 2 zone
1- Oracle DB Primary Server
2- Oracle DB Secondary Server
i make script do r sync between this 2 zones
but i am planning to do is
make Solaris cluster between this zones
if the primary Server field the secondary server up and running automatically without any... (1 Reply)
Simple question:
After applying a cluster patch to a sun solaris box I am left with a root volume 81% full. I could run through the hassle of resizing the slices which is way too much work for a Ultra 5 running DNS only.
Is there a way to clean up the /var/sadm/pkg area, aka dump the save info. (5 Replies)