Suncluster query
# 1  
Old 05-08-2009

Hello,

I have two Solaris 10 SPARC systems with the latest patches. Each has two cluster transport connections, one public port, and fiber connections to the SAN.

This is the first time I am doing a Sun Cluster install (latest software). I installed the software using ./installer and then tried to set up the cluster using the "scinstall" command.

Now I find that after configuring the second node, scinstall hangs at "rebooting node ....." even though the node has rebooted.

I have already read the initial documentation from Sun on Sun Cluster configuration, but now I am confused. My question is:

What steps should be taken to configure Sun Cluster on 2 nodes that will share a SAN disk?

Do I have to install VxVM before using scinstall, or will scinstall detect the SAN disk? I am sure the LUN is assigned to the systems.

I have the VxVM product with a license (not installed) but no experience of when and how it should be used. If I at least know which step to do when, I am ready to read any related documents.

So do I mount the SAN disks on both systems first, or run scinstall first? I tried scinstall first and it hung many times. When I retried scinstall, the second node went into a kernel panic.

Hope I was able to explain myself here.

Please help!

Thank you...
# 2  
Old 05-09-2009
What's the Sun Cluster version? What is the kernel patch level of Solaris 10 on both systems? And is the OBP updated to the latest?
You can run scinstall without installing VxVM first.
# 3  
Old 05-09-2009
Quote:
Originally Posted by incredible
What's the Sun Cluster version? What is the kernel patch level of Solaris 10 on both systems? And is the OBP updated to the latest?
You can run scinstall without installing VxVM first.

Kernel patch: 138888-08 on both systems (SF V480R)
Sun Cluster version: suncluster-3_2u2-ga.iso
OBP: 4.10.7 2003/06/11 07:0 on both systems.

Systems patched to latest.
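
To confirm that both nodes really are at the same kernel patch level, the values can be compared programmatically. This is only a minimal Python sketch: it assumes that `uname -v` on Solaris 10 prints a string of the form `Generic_138888-08`, and the function names are hypothetical helpers, not part of any Sun tool.

```python
import re
from typing import Tuple


def parse_kernel_patch(uname_v: str) -> Tuple[str, int]:
    """Split a string such as 'Generic_138888-08' into (patch_id, revision).

    Assumption: this is the format printed by `uname -v` on Solaris 10.
    """
    match = re.fullmatch(r"Generic_(\d+)-(\d+)", uname_v.strip())
    if match is None:
        raise ValueError(f"unexpected kernel version string: {uname_v!r}")
    return match.group(1), int(match.group(2))


def same_patch_level(node_a: str, node_b: str) -> bool:
    """Return True when both nodes report the same patch id and revision."""
    return parse_kernel_patch(node_a) == parse_kernel_patch(node_b)


if __name__ == "__main__":
    # Values as reported in this thread for both servers.
    print(same_patch_level("Generic_138888-08", "Generic_138888-08"))
```

Paste the `uname -v` output from each node into `same_patch_level` to see whether they match.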

I wonder why scinstall hangs after rebooting the second node. So even if the SAN disk is not configured, I can go ahead with scinstall, right? Does scinstall detect SAN LUNs, or do they need to be configured using Sun Volume Manager or VxVM?

Is there a link anywhere for a step-by-step configuration?

Thanks!
# 4  
Old 05-09-2009
Any output? How does it hang? Could it be due to a firewall?
It might be easier to install one node at a time, i.e. install the first node and then add the second node to the cluster.

Also check both nodes with svcs -xv.

Last edited by DukeNuke2; 05-09-2009 at 06:08 PM..
# 5  
Old 05-09-2009
You don't need VxVM... but you need some kind of volume management to switch a filesystem between cluster nodes. Where you don't need volume management is the quorum device.
The question is: what kind of cluster are you trying to install (which service is going to be HA)?
# 6  
Old 05-09-2009
Which storage is attached to the cluster nodes?

Do the FC ports on the storage have the correct flags set for Sun Cluster?

For example, if you have an EMC array, the LUNs must have the "PER" flag.
# 7  
Old 05-10-2009
Quote:
Originally Posted by incredible
Any output? How does it hang? Could it be due to a firewall?
It might be easier to install one node at a time, i.e. install the first node and then add the second node to the cluster.

Also check both nodes with svcs -xv.

Attaching the log.

This time I found that scinstall hung while it was rebooting the other node, and then I found that the second node was rebooting again and again.

EDIT :

Attaching log from second server,

cat cluster_check_exit_code.log
100

cat ql_upgrade_debug.log
Sat May 9 22:46:49 2009 CCR: INFO Upgrade state key not found
Sat May 9 22:46:49 2009 CCR: INFO Upgrade state key not found
Sat May 9 22:46:49 2009 QL_CHECK_OBJ: INFO Found RGM object
Sat May 9 22:50:29 2009 CCR: INFO Upgrade state key not found
Sat May 9 22:50:29 2009 CCR: INFO Upgrade state key not found
Sat May 9 22:50:30 2009 QL_CHECK_OBJ: INFO RGM object not found...yet
Sat May 9 22:50:31 2009 QL_CHECK_OBJ: INFO Found RGM object
Sat May 9 22:51:10 2009 QL_UPGRADE: INFO Ping time out interval = 1
Sat May 9 22:51:10 2009 CCR: INFO Upgrade state key not found
Sat May 9 22:51:10 2009 CCR: INFO Upgrade state key not found
Sat May 9 22:51:10 2009 QL_UPGRADE: INFO Upgrade not in progress

cat scinstall.log.1549
scinstall -i -k -C lumtest-p -F -T node=server1,node=server2,authtype=sys -w netaddr=172.16.0.0,netmask=255.255.240.0,maxnodes=64,maxprivatenets=10,numvirtualclusters=12 -A trtype=dlpi,name=ce0 -A trtype=dlpi,name=ce1 -B type=switch,name=switch1 -B type=switch,name=switch2 -m endpoint=:ce0,endpoint=switch1 -m endpoint=:ce1,endpoint=switch2 -P task=quorum,state=INIT


Checking device to use for global devices file system ... done

Initializing cluster name to "lumtest-p" ... done
Initializing authentication options ... done
Initializing configuration for adapter "ce0" ... done
Initializing configuration for adapter "ce1" ... done
Initializing configuration for switch "switch1" ... done
Initializing configuration for switch "switch2" ... done
Initializing configuration for cable ... done
Initializing configuration for cable ... done
Initializing private network address options ... done
Plumbing network address 172.16.0.0 on adapter ce0 >> NOT DUPLICATE ... done
Plumbing network address 172.16.0.0 on adapter ce1 >> NOT DUPLICATE ... done


Setting the node ID for "server2" ... done (id=1)


Checking for global devices global file system ... done
Updating vfstab ... done

Verifying that NTP is configured ... done

Updating nsswitch.conf ... done

Adding cluster node entries to /etc/inet/hosts ... done


Configuring IP multipathing groups ...done


Verifying that power management is NOT configured ... done

Ensure that the EEPROM parameter "local-mac-address?" is set to "true" ... done

Ensure network routing is disabled ... done

Please reboot this machine.
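
As a side check on the `-w netaddr=172.16.0.0,netmask=255.255.240.0` options in the scinstall command logged above, the size of the private network can be worked out with Python's standard `ipaddress` module. This is a minimal sketch for illustration only. The function name is hypothetical, and it says nothing about how Sun Cluster itself divides that range among nodes and private networks.

```python
import ipaddress
from typing import Tuple


def describe_private_net(netaddr: str, netmask: str) -> Tuple[str, int]:
    """Return the CIDR form and total address count for a netaddr/netmask pair."""
    net = ipaddress.ip_network(f"{netaddr}/{netmask}")
    return net.with_prefixlen, net.num_addresses


if __name__ == "__main__":
    cidr, count = describe_private_net("172.16.0.0", "255.255.240.0")
    print(cidr, count)  # 172.16.0.0/20 4096
```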

cat client.log.0
info : main: SccheckClient() -- ENTER; clientNumber: 0
info : main: SccheckClient() java version: 1.5.0_12
info : main: SccheckClient.checkRequiredDefines() clustername: server2
info : main: SccheckClient.checkRequiredDefines() clientName: server2
info : main: SccheckClient.checkRequiredDefines() private clientName:
info : main: SccheckClient.checkRequiredDefines() inClusterMode: false
info : main: SccheckClient.checkRequiredDefines() gunzip: /usr/bin/gunzip
info : main: SccheckClient.checkRequiredDefines() keLogName: /var/cluster/logs/sccheck/ke-client.log.0
info : main: SccheckClient.checkRequiredDefines() xslDir: /var/cluster/sccheck/tmp/client.0
info : main: SccheckClient.loadOptionalDefines() brief: false
info : main: SccheckClient.loadOptionalDefines() verbose: false
info : main: SccheckClient.loadOptionalDefines() vverbose: false
info : main: SccheckClient.loadOptionalDefines() hostlist: null
info : main: SccheckClient.loadOptionalDefines() minSeverity: 0
info : main: SccheckClient.loadOptionalDefines() explorer input files path: null
info : main: explorer archive specified : false
info : main: do only multi node check : false
info : main: SccheckClient.expandHostlist() -- sessionPublicNames: [server2]
info : main: SccheckClient.expandHostlist() -- sessionPrivateNames: [localhost]
info : main: SccheckClient.remOps(): [server2] / [server2]
info : main: SccheckClient.remOps() reportsDir: /var/cluster/sccheck/reports.2009-05-09.12:44:12
info : main: SccheckClient.remOps() CLIENT_REPORTNAME: sccheck-results
info : main: SccheckClient.remOps() reportFilename: /var/cluster/sccheck/reports.2009-05-09.12:44:12/sccheck-results.server2
info : main: SccheckClient.remOps() new thread for: server2/localhost
info : main: Utils.runCmdOneString() cmdArray: [/usr/ucb/printenv,LC_MESSAGES]
info : main: Utils.runCmdOneString() result: null
info : main: Utils.runCmdOneString() returnCode: 1
info : main: Utils.runCmdOneString() error return: 1
info : main: Utils.runCmdOneString() cmdArray: [/usr/ucb/printenv,LANG]
info : main: Utils.runCmdOneString() result: null
info : main: Utils.runCmdOneString() returnCode: 1
info : main: Utils.runCmdOneString() error return: 1
info : main: I18n.getBundle(): en_US
info : main: postProgress: (verbose: 1) Requesting explorer data and node report from server2.
info : main: ClientThread() -- ENTER -- server2
info : main: ClientThread() resultsFilename: /var/cluster/sccheck/explorers-gz/server2.expl.gzip
info : main: ClientThread() reportFilename: /var/cluster/sccheck/reports.2009-05-09.12:44:12/sccheck-results.server2
info : main: ClientThread() brief: false
info : main: ClientThread() minSeverity: 0
info : main: ClientThread() check rule file: /usr/cluster/lib/sccheck/checklist.cluster.singlenode.xml
info : main: ClientThread() explorer archive: null
trace: main: ClientThread() -- EXIT -- server2
trace: main: SccheckClient.waitForClientThreads() -- ENTER --
info : Thread-0: ClientProtocol() inetPort: 7123
info : Thread-0: ClientProtocol() serverPort: 7124
trace: Thread-0: ClientProtocol() contacting localhost on inetd port 7123
error: Thread-0: ClientProtocol(): IOException server2: Connection refused
error: Thread-0: ClientThread.remoteOperations() in ProtocolException: Connection refused
error: Thread-0: ClientThread.run() SCException on: server2: Connection refused
info : Thread-0: postErrMsg: server2 error: Connection refused
trace: main: SccheckClient.waitForClientThreads() -- EXIT --
error: main: SccheckClient.remOps() failedNodes: server2
error: main: SccheckClient() remote ops exception: server2
info : main: SccheckClient.earlyExit() called with code 107 (server2)

Most of the new logs I have attached or copied are from server 2; log.txt is from server 1.
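
The client log shows "Connection refused" when contacting localhost on port 7123. Whether anything is listening on that port can be checked independently of sccheck. The following is a minimal Python sketch: the port numbers 7123 and 7124 are taken from the log above, and `port_is_open` is a hypothetical helper. A refused connection only shows that nothing accepted the connection; it does not say why.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port can be established."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


if __name__ == "__main__":
    # Ports reported by ClientProtocol() in client.log.0.
    for port in (7123, 7124):
        state = "open" if port_is_open("localhost", port) else "closed or refused"
        print(f"localhost:{port} is {state}")
```

Running it on each node, and from one node against the other, helps separate a service that is not running from a firewall blocking the port.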

Thanks !!

Last edited by upengan78; 05-10-2009 at 11:02 AM..