Sponsored Content
Full Discussion: Guides for new HPC admins
Special Forums UNIX and Linux Applications High Performance Computing Guides for new HPC admins Post 302534715 by DBryan on Tuesday 28th of June 2011 02:40:45 PM
Old 06-28-2011
Guides for new HPC admins

In my company, it's fallen on me to serve as the admin of our new HPC cluster, a task that's very new to me. It's very important to me to lay a solid foundation and avoid any unnecessary pitfalls. So, can anyone recommend a succinct guide or list of do's-and-don'ts for adiminstering an HPC cluster? The cluster runs the latest CentOS, PGI compilers, MPICH2, WRF, etc.

Thanks!
 

6 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Solaris Study Guides

I am currently working on my Solaris 8.0 Certification ,and I've been working primarily with the Solaris Study Guide produced by Syngress & Osborne. It's a good study guide ,but I think that it does lack some clarity and detail! My question is ,are there any other Solaris Study Guides that could... (1 Reply)
Discussion started by: bilal_aa
1 Replies

2. Solaris

Mail server guides/tutorials?

solved issue (0 Replies)
Discussion started by: n0rus
0 Replies

3. Windows & DOS: Issues & Discussions

Beginners Guides: Forgotten Passwords & Recovery Methods

Ever wondered how to recover or reset a forgotten password in WindowsXP? This site will help you get back into your computer, all without reinstalling the operating system. - Version 1.0.0: Reference: http://www.intelligenceweb.org/showthread.php?t=2 (6 Replies)
Discussion started by: Neo
6 Replies

4. UNIX for Dummies Questions & Answers

Oracle guides for exams

hi people, I am very much interested to Oracle. I decided to write one exam soon and I am refering guides from certmagic.com. It seems good. Any of you know any good books than this ?! (0 Replies)
Discussion started by: developer_me
0 Replies

5. AIX

AIX study Guides

Please help me in getting some fine docs ( other than redbooks)to learn AIX. My mail ID: qsecofr400@gmail.com Thanks in advance. (3 Replies)
Discussion started by: secofr
3 Replies

6. What is on Your Mind?

Good Practice Guides

A recent post where someone suggested redirecting with a clobber ">" to a file the same command was reading from prompted me to post this sysad good practice list. Some items are from times where I have learned things the hard way. I think this would be helpful so we can learn from each others... (8 Replies)
Discussion started by: ilikecows
8 Replies
CRM_MON(8)							  [FIXME: manual]							CRM_MON(8)

NAME
crm_mon - monitor the cluster's status SYNOPSIS
crm_mon [-V] -d -pfilename -h filename crm_mon [-V] [-1|-n|-r] -h filename crm_mon [-V] [-n|-r] -X filename crm_mon [-V] [-n|-r] -c|-1 crm_mon [-V] -i interval crm_mon -? DESCRIPTION
The crm_mon command allows you to monitor your cluster's status and configuration. Its output includes the number of nodes, uname, uuid, status, the resources configured in your cluster, and the current status of each. The output of crm_mon can be displayed at the console or printed into an HTML file. When provided with a cluster configuration file without the status section, crm_mon creates an overview of nodes and resources as specified in the file. OPTIONS
--help, -? Provide help. --verbose, -V Increase the debug output. --interval seconds, -i seconds Determine the update frequency. If -i is not specified, the default of 15 seconds is assumed. --group-by-node, -n Group resources by node. --inactive, -r Display inactive resources. --as-console, -c Display the cluster status on the console. --one-shot, -1 Display the cluster status once on the console then exit (does not use ncurses). --as-html filename, -h filename Write the cluster's status to the specified file. --daemonize, -d Run in the background as a daemon. --pid-file filename, -p filename Specify the daemon's pid file. --xml-file filename, -X filename Specify an XML file containing a cluster configuration and create an overview of the cluster's configuration. EXAMPLES
Display your cluster's status and get an updated listing every 15 seconds: crm_mon Display your cluster's status and get an updated listing after an interval specified by -i. If -i is not given, the default refresh interval of 15 seconds is assumed: crm_mon -i interval[s] Display your cluster's status on the console: crm_mon -c Display your cluster's status on the console just once then exit: crm_mon -1 Display your cluster's status and group resources by node: crm_mon -n Display your cluster's status, group resources by node, and include inactive resources in the list: crm_mon -n -r Write your cluster's status to an HTML file: crm_mon -h filename Run crm_mon as a daemon in the background, specify the daemon's pid file for easier control of the daemon process, and create HTML output. This option allows you to constantly create HTML output that can be easily processed by other monitoring applications: crm_mon -d -p filename -h filename Display the cluster configuration laid out in an existing cluster configuration file (filename), group the resources by node, and include inactive resources. This command can be used for dry-runs of a cluster configuration before rolling it out to a live cluster. crm_mon -r -n -X filename FILES
/var/lib/heartbeat/crm/cib.xml--the CIB (minus status section) on disk. Editing this file directly is strongly discouraged. AUTHOR
crm_mon was written by Andrew Beekhof. [FIXME: source] 07/05/2010 CRM_MON(8)
All times are GMT -4. The time now is 01:56 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy