Sponsored Content
Special Forums UNIX and Linux Applications High Performance Computing How to check performance of your HPC cluster? Post 302643627 by albertspade on Sunday 20th of May 2012 06:09:03 AM
Old 05-20-2012
Thanks for your help Otheus.
Smilie
I am new to the field of HPC. I installed HPCC and HPL. Even I am able run it and get the results. But I am not able to understand it. Also its running for my colete cluster, I also want to run them for my single machine. And now I am not able to tell whether its running on both the cores of my machine or only one process per machine, as I am having core 2 duo machines.
 

8 More Discussions You Might Find Interesting

1. Solaris

Performance check without a counterpart

Hi, is it possible to check out the speed limits of a box without a remote peer for netio and stuff? I transferred 1 Terrabyte last night, which is not really much and need to find the bottleneck. The remote server is soon going to retire, yet I need to copy the 8T to the new machine. This... (9 Replies)
Discussion started by: PatrickBaer
9 Replies

2. Shell Programming and Scripting

how to check all the applications are in cluster using shell script

Hi I have an application running in four different node.The server is tomcat.Each node in each tomcat server.How do i check whether all the nodes are in cluster using shell script. any command to check this would be of great use.:) (2 Replies)
Discussion started by: ahamed
2 Replies

3. Shell Programming and Scripting

Monitoring script to check if cm cluster is up or not.

hi guys have this little problem, need some help from script gurus. basically I'm running hpux cmviewcl command, cmviewcl command will produce db1pkg up running enabled box1 my script PSV='box1' STAT='up' check_db1pkg() { # assign cmviewcl output... (2 Replies)
Discussion started by: sparcguy
2 Replies

4. UNIX for Dummies Questions & Answers

How to check if the server is on a Cluster

Hi im connecting to a datacenter remotely. is there a command to know if the server is on a cluster? i want to know the command to use in these OS(hp-ux,solaris,linux) Thanks (6 Replies)
Discussion started by: jinslick25
6 Replies

5. Linux

Cluster check

I am working on a Linux server and want to check the cluster status. I dont know how many server is in cluster and what is the command to check. could you please help me to get the cluster status? (4 Replies)
Discussion started by: anshu ranjan
4 Replies

6. AIX

How to check if a filesystem is part of a cluster

Hello, - How do I know if a filesystem is part of a cluster? - Or do I have to check if the vg related to the fs is part of a cluster instead? if so, how do I check it? - I would also need to check if there are vxfs type inside aix machines and if there are, how do I know if that type of... (2 Replies)
Discussion started by: asanchez
2 Replies

7. Red Hat

How to check performance?

Hi, all What would be the a,b,c in troubleshooting slow performance on RH box, I type and it became really slow, what commands or log files to examine. What parameters to check? Thanks all T (2 Replies)
Discussion started by: trento17
2 Replies

8. Solaris

System Check Performance Tuning

Hello Forum, Well I am fairly new to this Solaris os thing. One thing I would like to check for system health and performance. I know the codes like prstat,vmstat,sar,iostat,netstat,prtdiag -v, What else does a want to be sys admin have to look for when checking a solaris box? I know... (3 Replies)
Discussion started by: br1an
3 Replies
condor_checkpoint(1)					      General Commands Manual					      condor_checkpoint(1)

Name
       condor_checkpoint send - a checkpoint command to jobs running on specified hosts

Synopsis
       condor_checkpoint [-help -version]

       condor_checkpoint[-debug]  [-pool  centralmanagerhostname[:portnumber]]	[-name hostnamehostname-addr "<a.b.c.d:port>""<a.b.c.d:port>"-con-
       straint expression-all]

Description
       condor_checkpoint sends a checkpoint command to a set of machines within a single pool. This causes the startd daemon on each of the speci-
       fied  machines  to  take  a  checkpoint of any running job that is executing under the standard universe. The job is temporarily stopped, a
       checkpoint is taken, and then the job continues. If no machine is specified, then the command is sent to the machine that issued  the  con-
       dor_checkpoint command.

       The  command  sent  is  a periodic checkpoint. The job will take a checkpoint, but then the job will immediately continue running after the
       checkpoint is completed. condor_vacate, on the other hand, will result in the job exiting (vacating) after it produces a checkpoint.

       If the job being checkpointed is running under the standard universe, the job produces a checkpoint and then continues running on the  same
       machine.  If  the  job  is  running under another universe, or if there is currently no Condor job running on that host, then condor_check-
       pointhas no effect.

       There is generally no need for the user or administrator to explicitly run condor_checkpoint. Taking checkpoints of running Condor jobs	is
       handled automatically following the policies stated in the configuration files.

Options
       -help

	  Display usage information

       -version

	  Display version information

       -debug

	  Causes debugging information to be sent to  stderr , based on the value of the configuration variable  TOOL_DEBUG

       -pool centralmanagerhostname[:portnumber]

	  Specify a pool by giving the central manager's host name and an optional port number

       -name hostname

	  Send the command to a machine identified by hostname

       hostname

	  Send the command to a machine identified by hostname

       -addr <a.b.c.d:port>

	  Send the command to a machine's master located at "<a.b.c.d:port>"

       <a.b.c.d:port>

	  Send the command to a machine located at "<a.b.c.d:port>"

       -constraint expression

	  Apply this command only to machines matching the given ClassAd expression

       -all

	  Send the command to all machines in the pool

Exit Status
       condor_checkpointwill exit with a status value of 0 (zero) upon success, and it will exit with the value 1 (one) upon failure.

Examples
       To send a condor_checkpoint command to two named machines:

       % condor_checkpoint   robin cardinal

       To send the condor_checkpointcommand to a machine within a pool of machines other than the local pool, use the -pooloption. The argument is
       the name of the central manager for the pool. Note that one or more machines within the pool must be specified as the targets for the  com-
       mand. This command sends the command to a the single machine named cae17within the pool of machines that has condor.cae.wisc.eduas its cen-
       tral manager:

       % condor_checkpoint  -pool condor.cae.wisc.edu -name cae17

Author
       Condor Team, University of Wisconsin-Madison

Copyright
       Copyright (C) 1990-2012 Condor Team, Computer Sciences Department, University of  Wisconsin-Madison,  Madison,  WI.  All  Rights  Reserved.
       Licensed under the Apache License, Version 2.0.

       See the Condor Version 7.8.2 Manualor http://www.condorproject.org/licensefor additional notices. condor-admin@cs.wisc.edu

								  September 2012					      condor_checkpoint(1)
All times are GMT -4. The time now is 06:51 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy