I am new to cluster commands. But I have tried utilizing: -pe, but I do not know my parallel computing environment. We are running SGE, is there a simpler command to request more nodes?
first off, i am new to unix so please bear with me. i was reading somewhere that if your i-nodes get critical that it can slow your network down. what are i-nodes and when do they become a critical number? this is what mine states:
/ (/dev/root ): 777058 blocks 569290 i-nodes... (4 Replies)
Does anyone know something about this? I have no idea what it means and how to do it. but if anyone can give me and explanation and also point me to a website, i'd really appreciate it (5 Replies)
After rcp -rp from remote host, using du -k to verify the file size but total file size have different size. Check on individual file, file size is correct.
How can I confirm on the file size after ftp?
Pls advise.
Thank you. (15 Replies)
hello Gurus,
My current set up is 3 to 1 Cluster (SUN Cluster 3.2) running oracle database. Task is to reboot the servers. My query is about the procedure to do the same.
My understanding is suspend the databases to avoid switchover. Then execute the command scshutdown to down the cluster... (4 Replies)
Hi all. I have two nodes taken different places. They are connected together on a network. So, i have a service, it works on one of nodes and when the node is unavailable the service should will be launched on other node.
Solution: rhel cluster, keepalive, hearbeat...may be Carp
but what if... (2 Replies)
Hi all,
I have a list of node pairs separated with a comma and also, associated with their respective values. For example:
b0015,b1224 1.1
b0015,b2576 1.4
b0015,b3162 2.5
b0528,b1086 1.7
b0528,b1269 5.4
b0528,b3602 2.1
b0948,b2581 3.2
b1224,b0015 1.1... (8 Replies)
Hi,
A customer I'm supporting once upon a time broke their 2 cluster node database servers so they could use the 2nd standby node for something else. Now sometime later they want to bring the 2nd node back into the cluster for resilance. Problem is there are now 3 VG's that have been set-up... (1 Reply)
Hi folks.
I've been a developer for far too many years, but know very little of unix. I have setup a very inexpensive cluster of 6 raspberry pi nodes so I can play around with multi node programming. This is only for fun, but I want to learn properly, else what's the point?!
Setup can be a... (4 Replies)
Discussion started by: MuntyScrunt
4 Replies
LEARN ABOUT DEBIAN
qrerun
qrerun(1B) PBS qrerun(1B)NAME
qrerun - rerun a pbs batch job
SYNOPSIS
qrerun [-f] job_identifier ...
DESCRIPTION
The qrerun command directs that the specified jobs are to be rerun if possible.
To rerun a job is to terminate the session leader of the job and return the job to the queued state in the execution queue in which the job
currently resides.
If a job is marked as not rerunable then the rerun request will fail for that job. If the mini-server running the job is down, or the
rejects the request, the Rerun Job batch request will return a failure unless -f is used.
Using -f violates IEEE Batch Processing Services Std and should be handled with great care. It should only be used under exceptional cir-
cumstances. Best practice is to fix the problem mini-server host and letting qrerun run normally. The previous nodes may need manual
cleaning. See the -r option on the qsub and qalter commands.
OPERANDS
The qrerun command accepts one or more job_identifier operands of the form:
sequence_number[.server_name][@server]
STANDARD ERROR
The qrerun command will write a diagnostic message to standard error for each error occurrence.
EXIT STATUS
Upon successful processing of all the operands presented to the qrerun command, the exit status will be a value of zero.
If the qrerun command fails to process any operand, the command exits with a value greater than zero.
SEE ALSO qsub(1B), qalter(1B), pbs_alterjob(3B), pbs_rerunjob(3B)Localqrerun(1B)