10 More Discussions You Might Find Interesting
1. Homework & Coursework Questions
Use and complete the template provided. The entire template must be completed. If you don't, your post may be deleted!
I am VERY much a neophyte with shell scripting. I am working on the following,
1. The problem statement, all variables and given/known data:
"Create a script sends an... (7 Replies)
Discussion started by: doghouse308
7 Replies
2. Shell Programming and Scripting
So here I have a simple function that I wish to debug. However, I am unable to debug the desired function even with set -o functrace enabled. Before resorting to asking this question, I had managed to find a possible solution that did not produce the desired results, which is located here.
How... (5 Replies)
Discussion started by: BrandonD
5 Replies
3. Programming
hey
i have a problem with a switch case in program and the debugger is messy has hell ( we use normal VI and gdb in our schoool to make it more diffiacult)
any way i have a problom where for some unknown reason the debugger just skips a switch statment as if it wasent even there
the rest... (2 Replies)
Discussion started by: gotenxds
2 Replies
4. Homework & Coursework Questions
School:Syrian Virtual University - Bachelor in Information Technology - Tutor: A.Issa - course: S10-iti320
hi all, would you please help me correcting and debugging this script: fx-permiss.sh which accepts a list of users as argument resiting those files permissions:
say our directory... (0 Replies)
Discussion started by: erzal
0 Replies
5. Shell Programming and Scripting
hi all, would you please help me correcting and debugging this script: fx-permiss.sh which accepts a list of users as argument resiting those files permissions:
say our directory structure:
/home/erzal/file
/home/erzal/dire
/home/erzal/share
/home/erzal/share/file
/home/erzal/share/dire
1-... (1 Reply)
Discussion started by: erzal
1 Replies
6. Shell Programming and Scripting
is there any way you can add a breakpoint in a script so you can stop on it?
i have used -xv in my shebang but the script just runs and i want it to stop at a specific point in the script.
appreciate any help. (1 Reply)
Discussion started by: npatwardhan
1 Replies
7. Programming
Hi i want to debug the C program with GDB debugger. I want to debug the program by line by line. I want to debug program like as we debug the program in Turbo-C using the F8. Can any one help me?
I know i have to use single stepping. But i don't know how to use it. Any help can be appreciated..... (5 Replies)
Discussion started by: ps_sach
5 Replies
8. Solaris
when I tried to debug my application i got the following.
gdb -v
GNU gdb 6.6
file is in C and Xmotiff Languages
(gdb) attach 25499
Attaching to process 25499
Retry #1:
Retry #2:
Retry #3:
Retry #4:
0xfea40b68 in ?? ()
(gdb) where
#0 0xfea40b68 in ?? () (0 Replies)
Discussion started by: satish@123
0 Replies
9. Solaris
OS : SOLARIS 10
debug tool :$gdb -v
GNU gdb 6.6
compiler : $gcc -v
gcc version 2.95.3 20010315 (release)
When i tried to debug my application i got the following error.
$gdb Pal
GNU gdb 6.6
Copyright (C) 2006 Free Software Foundation, Inc.
This GDB was... (2 Replies)
Discussion started by: satish@123
2 Replies
10. Shell Programming and Scripting
Hi,
If we want to debug a shell script, then set -vx has to be included in the begining of the script. Just i want to know what purpose -vx is used.
Thanks in advace
Sarwan (2 Replies)
Discussion started by: sarwan
2 Replies
lamssi_cr(7) LAM SSI CR OVERVIEW lamssi_cr(7)
NAME
lamssi_checkpoint_restart - overview of LAM's MPI checkpoint / restart SSI modules
DESCRIPTION
The "kind" for checkpoint / restart SSI modules is "cr". Specifically, the string "cr" (without the quotes) is the prefix that should be
used with the mpirun command line with the -ssi switch. For example:
mpirun -ssi cr blcr C my_mpi_program
LAM/MPI can involuntarily checkpoint and restart parallel MPI jobs. Doing so requires that LAM/MPI was compiled with thread support and
that back-end checkpointing systems are available at run-time. MPI jobs will have to run with at least MPI_THREAD_SERIALIZED support. If
a job elects to run with checkpoint/restart support and an available cr module is found, the job's thread level will automatically be pro-
moted to MPI_THREAD_SERIALIZED. See the User's Guide for more details.
Checkpoint Phases
LAM defines three phases for checkpoint / restart support in each MPI process:
Checkpoint.
When the checkpoint request arrives, before the actual checkpoint occurs.
Continue.
After a checkpoint has successfully completed, in the same process as the checkpoint was invoked in.
Restart
After a checkpoint has successfully completed, in a new / restarted process.
The Continue and Restart phases are identical except for the process in which they are invoked -- the Continue phase is invoked in the same
process as the Checkpoint phase was invoked. The Restart phase is only invoked in newly restarted processes.
AVAILABLE MODULES
LAM currently has two cr modules: blcr and self. In order for an MPI job to be able to be checkpointed and restarted, all of its MPI SSI
modules must support checkpoint/restart. Currently, this means using the crtcp RPI module or the gm RPI module when compiled with gm_get()
support (see the User's Guide for more details).
blcr CR Module
The Berkeley Lab Checkpoint/Restart (BLCR) single-node checkpointer is a software system from Lawrence Berkeley Labs. See the project web
page for more details: http://www.nersc.gov/research/ftg/checkpoint/.
The blcr module has one SSI parameter:
cr_blcr_priority
blcr's default priority is 50.
self CR Module
The self module, when used with checkpoint/restart SSI modules, will invoke the user-defined functions to save and restore checkpoints. It
is simply a mechanism for user-defined functions to be invoked at LAM's Checkpoint, Continue, and Restart phases. Hence, the only data that
is saved during the checkpoint is what is written in the user's checkpoint function. No MPI library state is saved at all.
As such, the model for the self module is slightly different than, for example, the blcr module. Specifically, the Restart function is not
invoked in the same process image of the process that was checkpointed. The Restart phase is invoked during MPI_INIT of a new instance of
the application (i.e., it starts over from main()).
Multiple SSI parameters are available:
cr_self_user_prefix
Specify a string prefix for the name of the checkpoint, continue, and restart functions that should be invoked by LAM. That is, speci-
fying "-ssi cr_self_user_prefix foo" means that LAM expects to find three functions at run-time: int foo_checkpoint(), int foo_con-
tinue(), and int foo_restart(). This is a convenience parameter that can be used instead of the three parameters listed below.
cr_self_user_checkpoint
Name of the user function to invoke during the Checkpoint phase.
cr_self_user_continue
Name of the user function to invoke during the Continue phase.
cr_self_user_restart
Name of the user function to invoke during the Restart phase.
If none of these parameters are specified and the self module is selected, it will use the default prefix lam_cr_self
Finally, the usual priority SSI parameter is also available:
cr_self_priority
self's default priority is 25.
SEE ALSO
lamssi(7), mpirun(1), LAM User's Guide
LAM 7.1.4 July, 2007 lamssi_cr(7)