Problem running mpirun

 
# 1  
Old 02-20-2009

Hi, I am having difficulty running mpirun on my workstation (RHEL 4).

I uninstalled lam-7.1.4 with 'rpm -e lam-7.1.4'. No problem.
Then I downloaded openmpi-1.3.tar.gz and unpacked it as normal. No problem.
Then I did 'make clean', then './configure'. No problems.
Then I did 'make' and then 'make install'. No problems.

I put the paths '/usr/local/bin/' and '/usr/local/lib/' into my .bashrc file and did 'source .bashrc'. No problem.
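For reference, the .bashrc lines described above might look like the following sketch. It assumes Open MPI went into the default /usr/local prefix (which is what './configure' without '--prefix' gives) and that the library directory is meant for LD_LIBRARY_PATH; the original post does not say which variables were actually set.

```shell
# Sketch of ~/.bashrc additions, assuming the default /usr/local prefix.
# Put the Open MPI binaries first so they win over any older copies on PATH.
export PATH="/usr/local/bin:$PATH"
# Prepend the library directory; the ${VAR:+...} form avoids a trailing ':'
# when LD_LIBRARY_PATH was previously empty or unset.
export LD_LIBRARY_PATH="/usr/local/lib${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"
```

After 'source ~/.bashrc', 'echo $PATH' should show /usr/local/bin at the front.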

Then I tried to run 'lamboot', and this is where the problem begins. Here is the response (I have swapped my real name for 'username'):

username:2DCHANNEL$ lamboot

LAM 7.1.4/MPI 2 C++/ROMIO - Indiana University


So apparently the old lam-7.1.4 is still hanging around? Still, I tried running a parallel job 'test' with 'mpirun -np 4 ./test', and the result is startling:

username:2DCHANNEL$ mpirun -np 4 ./test
ssh: x1-6-00-17-a4-99-51-97: Name or service not known
--------------------------------------------------------------------------
A daemon (pid 7989) died unexpectedly with status 255 while attempting
to launch so we are aborting.

There may be more information reported by the environment (see above).

This may be because the daemon was unable to find all the needed shared
libraries on the remote node. You may set your LD_LIBRARY_PATH to have the
location of the shared libraries on the remote nodes and this will
automatically be forwarded to the remote nodes.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
mpirun noticed that the job aborted, but has no info as to the process
that caused that situation.
--------------------------------------------------------------------------
mpirun: clean termination accomplished

And when I check the date of lamboot with 'll /usr/local/bin/lam*' I get:

username:2DCHANNEL$ ll /usr/local/bin/lam*
-rwxr-xr-x 1 root root 194709 Jul 3 2008 /usr/local/bin/lamboot
-rwxr-xr-x 1 root root 162203 Jul 3 2008 /usr/local/bin/lamcheckpoint
-rwxr-xr-x 1 root root 95131 Jul 3 2008 /usr/local/bin/lamclean
-rwxr-xr-x 1 root root 284740 Jul 3 2008 /usr/local/bin/lamd
-rwxr-xr-x 1 root root 122140 Jul 3 2008 /usr/local/bin/lamexec
-rwxr-xr-x 1 root root 211554 Jul 3 2008 /usr/local/bin/lamgrow
-rwxr-xr-x 1 root root 88140 Jul 3 2008 /usr/local/bin/lamhalt
-rwxr-xr-x 1 root root 691644 Jul 3 2008 /usr/local/bin/laminfo
-rwxr-xr-x 1 root root 89662 Jul 3 2008 /usr/local/bin/lamnodes
-rwxr-xr-x 1 root root 162296 Jul 3 2008 /usr/local/bin/lamrestart
-rwxr-xr-x 1 root root 93683 Jul 3 2008 /usr/local/bin/lamshrink
-rwxr-xr-x 1 root root 93373 Jul 3 2008 /usr/local/bin/lamtrace
-rwxr-xr-x 1 root root 193821 Jul 3 2008 /usr/local/bin/lamwipe

Note that these are dated Jul 3 2008, which is when I first installed LAM/MPI.
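One way to see which copies of lamboot and mpirun the shell is actually picking up is to list every match on PATH in lookup order. This is only a sketch: the helper name find_all_on_path is made up for illustration and is not part of LAM or Open MPI.

```shell
# Hypothetical helper: print every executable called $1 found on PATH,
# in the order the shell would search for it.
find_all_on_path() {
    name=$1
    old_ifs=$IFS
    IFS=:
    for dir in $PATH; do
        # An empty PATH entry means the current directory.
        [ -n "$dir" ] || dir=.
        if [ -x "$dir/$name" ] && [ ! -d "$dir/$name" ]; then
            printf '%s\n' "$dir/$name"
        fi
    done
    IFS=$old_ifs
    return 0
}

# The first line printed for each name is the one that actually runs.
find_all_on_path mpirun
find_all_on_path lamboot
```

If a path under /usr/local/bin shows up for lamboot, 'rpm -qf' on that file would show whether any RPM package owns it; files installed from source by 'make install' are not tracked by rpm, so 'rpm -e' would not have removed them.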

What also worries me is the line which says, "ssh: x1-6-00-17-a4-99-51-97: Name or service not known"

Now, ever since I first got my workstation, whenever I log in with my username I get a message like 'x1-6-00-17-a4-99-51-97 not recognised. It may degrade GNOME performance. Try putting it in /etc/hosts'. However, it gives an option to ignore this message and continue, which I always do, and I have never suffered any ill effects in 2-3 years, until now apparently.

What is going on? How do I remove lam-7.1.4 completely from my system and install the new mpirun suitable for openmpi? (I installed openmpi in order to use mpif90 -- this is OK now and my f90 programs compile fine with mpif90; it is just the running part that is not working properly.)

How do I rectify the login warning message?
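Both the login warning and the ssh error name the same host, x1-6-00-17-a4-99-51-97, so a first check is whether that name resolves at all. The sketch below is an assumption-laden illustration: the function names hosts_line and check_host are invented here, and mapping the name to 127.0.0.1 is only reasonable for a single standalone workstation (on a real network the machine's actual IP address would be the better choice).

```shell
# Hypothetical helper: print a candidate /etc/hosts line mapping a hostname
# to the loopback address (assumption: standalone workstation).
hosts_line() {
    printf '127.0.0.1\t%s\n' "$1"
}

# Hypothetical helper: report whether a hostname resolves via the system
# resolver (getent consults /etc/hosts and DNS as configured).
check_host() {
    if getent hosts "$1" >/dev/null 2>&1; then
        echo "$1 resolves"
    else
        echo "$1 does not resolve; candidate /etc/hosts line:"
        hosts_line "$1"
    fi
}

check_host "x1-6-00-17-a4-99-51-97"
```

Editing /etc/hosts needs root; the script only prints the suggested line and does not modify anything.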

Please advise -- I am not a systems manager, but a newbie.

Cheers.
# 2  
Old 03-05-2009
You don't run Open MPI's mpirun with LAM, or vice versa. The two can, however, peacefully co-exist. Once you have set up Open MPI, you must set up a cluster hosts file or similar. Then, for each process, Open MPI will attempt to ssh to a host and run your program.
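As a rough sketch of the hosts-file step for a single workstation: the file name myhosts and the slot count of 4 are assumptions chosen to match the 'mpirun -np 4 ./test' command from the first post, and the launch line assumes the mpirun found on PATH is Open MPI's, not LAM's.

```shell
# Write a minimal Open MPI hostfile that keeps all processes on this machine.
# 'myhosts' is an arbitrary file name; 'slots=4' matches '-np 4'.
hostfile=myhosts
cat > "$hostfile" <<'EOF'
localhost slots=4
EOF

# Launch only if an mpirun and the test program are actually present.
if command -v mpirun >/dev/null 2>&1 && [ -x ./test ]; then
    mpirun -np 4 --hostfile "$hostfile" ./test || echo "mpirun failed"
else
    echo "mpirun or ./test not found; hostfile written to $hostfile"
fi
```

With only localhost listed, Open MPI should not need to ssh to the unresolvable hostname, although that depends on how the installation is configured.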