The UNIX and Linux Forums  
Hello and Welcome from United States to the UNIX and Linux Forums! Thank You for Visiting and Joining Our Global Community.

Go Back   The UNIX and Linux Forums > Special Forums > High Performance Computing
.
google unix.com



High Performance Computing Message Passing Interface (MPI) programming and tuning, MPI library installation and management, parallel administration tools, cluster monitoring, cluster optimization, and more HPC topics.

More UNIX and Linux Forum Topics You Might Find Helpful
Thread Thread Starter Forum Replies Last Post
IRIX mpirun error!! lakeat High Level Programming 1 05-14-2008 05:16 AM
Problem in mozilla running. balarajum HP-UX 3 03-21-2007 09:32 AM
Problem in running an AWK script rachana8p UNIX for Advanced & Expert Users 4 01-03-2007 03:05 AM
how and where can i download sun solaris hpc mpirun software jingwp SUN Solaris 1 07-12-2005 09:27 AM
Executing mpirun commands tesswulf Shell Programming and Scripting 0 02-11-2004 03:57 AM

Closed Thread
English Japanese Spanish French German Portuguese Italian Dutch Swedish Russian Norwegian Hungarian Hebrew Danish Powered by Powered by Google
 
LinkBack Thread Tools Search this Thread Rate Thread Display Modes
  #1 (permalink)  
Old 02-20-2009
The_Watcher The_Watcher is offline
Registered User
  
 

Join Date: Feb 2009
Posts: 2
Unhappy Problem running mpirun

Hi, I have a difficulty running mpirun on my workstation, RHEL 4.

I have uninstalled lam-7.1.4 with 'rpm -e lam-7.1.4'. No problem.
The I downloaded openmpi-1.3.tar.gz. unzipped as normal. No problem.
Then did 'make clean'. Then './configure'. No problems.
Then I did 'make' and then 'make install'. No problems.

I put the path '/usr/local/bin/' and '/usr/local/lib/'into my .bashrc file, and did source .bashrc --> no problem.

The I tried to 'lamboot', and this is where the problem begins. Here is the response (I have swapped my real name for 'username':

username:2DCHANNEL$ lamboot

LAM 7.1.4/MPI 2 C++/ROMIO - Indiana University


So apparently, the old lam-7.1.4 is still hanging around? Still I have tried running a parallel job 'test' with: mpirun -np 4 ./test, and the result is startling:

username:2DCHANNEL$ mpirun -np 4 ./test
ssh: x1-6-00-17-a4-99-51-97: Name or service not known
--------------------------------------------------------------------------
A daemon (pid 7989) died unexpectedly with status 255 while attempting
to launch so we are aborting.

There may be more information reported by the environment (see above).

This may be because the daemon was unable to find all the needed shared
libraries on the remote node. You may set your LD_LIBRARY_PATH to have the
location of the shared libraries on the remote nodes and this will
automatically be forwarded to the remote nodes.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
mpirun noticed that the job aborted, but has no info as to the process
that caused that situation.
--------------------------------------------------------------------------
mpirun: clean termination accomplished

and when I check to see th date of lamboot with 'll /usr/local/bin/lam*' I get:

username:2DCHANNEL$ ll /usr/local/bin/lam*
-rwxr-xr-x 1 root root 194709 Jul 3 2008 /usr/local/bin/lamboot
-rwxr-xr-x 1 root root 162203 Jul 3 2008 /usr/local/bin/lamcheckpoint
-rwxr-xr-x 1 root root 95131 Jul 3 2008 /usr/local/bin/lamclean
-rwxr-xr-x 1 root root 284740 Jul 3 2008 /usr/local/bin/lamd
-rwxr-xr-x 1 root root 122140 Jul 3 2008 /usr/local/bin/lamexec
-rwxr-xr-x 1 root root 211554 Jul 3 2008 /usr/local/bin/lamgrow
-rwxr-xr-x 1 root root 88140 Jul 3 2008 /usr/local/bin/lamhalt
-rwxr-xr-x 1 root root 691644 Jul 3 2008 /usr/local/bin/laminfo
-rwxr-xr-x 1 root root 89662 Jul 3 2008 /usr/local/bin/lamnodes
-rwxr-xr-x 1 root root 162296 Jul 3 2008 /usr/local/bin/lamrestart
-rwxr-xr-x 1 root root 93683 Jul 3 2008 /usr/local/bin/lamshrink
-rwxr-xr-x 1 root root 93373 Jul 3 2008 /usr/local/bin/lamtrace
-rwxr-xr-x 1 root root 193821 Jul 3 2008 /usr/local/bin/lamwipe

Note that these are dated Jul 3 2008 when I first installed lammpi.

What also worries me is the line which says, "ssh: x1-6-00-17-a4-99-51-97: Name or service not known"

Now, ever since I first got my workstation, as I login with my username I get a message like '
x1-6-00-17-a4-99-51-97 not recognised. It may degrade GNOME perfromance. Try putting it in /etc/hosts'. However, it gives an option to ignore this message and continue, which I always do and I have never suffered anything at all, till now apparently after 2-3 years.

What is going on? How do I remove lam-7.1.4 competely from my system, and install the new mpirun suitable for openmpi? (I have installed opempi in order to use mpif90 -- this is ok now and my f90 programs compile ok with mpif90, it is just the running bit that is not working properly.)

How do I rectify the login warning message?

Please advise -- I am not a systems manager, but a newbie.

Cheers.
  #2 (permalink)  
Old 03-05-2009
otheus's Avatar
otheus otheus is offline Forum Staff  
Moderator ala Mode
  
 

Join Date: Feb 2007
Location: Innsbruck, Austria
Posts: 1,884
You don't run mpirun with lam and vice versa. They can, however, peacefully co-exist. Once you set up openmpi, you must set up a cluster hosts file or something. Then for each process, openmpi will attempt to ssh to a host and run your program.
Closed Thread

Bookmarks

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On




All times are GMT -4. The time now is 04:41 AM.


Powered by: vBulletin, Copyright ©2000 - 2006, Jelsoft Enterprises Limited. Language Translations Powered by .
vBCredits v1.4 Copyright ©2007 - 2008, PixelFX Studios
The UNIX and Linux Forums Content Copyright ©1993-2009. All Rights Reserved.Ad Management by RedTyger

Content Relevant URLs by vBSEO 3.2.0