RSH connection problem while running MPI program


 
# 1  
Old 08-19-2011

I'm trying to run MPI programs on 8 machines, but I get this error:

Code:
connect to address 127.0.0.1 port 544: Connection refused
Trying krb4 rsh...
connect to address 127.0.0.1 port 544: Connection refused
trying normal rsh (/usr/bin/rsh)
lagrid02: Connection refused

When I run it with a machinefile option, I get this error instead:
Code:
lagrid03: No route to host

where lagrid03 is the neighbouring node connected to the master node.

How should I fix this?
# 2  
Old 08-19-2011
What OS are you trying to run these MPI programs on? Paste the output from:

Code:
uname -spr

# 3  
Old 08-19-2011
I'm using CentOS 5.
uname -spr gives:
Linux 2.6.18-238.12.1.el5xen x86_64
# 4  
Old 08-19-2011
Try this URL and see if it helps:

Code:
http://www.open-mpi.org/faq/?category=rsh


Also, can you try this in the shell:

Code:
telnet localhost 544

Also post the output from:

Code:
netstat -apn|grep -w 544|grep LISTEN

And the output from:

Code:
iptables -L


Last edited by dude2cool; 08-19-2011 at 04:32 PM..
# 5  
Old 08-19-2011
Code:
telnet localhost 544
Trying 127.0.0.1...
telnet: connect to address 127.0.0.1: Connection refused
telnet: Unable to connect to remote host: Connection refused

Chain INPUT (policy ACCEPT)
target     prot opt source               destination         
ACCEPT     udp  --  anywhere             anywhere            udp dpt:domain 
ACCEPT     tcp  --  anywhere             anywhere            tcp dpt:domain 
ACCEPT     udp  --  anywhere             anywhere            udp dpt:bootps 
ACCEPT     tcp  --  anywhere             anywhere            tcp dpt:bootps 
RH-Firewall-1-INPUT  all  --  anywhere             anywhere            

Chain FORWARD (policy ACCEPT)
target     prot opt source               destination         
ACCEPT     all  --  anywhere             192.168.122.0/24    state RELATED,ESTABLISHED 
ACCEPT     all  --  192.168.122.0/24     anywhere            
ACCEPT     all  --  anywhere             anywhere            
REJECT     all  --  anywhere             anywhere            reject-with icmp-port-unreachable 
REJECT     all  --  anywhere             anywhere            reject-with icmp-port-unreachable 
RH-Firewall-1-INPUT  all  --  anywhere             anywhere            

Chain OUTPUT (policy ACCEPT)
target     prot opt source               destination         

Chain RH-Firewall-1-INPUT (2 references)
target     prot opt source               destination         
ACCEPT     all  --  anywhere             anywhere            
ACCEPT     icmp --  anywhere             anywhere            icmp any 
ACCEPT     esp  --  anywhere             anywhere            
ACCEPT     ah   --  anywhere             anywhere            
ACCEPT     udp  --  anywhere             224.0.0.251         udp dpt:mdns 
ACCEPT     udp  --  anywhere             anywhere            udp dpt:ipp 
ACCEPT     tcp  --  anywhere             anywhere            tcp dpt:ipp 
ACCEPT     all  --  anywhere             anywhere            state RELATED,ESTABLISHED 
ACCEPT     tcp  --  anywhere             anywhere            state NEW tcp dpt:ssh 
REJECT     all  --  anywhere             anywhere            reject-with icmp-host-prohibited
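
The two errors seen so far usually point to different causes, so it may help to tell them apart before changing anything. The snippet below is only a rough sketch: `classify_rsh_error` is a made-up helper name, and the mapping is a rule of thumb, not a diagnosis.

```shell
#!/bin/sh
# Hypothetical helper (not from the thread): map an rsh/telnet error line
# to its most likely cause, so the two failures can be told apart.
classify_rsh_error() {
    case "$1" in
        *"Connection refused"*)
            # The host answered, but nothing accepted the connection.
            echo "no listener" ;;
        *"No route to host"*)
            # Typical of a REJECT rule using icmp-host-prohibited,
            # or of a genuine routing problem.
            echo "firewall or routing" ;;
        *)
            echo "unknown" ;;
    esac
}

classify_rsh_error "lagrid02: Connection refused"
classify_rsh_error "lagrid03: No route to host"
```

Note that port 544 is the Kerberized rsh service (kshell), while plain rsh uses port 514 (shell). A refused telnet to 544 on localhost therefore only shows that no Kerberos rsh daemon is listening there. It says nothing yet about plain rsh or about the firewall.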

# 6  
Old 08-19-2011
Do you need iptables? If not, disabling it is the simplest solution.

Code:
chkconfig iptables off

and

Code:
service iptables stop

The first command keeps iptables from starting at boot; the second stops it now. Try telnet to port 544 again after disabling iptables.
# 7  
Old 08-19-2011
I use iptables because I had to connect the machine in the internal network to the internet through a gateway.
Is there a workaround that doesn't require me to disable iptables?
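
One possible approach, sketched here under assumptions, is to keep iptables running and accept traffic from the internal cluster network ahead of the final REJECT rule in the `RH-Firewall-1-INPUT` chain shown above. The subnet `192.168.1.0/24` is a placeholder, not taken from the thread, and `trust_subnet_rules` is a hypothetical helper that only prints the commands so they can be reviewed before being run as root on each node.

```shell
#!/bin/sh
# Hypothetical helper: print the iptables commands that would trust one subnet.
# Nothing is executed here; review the output, then run it as root.
trust_subnet_rules() {
    subnet="$1"                          # placeholder value, e.g. 192.168.1.0/24
    chain="${2:-RH-Firewall-1-INPUT}"    # chain name taken from the iptables -L output
    # -I inserts at the top of the chain, ahead of the final REJECT rule.
    echo "iptables -I $chain -s $subnet -j ACCEPT"
    # Persist the running rules across reboots on Red Hat style systems.
    echo "service iptables save"
}

trust_subnet_rules 192.168.1.0/24
```

Accepting the whole internal subnet, rather than a single rsh port, is a deliberate choice in this sketch: rsh opens a second connection back to the client for stderr, and MPI processes generally talk to each other on additional TCP ports, so a single-port rule is unlikely to be enough. This is only reasonable if every machine on that subnet is trusted.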