Python code runs on login node but not on cluster

High Performance Computing


Tags
hpc, parallel, python, slurm, su2

    #1  
11-08-2016
devinmgibson
Registered User
 

I work for one of my professors, and we are trying to run SU2 in parallel on a university-owned cluster that uses Slurm as its workload manager. The problem is that when we ssh into the cluster and run the command:


Code:
parallel_computation.py -f SU2.cfg

on a node assigned by Slurm (using sbatch), the code hangs and won't run. The weird thing is that if we run the same command on the login node, it works just fine. Does anyone know what the problem could be?

Here is some additional information:
- We talked with the IT guy in charge of the cluster, and he doesn't have enough background to know what is going on.
- In some of our output files we would get the escape sequence [!0134h. When we changed the terminal settings to get rid of it, the code still behaved as described above.
- We can run the code in serial (SU2_CFD "config file") on both the login node and the cluster just fine.
- We have tried running an interactive session on a node (using srun); the behavior did not change.
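
One way to narrow this down would be to compare what the login node and a compute node actually see. The following is a minimal sketch, not a fix. It assumes the difference is environmental (PATH, terminal settings, Slurm variables). The function name `environment_report` is made up for illustration, and the default command list, in particular `mpirun`, is an assumption about what the parallel script launches rather than something confirmed here.

```python
import os
import shutil
import socket
import sys


def environment_report(commands=("parallel_computation.py", "SU2_CFD", "mpirun")):
    """Collect details that may differ between a login node and a compute node.

    The default command names are assumptions for illustration; adjust them
    to whatever the job actually invokes.
    """
    report = {
        "hostname": socket.gethostname(),
        "python": sys.executable,
        # Batch jobs normally have no terminal attached to stdin.
        "stdin_is_tty": os.isatty(0),
        "TERM": os.environ.get("TERM"),
    }
    # Record where (if anywhere) each command resolves on PATH.
    for name in commands:
        report["which " + name] = shutil.which(name)
    # Record the variables Slurm sets inside a job (none on a plain login shell).
    for key in sorted(os.environ):
        if key.startswith("SLURM_"):
            report[key] = os.environ[key]
    return report


if __name__ == "__main__":
    for key, value in environment_report().items():
        print(f"{key} = {value}")
```

Running this once on the login node and once inside an sbatch or srun job, then comparing the two outputs, would show whether the same executables and terminal settings are visible in both places.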

Any thoughts would be appreciated! We really want to be able to run the code in-house instead of outsourcing it.


Moderator's Comments:
Please use CODE tags as required by forum rules!

Last edited by RudiC; 11-09-2016 at 03:07 AM. Reason: Added CODE tags.