Python code runs on login node but not on cluster

High Performance Computing

hpc, parallel, python, slurm, su2

11-08-2016 - Original Discussion by devinmgibson

I work for one of my professors, and we are trying to run SU2 in parallel on a university-owned cluster that uses Slurm as its workload manager. The problem we are running into is that when we ssh into the cluster and run the command:

parallel_computation.py -f SU2.cfg

on a node assigned by Slurm (using sbatch), the code hangs and won't run. The weird thing is that if we run the same command on the login node, it works just fine. Do any of you know what the problem could be?

Here is some additional information:
- We talked with the IT guy in charge of the cluster and he doesn't have enough background to know what is going on.
- In some of our output files we would get the escape sequence [!0134h. When we changed the terminal settings to get rid of it, the code still behaved as described above.
- We can run the code in serial (SU2_CFD "config file") on both the login node and the cluster just fine.
- We have tried running an interactive session on a node (using srun), with no change in behavior.
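
Since the serial binary works everywhere and only the parallel wrapper hangs, one thing worth comparing is the environment each shell actually sees. The following is a minimal sketch, assuming Python 3 is available on both the login node and the compute nodes. The command list (in particular mpirun as the MPI launcher) and the SU2_RUN / SU2_HOME variable names are assumptions about the installation, not something confirmed for this cluster.

```python
import json
import os
import shutil
import socket
import sys

# Executables the parallel run depends on. "mpirun" is an assumption about
# the MPI launcher in use; substitute whatever the installation provides.
COMMANDS = ["parallel_computation.py", "SU2_CFD", "mpirun"]


def environment_report(commands=COMMANDS, environ=None):
    """Collect settings that commonly differ between login and compute nodes."""
    env = os.environ if environ is None else environ
    return {
        "host": socket.gethostname(),
        # Batch jobs usually have no terminal attached to stdin.
        "stdin_is_tty": bool(sys.stdin is not None and sys.stdin.isatty()),
        # Where (if anywhere) each command resolves on this node's PATH.
        "paths": {c: shutil.which(c, path=env.get("PATH")) for c in commands},
        # Variables Slurm sets inside an allocation.
        "slurm": {k: v for k, v in sorted(env.items()) if k.startswith("SLURM_")},
        # Assumed SU2 installation variables; None means "not set".
        "su2": {k: env.get(k) for k in ("SU2_RUN", "SU2_HOME")},
    }


if __name__ == "__main__":
    print(json.dumps(environment_report(), indent=2))
```

Running this once on the login node and once inside an sbatch job, then diffing the two outputs, should show whether a command resolves to a different path (or to nothing) on the compute node.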

Any thoughts would be appreciated! We really want to be able to run the code in-house instead of outsourcing it.

Moderator's Comments:
Please use CODE tags as required by forum rules!

Last edited by RudiC; 11-09-2016 at 04:07 AM. Reason: Added CODE tags.