Query: slurm-llnl
OS: debian
Section: 1
Format: Original Unix Latex Style Formatted with HTML and a Horizontal Scroll Bar
SLURM(1) Slurm system SLURM(1)NAMEslurm - SLURM system overview.DESCRIPTIONThe Simple Linux Utility for Resource Management (SLURM) is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters. SLURM requires no kernel modifications for its operation and is relatively self-con- tained. As a cluster resource manager, SLURM has three key functions. First, it allocates exclusive and/or non-exclusive access to resources (compute nodes) to users for some duration of time so they can perform work. Second, it provides a framework for starting, exe- cuting, and monitoring work (normally a parallel job) on the set of allocated nodes. Finally, it arbitrates contention for resources by managing a queue of pending work. Optional plugins can be used for accounting, advanced reservation, gang scheduling (time sharing for parallel jobs), backfill scheduling, resource limits by user or bank account, and sophisticated multifactor job prioritization algorithms. SLURM has a centralized manager, slurmctld, to monitor resources and work. There may also be a backup manager to assume those responsibili- ties in the event of failure. Each compute server (node) has a slurmd daemon, which can be compared to a remote shell: it waits for work, executes that work, returns status, and waits for more work. An optional slurmDBD (SLURM DataBase Daemon) can be used for accounting pur- poses and to maintain resource limit information. Basic user tools include srun to initiate jobs, scancel to terminate queued or running jobs, sinfo to report system status, and squeue to report the status of jobs. There is also an administrative tool scontrol available to monitor and/or modify configuration and state infor- mation. APIs are available for all functions. SLURM configuration is maintained in the slurm.conf file. Man pages are available for all SLURM commands, daemons, APIs, plus the slurm.conf file. Extensive documenation is also available on the internet at <http://www.schedmd.com/slurmdocs/>.COPYINGCopyright (C) 2005-2007 The Regents of the University of California. Copyright (C) 2008-2009 Lawrence Livermore National Security. Pro- duced at Lawrence Livermore National Laboratory (cf, DISCLAIMER). CODE-OCEC-09-009. All rights reserved. This file is part of SLURM, a resource management program. For details, see <http://www.schedmd.com/slurmdocs/>. SLURM is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. SLURM is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.SEE ALSOsacct(1), sacctmgr(1), salloc(1), sattach(1), sbatch(1), sbcast(1), scancel(1), scontrol(1), sinfo(1), smap(1), squeue(1), sreport(1), srun(1), sshare(1), sstate(1), strigger(1), sview(1), bluegene.conf(5), slurm.conf(5), slurmdbd.conf(5), wiki.conf(5), slurmctld(8), slurmd(8), slurmdbd(8), slurmstepd(8), spank(8) slurm 2.0 March 2009 SLURM(1)
Related Man Pages |
---|
sattach(1) - debian |
slurm_clear_trigger(3) - debian |
slurm_free_trigger_msg(3) - debian |
slurm_suspend(3) - debian |
gres.conf(5) - debian |
Similar Topics in the Unix Linux Community |
---|
SLURM script for quantum espresso |
MPICH2 version/Error |
problem in time limit of a job on slurm |
Python code runs on login node but not on cluster |