Submitting Jobs
How to run tasks on the cluster queues
Auto-submitting software
Some FSL commands and/or GUIs automatically queue themselves where appropriate, i.e. you do not need to use 'fsl_sub' to submit these programs.
Please note that this list may not be exhaustive, so you may come across other commands which have been adapted to queue themselves. If you do submit one of these tools to the queues yourself it will still run, but it may not be able to make full use of the cluster resources (e.g. it may not be able to run multiple tasks in parallel).
Other commands run from the terminal command line will need to use the `fsl_sub` command, described below, to submit them to the queue.
module add fsl
This line can be added to your .bash_profile to ensure it takes effect for every login session you have.
Submitting jobs with fsl_sub
Typing fsl_sub before the rest of your command will send the job to the cluster. By default, this will submit your task to the short partition. fsl_sub can automatically choose a queue for you if you provide information about your job's requirements - we would strongly recommend that you provide at least an estimated maximum run time (--jobtime) to allow SLURM to schedule your job efficiently.
There are several ways to select a queue:
- Use the -R (--jobram) and -T (--jobtime) options to fsl_sub to specify the maximum memory and run-time requirements for your job (in GB and minutes of wall time*) respectively. fsl_sub will then select the most appropriate queue for you.
- GPU tasks can be requested using the --coprocessor options (see the Running GPU Tasks section).
- Select a specific partition with the -q (--queue) option. For further information on the available queues and which to use when, see the queues section.
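The queue-selection options above can be combined on the command line. A sketch of typical invocations follows; the script names, the partition name 'short' and the coprocessor name 'cuda' are illustrative and will depend on your site's configuration:

```shell
# Let fsl_sub pick a partition: 4 GiB of RAM, up to 90 minutes of wall time
fsl_sub -R 4 -T 90 ./myjob

# Request a GPU task ('cuda' is a common coprocessor name, but site-specific)
fsl_sub --coprocessor cuda -T 60 ./my_gpu_job

# Explicitly target a partition by name
fsl_sub -q short ./myjob
```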
- The command you want to run on the queues must be in your path - this does NOT include the current folder. If it isn't then you must specify the path to the command; commands/scripts in the current folder must be prefixed with './', e.g. './script'.
- The FMRIB SLURM cluster does not have a 'verylong' or 'bigmem' equivalent queue. See Long Running Tasks below.
- Jobs submitted to the FMRIB SLURM cluster do NOT inherit the 'environment' of your login shell, e.g. environment variables such as FSLDIR are not copied over to your job. Load software configuration (such as FSL) from shell modules or use the '--export' option to fsl_sub to copy the variables to your job (see Passing Environment Variables to Queued Jobs).
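The two approaches to providing your job's environment can be sketched as follows. The variable names and whether '--export' may be repeated are illustrative; check your fsl_sub version's --help output:

```shell
# Option 1: copy selected variables from your login shell into the job
fsl_sub --export FSLDIR --export FSLOUTPUTTYPE -T 60 ./myjob

# Option 2: have the job script load its own environment via modules,
# so nothing needs to be inherited from the login shell:
#   #!/bin/bash
#   module add fsl
#   feat mydesign.feat
```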
- Wall Time: Unlike the FMRIB Jalapeno cluster (which uses CPU time), the SLURM cluster measures job run-time in real time, often called wall time (as in the time on a clock on the wall).
To assess the time necessary for your job to complete you can look at the run-times of similar previous jobs using the 'sacct' command (see Monitoring Tasks).
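For example, sacct can report the elapsed wall time and peak memory of completed jobs, which is a good basis for setting --jobtime and --jobram on future submissions. The job ID and date below are placeholders:

```shell
# Elapsed wall time, peak resident memory and final state of one job
sacct -j 12345 --format=JobID,JobName,Elapsed,MaxRSS,State

# Summarise all of your jobs since a given date
sacct --starttime 2024-01-01 --format=JobID,JobName,Elapsed,State
```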
Example Usage
To queue a job which requires 10GiB of memory and runs for 2 hours use:
fsl_sub -T 120 -R 10 ./myjob
FSLSUB_MEMORY_REQUIRED=32G feat mydesign.feat
would submit a FEAT task informing fsl_sub that you expect to require 32GiB of memory. If units aren't specified then the integer is assumed to be in the units specified in the configuration file (default GiB).
The different partitions have different run-time and memory limits; when a task reaches these limits it will be terminated. Shorter queues also take scheduling precedence over longer ones, so it is advantageous to provide the scheduler with as much information as possible about your job's memory and time requirements.
The command you submit cannot run a graphical interface, as there will be nowhere to display its output.
If you want to run a non-interactive MATLAB task on the queues then see MATLAB jobs.
fsl_sub Options
To see a full list of the available options use:
fsl_sub --help
In addition to the list of options this will also display a list of partitions available for use with descriptions of allowed run times and memory availability. For details on how to use these options see the Advanced Usage section.
Long running tasks
Unlike the Jalapeno cluster, the SLURM cluster does not offer 'infinite' partitions (equivalent to verylong.q, bigmem.q and cuda.q on the Jalapeno cluster). You must break your task up into shorter components, or regularly save state to allow restart, and submit these parts (or resubmit the job continuing where it left off) using job holds to prevent a task running before the previous one completes.
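The chaining described above can be sketched as follows, assuming fsl_sub prints the submitted job's ID to standard output and accepts a -j (--jobhold) option to hold on a prior job; the script names are illustrative and the exact hold flag may vary with your fsl_sub version:

```shell
# Submit the first part and capture its job ID
jid=$(fsl_sub -T 600 ./part1)

# Submit the second part with a hold, so it only starts once part1 completes
fsl_sub -j "$jid" -T 600 ./part2
```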