Frequently asked questions - Jobs
My job failed — where do I start?
Check the output file (slurm-<jobid>.out) for error messages, then use sacct to see how the job ended:
For a full walkthrough of common failure modes and how to fix them, see When your job fails.
What is the maximum job time on LUNARC resources?
Batch jobs can request up to 168 hours (7 days).
How do I run an interactive application?
LUNARC supports interactive HPC. Connect via the LUNARC HPC Desktop and use the Desktop on Demand to launch your application.
How do I specify a project or reservation?
Users with membership in multiple projects must specify the project name, and in some cases a reservation. This is explained in the batch system guide.
I want to use the COSMOS GPU nodes
See the batch system guide for the required changes to your submission script. The software guide covers compiler, CUDA, and MPI support for the GPU nodes.
Can I prevent my job from restarting after a node failure?
By default, SLURM restarts jobs if a node fails. To disable this, add the following to the header of your job script:
Authors: Joachim Hein (LUNARC), Rebecca Pitts (LUNARC)
Last Updated: 2024-08-06