Job Types on Olivia

Olivia is designed for large-scale parallel jobs and GPU-accelerated workloads. With its high-performance compute nodes featuring 256 or 288 CPUs and substantial memory per node, Olivia is suited for computationally intensive applications that can scale across many cores.

The basic allocation units on Olivia are cpus, memory and GPUs. The details about how the billing units are calculated can be found in Projects and accounting. Note that the number of GPUs is counted separately, not as part of the billing units.

Normal

  • Allocation units: cpus and memory

  • Job Limits:

    • maximum 1152 units

  • Maximum walltime: 7 days

  • Priority: normal

  • Available resources:

    • 252 nodes with 256 AMD cpus and 753 GiB RAM

  • Parameter for sbatch/salloc:

    • None, normal is the default

  • Job Scripts: Normal

This is the default job type. Most CPU-only jobs run as normal jobs. The large node size (256 CPUs) makes this partition good for:

  • Large-scale parallel computations

  • Memory-intensive applications requiring substantial RAM

  • Jobs that can efficiently utilize many CPU cores

  • High-throughput computing workflows

  • Scientific simulations requiring significant computational resources

Accel

  • Allocation units: cpus, memory and GPUs

  • Job Limits:

    • maximum 1152 billing units

    • maximum 32 GPUs

  • Maximum walltime: 7 days

  • Priority: normal

  • Available resources: 76 nodes (max 60 per project) with 288 ARM64 cpus, 858 GiB RAM and 4 GH200 GPUs.

  • Parameter for sbatch/salloc:

    • --partition=accel

    • --gpus=N, --gpus-per-node=N or similar, with N being the number of GPUs

  • Job Scripts: Accel

Accel jobs give access to use the Grace Hopper nodes that combine ARM64 CPUs with NVIDIA GH200 GPUs. This is useful for AI/ML training, inference, and other GPU-accelerated applications.

Can be combined with --qos=devel to get higher priority but maximum wall time (2h) and resource limits of devel apply.

Devel

  • Allocation units: cpus and memory and GPUs

  • Job Limits:

    • maximum 576 billing units per job

    • maximum 32 GPUs per job

    • maximum 1152 billing units in use at the same time

    • maximum 64 GPUs in use at the same time

    • maximum 2 running jobs per user

  • Maximum walltime: 2 hours

  • Priority: high

  • Available resources: devel jobs can run on any node on Olivia

  • Parameter for sbatch/salloc:

    • --qos=devel

  • Job Scripts: Devel

This is meant for small, short development or test jobs. Devel jobs get higher priority for them to run as soon as possible. On the other hand, there are limits on the size and number of devel jobs.

Can be combined with --partition=accel to increase priority while having max wall time and job limits of devel job.

If you have temporary development needs that cannot be fulfilled by the devel or short job types, please contact us at support@nris.no.