SLURM: Simple Linux Utility for Resource Management

2003 | Andy B. Yoo, Morris A. Jette, and Mark Grondona
This paper describes the Simple Linux Utility for Resource Management (SLURM), a new cluster resource management system developed at the Lawrence Livermore National Laboratory. SLURM is a simple, flexible, and fault-tolerant cluster manager that can scale to thousands of processors. It is designed to be easy to port to different cluster sizes and architectures with minimal effort. The authors believe that SLURM will benefit both users and system architects by providing a simple, robust, and highly scalable parallel job execution environment for their cluster systems.

Linux clusters, built from commodity off-the-shelf (COTS) components, have become popular for parallel computing due to their high performance-cost ratio. As the cost of COTS components decreases and cluster architectures become more scalable, it has become economically feasible to build large-scale clusters with thousands of processors. An essential component for harnessing such a computer is a resource management system. This system performs tasks such as scheduling user jobs, monitoring machine and job status, launching user applications, and managing machine configuration. An ideal resource manager should be simple, efficient, scalable, fault-tolerant, and portable. Unfortunately, there are no open-source resource management systems that meet these requirements. Many existing resource managers have poor scalability and fault-tolerance, making them unsuitable for large clusters. Proprietary systems are often expensive, not available in source-code form, and are typically designed for specific computer systems or interconnects.

The authors designed SLURM with the following goals: simplicity, open-source availability, portability, interconnect independence, and scalability.
SLURM is written in C, uses GNU autoconf, and supports various interconnects and plug-in mechanisms, allowing it to be easily adapted to different infrastructures.
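In practice, the job-scheduling and launch workflow described above is exposed to users through commands such as `sbatch`, `srun`, and `squeue`. As an illustration (not taken from the paper itself, and with resource values that are purely illustrative), a minimal SLURM batch script might look like this; it requires an actual SLURM installation to submit:

```shell
#!/bin/bash
# Minimal SLURM batch script sketch. Job name, node counts, and time
# limit below are illustrative assumptions, not values from the paper.
#SBATCH --job-name=demo-job
#SBATCH --nodes=2            # number of nodes to allocate
#SBATCH --ntasks=4           # total number of parallel tasks
#SBATCH --time=00:05:00      # wall-clock time limit
#SBATCH --output=%x-%j.out   # stdout file: <jobname>-<jobid>.out

# srun launches the tasks in parallel across the allocated nodes.
srun hostname
```

Such a script would be submitted with `sbatch script.sh`, after which `squeue -u $USER` shows its position in the queue, reflecting the scheduling, launching, and monitoring roles of the resource manager described above.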