[2025] Current issues on HPC Cluster

Description

Cluster: Baobab

The team managing the storage for the servers did a maintenance this morning and all our admin servers crashed. We are investigating as normally it is fully redundant

In the meantime, the running jobs are probably still running but slurm is stopped.

edit: the service is restored. We’ll now investigate with the storage team why this happened


HPC Team

Status : Resolved :green_circle:

start: 2025-09-04T09:07:00Z
end:2025-09-04T09:50:00Z