Yggdrasil scheduled maintenance: 28-30 January 2025

Dear users,

We will be performing software and hardware maintenance on the Yggdrasil HPC cluster from 28 to 30 January 2025, starting at 08:00 +0100.

You’ll receive an email when the maintenance is finished. The cluster will be completely unavailable during this time, with no access whatsoever (not even to retrieve files).

When submitting a job, make sure that its requested wall time (duration) doesn’t overlap with the start of the maintenance; otherwise the job will only be scheduled after the maintenance.

What will be done during this maintenance:

  1. All compute nodes will be reinstalled with a new major OS release, Rocky 9. Compute nodes are currently on Rocky 8.

  2. Upgrade Slurm to version 24.11.0.

  3. Update all servers to the latest security and bugfix releases.
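
As a rough aid for the wall-time advice above, the following is a minimal Python sketch that estimates the longest wall time a job can request and still finish before the maintenance starts. It assumes the job starts running immediately (queue wait time is not accounted for). The function name `max_walltime` and the 10-minute safety margin are illustrative assumptions, not part of any cluster tooling. The result uses Slurm's `days-hours:minutes:seconds` time format.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

# Maintenance start taken from the announcement: 28 January 2025, 08:00 +0100.
MAINTENANCE_START = datetime(2025, 1, 28, 8, 0, tzinfo=timezone(timedelta(hours=1)))


def max_walltime(now: datetime,
                 margin: timedelta = timedelta(minutes=10)) -> Optional[str]:
    """Return the longest wall time (D-HH:MM:SS) that ends before the
    maintenance start, or None if no usable time is left.

    `margin` is a hypothetical safety buffer; it assumes the job starts
    at `now`, which is optimistic on a busy cluster.
    """
    remaining = MAINTENANCE_START - now - margin
    if remaining <= timedelta(0):
        return None
    total = int(remaining.total_seconds())
    days, rest = divmod(total, 86400)
    hours, rest = divmod(rest, 3600)
    minutes, seconds = divmod(rest, 60)
    return f"{days}-{hours:02d}:{minutes:02d}:{seconds:02d}"


if __name__ == "__main__":
    limit = max_walltime(datetime.now(timezone.utc))
    print(limit if limit else "no time left before the maintenance")
```

The printed value could then be passed to `sbatch --time=...`. A job requesting more than this is expected to stay pending until the maintenance is over, as described above.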

During the maintenance period we will be very busy and can provide only limited user support.

Thank you for your understanding.
Best regards,
The HPC Team

Could you please resolve the conflicting information about which cluster will be updated? The title states Yggdrasil, while the text states Bamboo. (The same holds for the email.)

Indeed, thanks for the notification. The cluster that will get updated is Yggdrasil!

Dear users, the maintenance is now finished.

What has been done:

  1. Reinstalled all compute nodes and the login node with Rocky 9.5.
  2. Upgraded Slurm to version 24.11.1.
  3. Reinstalled all storage servers with Rocky 9.5.
  4. Reinstalled the Slurm server with Rocky 9.5.
  5. Updated the admin server with the latest security and bug fixes.

As always, please let us know if you find anything that does not work as expected.

Best regards,

HPC Team, Yann, Adrien, Gaël