Baobab scheduled maintenance: 26-27 February 2020

Dear users,

as just announced on the baobab-announce@ mailing list, we will perform software and hardware maintenance on the Baobab HPC cluster on Wednesday 26th and Thursday 27th February 2020.

The maintenance will start at 08:00 +0100 on Wednesday, and you will receive an email when it is over.

The cluster will be totally unavailable during this period, with no access at all (not even to retrieve files).

If you submit a job in the meantime, make sure that its expected wall time (duration) ends before the start of the maintenance; otherwise your job will only be scheduled after the maintenance.
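
As a rough illustration of the wall-time rule above, the following Python sketch computes the longest wall time that still finishes before the maintenance starts and formats it in the `days-hours:minutes:seconds` form that Slurm accepts for `--time`. The helper names (`max_walltime`, `slurm_time`) and the 10-minute safety margin are assumptions made for this example; they are not part of the cluster tooling.

```python
from datetime import datetime, timedelta, timezone

# Maintenance start taken from the announcement: 26 February 2020, 08:00 +0100.
MAINTENANCE_START = datetime(2020, 2, 26, 8, 0, tzinfo=timezone(timedelta(hours=1)))


def max_walltime(now, margin=timedelta(minutes=10)):
    """Return the longest wall time that ends before the maintenance.

    `margin` is a hypothetical safety buffer (an assumption of this sketch).
    Returns None when there is no time left before the maintenance.
    """
    available = MAINTENANCE_START - now - margin
    if available <= timedelta(0):
        return None
    return available


def slurm_time(delta):
    """Format a timedelta as D-HH:MM:SS, a time format accepted by Slurm's --time."""
    total = int(delta.total_seconds())
    days, rest = divmod(total, 86400)
    hours, rest = divmod(rest, 3600)
    minutes, seconds = divmod(rest, 60)
    return f"{days}-{hours:02d}:{minutes:02d}:{seconds:02d}"


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    limit = max_walltime(now)
    if limit is None:
        print("No time left before the maintenance; the job would start afterwards.")
    else:
        print(f"Request at most: --time={slurm_time(limit)}")
```

A job requesting more than the printed limit would overlap with the maintenance window and, as stated above, would only be scheduled once the maintenance is over.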

What we plan to do during this maintenance:

  1. upgrade of the ${HOME} and ${SCRATCH} clients and backends (BeeGFS 7.1.4 bugfix release)
  2. upgrade of the cluster job scheduler (Slurm 19.05.5 bugfix release)
  3. add soft quota to ${HOME}
  4. tests for emergency shutdown procedure

Thanks for your understanding.

Best regards,
the HPC team

Hi there,

as just announced on the baobab-announce@ mailing list, I forgot to send the maintenance-over email. Sorry for the inconvenience.

The maintenance ended on Friday 28th at 09:15, although the login node (not the compute nodes) had a module problem until 10:30 (cf. Modules loading error since HPC update).

Here is a list of what has been done:

  1. upgrade to the latest CentOS 7 (7.7.1908 plus latest updates)
  2. upgrade of the ${HOME} and ${SCRATCH} clients and backends (BeeGFS 7.1.4 bugfix release)
  3. upgrade of the cluster job scheduler (Slurm 19.05.5 bugfix release)
  4. add soft quota to ${HOME}

Best regards,
the HPC team