[HPC][Boabab] Monday 2024-06-24:12:30:00 Action on login node

Dear HPC Users,

You may have noticed issues with mounting CIFS/SMB shares, blocking some research project.

More info:

As work-arround, we will apply a custom patch to the login1.baobab server. This will require a server restart, scheduled for June 24, 2024, at 12:30 PM. During this period, the server will be unavailable, and all active connections will be interrupted for approximately 25 minutes.

Please note that implementing this fix may affect the Infiniband status on the login node. Certain features may not work as usual. If these issues are significant, we may need to roll back the patch, requiring additional intervention.

This patch will not be deployed on compute nodes to guarantee optimal production.

We will keep you informed of our progress and notify you once the maintenance is complete.

Thank you for your understanding.

Best regards,


HPC Team

Dear HPC Users,

Due to unexpected behavior, intervention on login1.baobab must be extended.

The login node is not accessible, but you can access Baobab and continue your work via OpenOnDemand (there’s also a terminal).

Thank you for your understanding.


HPC team

Dear HPC Users,

While we successfully tested the update in our development environment, we encountered unexpected behavior in the production environment that prevented us from replicating the expected results. As a result, we have reverted the changes, and there are currently no modifications affecting login node.

Summary:

  • Issue: We have observed differences in behavior between our development and production environments.
  • Current status: No changes have been made, the problem is still ongoing.

We appreciate your understanding and patience as we work towards a resolution.

Best Regards,


HPC team