Primary informations
Username: mongin
Cluster: baobab
Description
Since the upgrade, I cannot load PyTorch-bundle/2.1.2-CUDA-12.1.1 , which was working just beofre
Steps to Reproduce
Before the last updrade, I was doing:
ml GCC/12.3.0 OpenMPI/4.1.5 PyTorch-bundle/2.1.2-CUDA-12.1.1
As decsribed by @Adrien.Albert New software installed: PyTorch-bundle version 2.1.2-CUDA-12.1.1.
But since last upgrade, OpenMPI/4.1.5
is not listed anymore:
module spider openMPI
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
OpenMPI:
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Description:
The Open MPI Project is an open source MPI-2 implementation.
Versions:
OpenMPI/1.10.3
OpenMPI/2.1.1
OpenMPI/2.1.2
OpenMPI/3.1.1
OpenMPI/3.1.3
OpenMPI/3.1.4
OpenMPI/4.0.3
OpenMPI/4.0.5
OpenMPI/4.1.1
OpenMPI/4.1.4
OpenMPI/4.1.6
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
For detailed information about a specific "OpenMPI" package (including how to load the modules) use the module's full name.
Note that names that have a trailing (E) are extensions provided by other modules.
For example:
$ module spider OpenMPI/4.1.6
-----------------------------------------------------------------------------------
If I do
ml GCC/13.2.0 OpenMPI/4.1.6 PyTorch-bundle/2.1.2-CUDA-12.1.1
I instead have
Lmod has detected the following error: The following module(s) are unknown: "OpenMPI/4.1.5" "PyTorch-bundle/2.1.2-CUDA-12.1.1"
Please check the spelling or version number. Also try "module spider ..."
It is also possible your cache file is out-of-date; it may help to try:
$ module --ignore_cache load "OpenMPI/4.1.5" "PyTorch-bundle/2.1.2-CUDA-12.1.1"
Also make sure that all modulefiles written in TCL start with the string #%Module
Would it be possible to fix so that the bundle can be used with OpenMPI/4.1.6 ?