Primary information
Username: nussbaup
Cluster: Yggdrasil
Description
Today (Wednesday), from around 16:30 onwards, I was unable to access the CVMFS filesystem from nodes gpu002 and gpu003 (shared-gpu partition). Just one hour earlier this worked fine.
In my sbatch script I call at some point:
source /cvmfs/sft.cern.ch/lcg/views/LCG_104cuda/x86_64-centos8-gcc11-opt/setup.sh
which yields
zsh:source:1: no such file or directory: /cvmfs/sft.cern.ch/lcg/views/LCG_104cuda/x86_64-centos8-gcc11-opt/setup.sh
I then tried to simply ls the path to setup.sh to see what would happen. This did not work on gpu002 and gpu003 (see the minimal example to reproduce below).
I also tried to “ls” setup.sh from login001, cpu001 (via debug-cpu) and cpu008 (via shared-cpu). It worked on all three of them.
Steps to Reproduce (last tested at 18:00)
salloc --partition shared-gpu --gpus 1 --time 00:02:00 --nodelist gpu002
ls /cvmfs/sft.cern.ch/lcg/views/LCG_104cuda/x86_64-centos8-gcc11-opt/setup.sh
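To tell a hanging mount apart from a file that is reported as missing, the ls can be wrapped in timeout. This is only a sketch under some assumptions: check_path is a hypothetical helper name, the 10-second limit is an arbitrary choice, and it relies on GNU coreutils timeout, which exits with status 124 when the limit is reached.

```shell
# Hypothetical helper: report whether a path is listable, errors out, or hangs.
check_path() {
    path="$1"
    limit="${2:-10}"   # seconds to wait before giving up (assumed value)

    timeout "$limit" ls -- "$path" >/dev/null 2>&1
    status=$?

    if [ "$status" -eq 0 ]; then
        echo "ok"
    elif [ "$status" -eq 124 ]; then
        echo "timeout"   # GNU timeout returns 124 when the time limit is hit
    else
        echo "error"     # ls failed, e.g. "No such file or directory"
    fi
}

check_path /cvmfs/sft.cern.ch/lcg/views/LCG_104cuda/x86_64-centos8-gcc11-opt/setup.sh
```

Run inside the salloc session, this should print one of ok, error or timeout, which might help narrow down whether the CVMFS mount on the node is unresponsive or simply absent.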
Expected Result
I would expect the file path to be displayed:
/cvmfs/sft.cern.ch/lcg/views/LCG_104cuda/x86_64-centos8-gcc11-opt/setup.sh
Actual Result
Nothing is printed (or the allocation is cancelled due to the time limit)