Baobab: "No devices were found" on one of the GPU002

Hi HPC,

I allocated some resources on gpu002 and got a gpu. However when running code or nvidia-smi there is no device found (see picture).

Best,
Malte

Hello,

Yes I think it is GPU#3 (index starts at 0) on baobab:gpu002 which is down.
I created a report a while ago:

Indeed, @Raphael.Rubino I’ve answered your previous post right now. And @Malte.Algren we’ll check the node, thanks.

Best

This is fixed, the node as 6 GPUs again.
Best

1 Like