Hi all,
I am getting the error below when I ssh into the GPU node that my job is assigned to (gpu002) on Yggdrasil. Does anyone have any idea what is causing it?
[vafaeisa@gpu002 ~]$ nvidia-smi
Unable to determine the device handle for GPU 0000:1A:00.0: Unknown Error
Hi,
thanks for letting us know. I tried to reboot the node, but it isn't responding anymore. I'll check and let you know how it goes.
Best
Yann
Hi, it is fixed. Best
Yann