Getting started with TensorFlow on Baobab (ImportError: libcuda.so.1)

Hi there,

Thank you for the patch, merged (cf. Merge remote-tracking branch 'gitlab.unige.ch_Pablo.Strasser/master' (d10d722d) · Commits · hpc / softs · GitLab ).

FWIW, to ease such workflow, for the next time please fork the GitLab project hpc / softs · GitLab project and send a proper merge request.

You are right about the CUDA system libraries, which are available on GPU nodes only. Here the reason: while moving to CentOS 7 (cf. Baobab migration from CentOS6 to CentOS7 ) we decided to install as less extra software as possible, or, in other words, to have a basic installation shared between all the nodes and the servers as well. CUDA is obviously not part of a basic installation…

From a quick look, it seems that the CUDA application libraries loaded via module do not include libcuda.so* , but only a stub, which should refer to the corresponding system library (my guess is that the latter communicates with the NVIDIA kernel driver).

Moreover, according to the upstream documentation (cf. Build from source  |  TensorFlow ) while compiling, TensorFlow creates symbolic links to the CUDA system libraries, which de facto renders the compiled TensorFlow not portable.

Thx, bye,
Luca