Will TensorFlow 2.2.0 support CUDA 10.2? #38194

Farxial · 2020-04-03T12:03:53Z

Hi :) I am going to work with neural networks and TensorFlow.
I have tried installing several versions of tensorflow and tensorflow-gpu with pip (for example, 2.1.0 and 2.2.0-rc0 of both packages), and in Python 3.7 I get an error about loading cudart64_101.dll, like this:
>>> import tensorflow as tf
2020-03-31 03:30:42.120394: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'cudart64_101.dll'; dlerror: cudart64_101.dll not found
2020-03-31 03:30:42.134395: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.
I copied the cuDNN files and set the CUDA_HOME environment variable to the value of CUDA_PATH. My hardware meets the requirements.
As far as I understand, TensorFlow 2.1.0 should work fine with CUDA 10.1. But I don't want to use CUDA 10.1 unless it is absolutely necessary: I have just installed 10.2 and don't want to downgrade now only to reinstall 10.2 again later.
I am ready to wait for the 2.2.0 release, if that makes sense in my case. So my question is: Will TensorFlow 2.2.0 support CUDA 10.2?
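
One way to see which CUDA runtime names the current Python process can load, independently of TensorFlow, is to probe them directly. The following is a minimal sketch, not TensorFlow's own loading logic: the helper name `probe_libraries` is hypothetical, and the result reflects the search rules of Python's `ctypes` loader, which may differ from those TensorFlow uses.

```python
import ctypes


def probe_libraries(names):
    """Try to load each shared library by name and report which ones resolve.

    Returns a dict mapping each name to True (loaded) or False (not loadable).
    """
    results = {}
    for name in names:
        try:
            ctypes.CDLL(name)
            results[name] = True
        except OSError:
            # Raised when the dynamic loader cannot find or open the library.
            results[name] = False
    return results


if __name__ == "__main__":
    # cudart64_101.dll is the name from the warning above; cudart64_102.dll is
    # the CUDA 10.2 runtime name mentioned later in this thread.
    candidates = ["cudart64_101.dll", "cudart64_102.dll"]
    for lib, ok in probe_libraries(candidates).items():
        print(f"{lib}: {'loadable' if ok else 'not loadable'}")
```

If only the 10.2 name is loadable, that would be consistent with the warning shown above, since the 2.1.0 and 2.2.0-rc0 packages ask for the 10.1 name.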


gadagashwini-zz · 2020-04-06T11:53:12Z

@Farxial, to use CUDA 10.2 with TensorFlow 2.2, please build TensorFlow from source.
Follow the instructions mentioned here. Thanks

mihaimaruseac · 2020-04-06T17:12:45Z

CUDA 10.2 should be compatible with CUDA 10.1. We are building the official pip packages with CUDA 10.1 because we already changed the infrastructure a lot to enable the Python 3.8 packages. The next release will have the infrastructure changed for newer CUDA versions.

Until then, you can try compiling from source, or symlinking the libraries.

Farxial · 2020-04-08T03:56:27Z

Symlinking works.
Nice :)
Thanks for answers :)

gadagashwini-zz · 2020-04-08T07:46:03Z

@Farxial, Closing since the issue is resolved. Thanks!

petervandenabeele · 2020-05-17T13:50:36Z

UPDATE: WARNING in #34759 (comment)

The symlink works for me too, details below (installed on Ubuntu 20.04):

The actual CUDA 10.2 libcudart library is in /usr/local/cuda-10.2/.
TensorFlow 2.2 looks for libcudart.so.10.1 in a number of places (and fails to find it in all of them):

strace -o test1.log /usr/bin/python .../quick_tour.py
...
openat(AT_FDCWD, "/home/peter_v/.local/lib/python3.8/site-packages/tensorflow/python/../libcudart.so.10.1", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
openat(AT_FDCWD, "/home/peter_v/.local/lib/python3.8/site-packages/tensorflow/python/libcudart.so.10.1", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
openat(AT_FDCWD, "/home/peter_v/.local/lib/python3.8/site-packages/tensorflow/python/../libcudart.so.10.1", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
openat(AT_FDCWD, "/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 20
fstat(20, {st_mode=S_IFREG|0644, st_size=83403, ...}) = 0
mmap(NULL, 83403, PROT_READ, MAP_PRIVATE, 20, 0) = 0x7fb8ad602000
close(20)                               = 0
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/tls/libcudart.so.10.1", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libcudart.so.10.1", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
openat(AT_FDCWD, "/usr/lib/x86_64-linux-gnu/tls/libcudart.so.10.1", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
openat(AT_FDCWD, "/usr/lib/x86_64-linux-gnu/libcudart.so.10.1", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
openat(AT_FDCWD, "/lib/libcudart.so.10.1", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
openat(AT_FDCWD, "/usr/lib/libcudart.so.10.1", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
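
The list of searched locations can be pulled out of such a log automatically. The sketch below assumes the `openat(...) = -1 ENOENT` line format shown in the trace above; the function name `missing_paths` and its `needle` parameter are hypothetical.

```python
import re

# Matches openat() calls that failed with ENOENT and captures the quoted path.
_ENOENT_OPEN = re.compile(r'openat\([^,]+,\s*"([^"]+)".*=\s*-1 ENOENT')


def missing_paths(strace_text, needle="libcudart"):
    """Return, in order, the paths containing `needle` that were not found."""
    paths = []
    for line in strace_text.splitlines():
        match = _ENOENT_OPEN.search(line)
        if match and needle in match.group(1):
            paths.append(match.group(1))
    return paths


if __name__ == "__main__":
    # Two lines copied from the trace above: one failed lookup, one success.
    sample = (
        'openat(AT_FDCWD, "/usr/lib/x86_64-linux-gnu/libcudart.so.10.1", '
        "O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)\n"
        'openat(AT_FDCWD, "/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 20\n'
    )
    for path in missing_paths(sample):
        print(path)
```

Any directory in the resulting list is a place where the loader would pick up a file named libcudart.so.10.1.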

Somewhat at random, I decided to symlink from /usr/lib/x86_64-linux-gnu/ to the libcudart.so.10.2 file.

sudo ln -s /usr/local/cuda-10.2/targets/x86_64-linux/lib/libcudart.so.10.2 /usr/lib/x86_64-linux-gnu/libcudart.so.10.1
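
The `ln -s` step above can also be scripted. This is a minimal sketch that runs against stand-in files in a temporary directory rather than the real system paths; the helper name `alias_library` is hypothetical, and the check that refuses to overwrite an existing file is an added precaution that `ln -s` applies in a similar way by default.

```python
import os
import tempfile
from pathlib import Path


def alias_library(real_lib, alias):
    """Create `alias` as a symlink to `real_lib`, refusing to overwrite anything."""
    real_lib, alias = Path(real_lib), Path(alias)
    if not real_lib.is_file():
        raise FileNotFoundError(f"library not found: {real_lib}")
    if alias.exists() or alias.is_symlink():
        raise FileExistsError(f"refusing to overwrite: {alias}")
    alias.symlink_to(real_lib)
    return alias


if __name__ == "__main__":
    # Stand-in files only: nothing outside the temporary directory is touched.
    with tempfile.TemporaryDirectory() as tmp:
        real = Path(tmp) / "libcudart.so.10.2"
        real.write_bytes(b"")
        link = alias_library(real, Path(tmp) / "libcudart.so.10.1")
        print(link.name, "->", os.readlink(link))
```

Pointing the helper at the real paths from the command above would need root privileges, as the `sudo` there indicates. The thread reports that the alias works for libcudart; it does not establish that the same is safe for other CUDA libraries.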

I am actually using mostly the CPU (my 8 core CPU seems faster than a smallish laptop GPU and also the GPU runs easily into OOM for real work-loads).

bbqf · 2020-05-25T10:04:07Z

Just to confirm, symlink idea works on Windows too. I symlinked C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2\bin\cudart64_102.dll as cudart64_101.dll in the same folder.

palisadoes · 2020-05-26T17:52:08Z

In Ubuntu 20.04 you don't have to symlink, nor build from source. You just need to modify the installation steps in the TensorFlow documentation at https://www.tensorflow.org/install/gpu to match the new Cuda 10-2 package names.

Here are the modifications to the https://www.tensorflow.org/install/gpu instructions that worked for me:

# Download the 10-2 packages
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/cuda-repo-ubuntu1804_10.2.89-1_amd64.deb
sudo apt-key adv --fetch-keys https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/7fa2af80.pub
sudo dpkg -i cuda-repo-ubuntu1804_10.2.89-1_amd64.deb
sudo apt-get update
wget http://developer.download.nvidia.com/compute/machine-learning/repos/ubuntu1804/x86_64/nvidia-machine-learning-repo-ubuntu1804_1.0.0-1_amd64.deb
sudo apt install ./nvidia-machine-learning-repo-ubuntu1804_1.0.0-1_amd64.deb
sudo apt-get update

# Install the ubuntu drivers, if not done so already
sudo ubuntu-drivers autoinstall

# Install the 10-2 versions of packages
sudo apt-get install -y --no-install-recommends \
    cuda-10-2 \
    libcudnn7=7.6.5.32-1+cuda10.2 \
    libcudnn7-dev=7.6.5.32-1+cuda10.2 \
    libnvinfer7=7.0.0-1+cuda10.2 \
    libnvinfer-dev=7.0.0-1+cuda10.2 \
    libnvinfer-plugin7=7.0.0-1+cuda10.2 \
    cuda-cudart-10-1

This works for a clean install.

For pre-existing configurations you may need to uninstall previous Cuda 10-1 packages beforehand.

legel · 2020-05-31T03:22:56Z

Just to confirm: the solution by @palisadoes works.
Make sure you have installed everything (developer version), and then you can run:
sudo apt-get install cuda-cudart-10-1

saket424 · 2020-06-07T17:42:24Z

I had to install libnvinfer-plugin-dev to fix the error that /usr/include/x86_64-linux-gnu/NvInferPlugin.h was not found:

dpkg -l | grep libnvinfer
ii libnvinfer-dev 7.0.0-1+cuda10.2 amd64 TensorRT development libraries and headers
ii libnvinfer-plugin-dev 7.0.0-1+cuda10.2 amd64 TensorRT plugin libraries
ii libnvinfer-plugin7 7.0.0-1+cuda10.2 amd64 TensorRT plugin libraries
ii libnvinfer7 7.0.0-1+cuda10.2 amd64 TensorRT runtime libraries
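
A listing like the one above can be checked mechanically for packages built against a different CUDA version. This is a minimal sketch that assumes the `ii <name> <version> <arch> <description>` row layout shown above; the helper names `parse_dpkg_list` and `mismatched` are hypothetical.

```python
def parse_dpkg_list(text):
    """Parse `dpkg -l` rows into {package: version} for installed ('ii') packages."""
    packages = {}
    for line in text.splitlines():
        fields = line.split()
        # Rows look like: ii <name> <version> <arch> <description...>
        if len(fields) >= 3 and fields[0] == "ii":
            packages[fields[1]] = fields[2]
    return packages


def mismatched(packages, suffix="+cuda10.2"):
    """Return the packages whose version string does not end with `suffix`."""
    return {name: ver for name, ver in packages.items() if not ver.endswith(suffix)}


if __name__ == "__main__":
    # Rows copied from the listing above.
    listing = """\
ii libnvinfer-dev 7.0.0-1+cuda10.2 amd64 TensorRT development libraries and headers
ii libnvinfer-plugin-dev 7.0.0-1+cuda10.2 amd64 TensorRT plugin libraries
ii libnvinfer-plugin7 7.0.0-1+cuda10.2 amd64 TensorRT plugin libraries
ii libnvinfer7 7.0.0-1+cuda10.2 amd64 TensorRT runtime libraries
"""
    print(mismatched(parse_dpkg_list(listing)))
```

An empty result means every listed package carries the expected CUDA suffix in its version string.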

thomasaarholt · 2020-07-12T15:03:08Z

Expanding on the Windows fix for people who aren't familiar (like myself) with symlinks and just want it to work.
As admin, in cmd, paste:

mklink /H "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2\bin\cudart64_101.dll" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2\bin\cudart64_102.dll"

Alternatively and more clearly written, navigate to the directory and do the same thing:

cd "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2\bin"
mklink /H cudart64_101.dll cudart64_102.dll
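
Note that `mklink /H` creates a hard link rather than a symbolic link. A rough Python equivalent is sketched below and demonstrated on a stand-in file in a temporary directory; the helper name `hard_link_alias` is hypothetical. `os.link` takes the existing file first and the new name second, the opposite of the `mklink` argument order.

```python
import os
import tempfile


def hard_link_alias(directory, existing_name, alias_name):
    """Create `alias_name` in `directory` as a hard link to `existing_name`.

    An alias that already exists is left untouched.
    """
    source = os.path.join(directory, existing_name)
    alias = os.path.join(directory, alias_name)
    if not os.path.exists(alias):
        os.link(source, alias)  # order: existing file, then new link name
    return alias


if __name__ == "__main__":
    # Stand-in file only: the real CUDA bin directory is not touched here.
    with tempfile.TemporaryDirectory() as tmp:
        with open(os.path.join(tmp, "cudart64_102.dll"), "wb") as handle:
            handle.write(b"")
        alias = hard_link_alias(tmp, "cudart64_102.dll", "cudart64_101.dll")
        print(os.path.basename(alias), "created:", os.path.exists(alias))
```

Running it against the real directory under C:\Program Files would need an elevated prompt, as with the commands above.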

I'm quite surprised that there doesn't exist out-of-the-box support for CUDA 10.2 yet. I mean, CUDA 11 is out.

jjl-jjl · 2020-07-21T00:22:47Z

After trying nearly every solution I could find for Windows, including @thomasaarholt's fix,
it turned out that TensorFlow could not find any of the DLLs, even with the system path set to
"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2\bin".
The solution that worked for me with Python 3.8 was:

import os
os.add_dll_directory("C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.2/bin")

With this, all the TensorFlow DLLs could be loaded and everything works.

I got the hint to try this here: https://stackoverflow.com/questions/59330863/cant-import-dll-module-in-python
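
`os.add_dll_directory` exists only on Windows with Python 3.8 or newer, so a script shared across machines may want to guard the call. A minimal sketch, with the hypothetical helper name `register_dll_directory`:

```python
import os
import sys


def register_dll_directory(path):
    """Add `path` to the DLL search path where that is supported.

    Returns the handle from os.add_dll_directory, or None if nothing was done.
    """
    # os.add_dll_directory is available only on Windows with Python 3.8+.
    if sys.platform != "win32" or not hasattr(os, "add_dll_directory"):
        return None
    if not os.path.isdir(path):
        return None
    return os.add_dll_directory(path)


if __name__ == "__main__":
    cuda_bin = "C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.2/bin"
    handle = register_dll_directory(cuda_bin)
    print("registered" if handle is not None else "skipped")
    # Import TensorFlow only after this point so the directory is searched.
```

On other platforms, or when the directory is missing, the helper does nothing, so the same script still runs on machines without CUDA.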

GoingMyWay · 2020-07-23T17:45:08Z

After many months, CUDA 10.2 still cannot work with TF 2.2?

mihaimaruseac · 2020-07-24T18:38:01Z

TF 2.2 won't be patched to support newer CUDA versions. We can only bring new versions of CUDA with newer versions of TF (likely 2.4).

oliharvey · 2020-08-20T14:12:12Z

I am desperate for 10.2 support! My company has bought me a graphics card and I can't get it to play with CUDA despite all the above suggestions. I have tried the nightly build of TensorFlow (which is 2.4), but it seems it still looks for 10.1.

Has anybody produced a build that supports 10.2 ?

GoingMyWay · 2020-08-21T02:09:36Z

Please try conda install -c anaconda tensorflow-gpu=1.15.0. Anaconda built TF under CUDA 10.2

oliharvey · 2020-08-23T20:42:41Z

> Please try conda install -c anaconda tensorflow-gpu=1.15.0. Anaconda built TF under CUDA 10.2

thanks very much.
I did give this a shot, but no luck so far. For one thing it seems the max conda version of tensorflow is 2.1 - actually I need at least 2.2 for what I'm doing (Tensorflow.Net). Also - I am seeing the following error which I can't make much sense of:

The following specifications were found to be incompatible with your CUDA driver:

  - feature:/win-64::__cuda==10.2=0
  - feature:|@/win-64::__cuda==10.2=0

Your installed CUDA driver is: 10.2

mihaimaruseac · 2020-08-24T16:41:29Z

I think that in this case the best solution is to try building on the target machine with the CUDA 10.2 headers. NVIDIA claims compatibility between 10.1 and 10.2, so it should be possible to compile from source and have something working.

ZhihuaLiuEd · 2020-09-19T09:09:47Z

On my Windows machine with an RTX 2060, the symlink approach works for cuDNN as well:

cd "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2\bin"
mklink /H cudnn64_7.dll cudnn64_8.dll

alexshvid · 2020-12-16T08:48:22Z

Compiled v2.3.1 for Cuda 10.2 in my fork:
v2.3.1-cuda10.2

jung-youjin · 2021-08-04T07:35:26Z

@palisadoes, is your modified install procedure from above (the CUDA 10-2 packages plus cuda-cudart-10-1) valid only on Ubuntu 20.04? I'm curious whether it works on Ubuntu 18.04 as well.

gadagashwini-zz mentioned this issue Apr 17, 2020: Could not load dynamic library 'libcudart.so.10.1'; dlerror: libcudart.so.10.1: cannot open shared object file: No such file or directory #38578 (Closed)
ravikyram mentioned this issue May 4, 2020: tensorflow-gpu: Could not load dynamic library 'libcudart.so.10.1' #39132 (Closed)
mihaimaruseac mentioned this issue May 18, 2020: Using tensorflow gpu 2.1 with Cuda 10.2 #34759 (Closed)
cardboardcode mentioned this issue May 30, 2020: How to predict by using gpu? idealo/image-super-resolution#108 (Open)
Saduf2019 mentioned this issue Jun 8, 2020: Tensorflow v2.2 build fails with cuda 10.2 TensorRT 7.0.0.11-1 #40245 (Closed)
Saduf2019 mentioned this issue Jun 26, 2020: Is it possible to use TensorFlow 2.2 with Cuda 10.0? #40800 (Closed)
jeremysalwen mentioned this issue Jul 4, 2020: Openspiel+Bazel+Tensorflow build failure google-deepmind/open_spiel#172 (Closed)
amahendrakar mentioned this issue Jul 21, 2020: TensorFlow 2.2.0 doesn't detect GPU with CUDA version 10.2 #41374 (Closed)
ravikyram mentioned this issue Aug 10, 2020: libcudart.so.10.1 is not included in the cuda 10.2 package #42166 (Closed)
amahendrakar mentioned this issue Aug 11, 2020: Tf-nightly 2.4.0 does not recognize Cuda 10.2 #42229 (Closed)
ravikyram mentioned this issue Aug 13, 2020: Could not load dynamic library 'libcusparse.so.10 #42318 (Closed)
This was referenced Sep 11, 2020: Upgrade JavaCPP to just released version 1.5.4 tensorflow/java#110 (Merged)
This was referenced Sep 11, 2020: Try to build with Cuda 10.2 on Linux tensorflow/java#111 (Closed)
Flowingsun007 mentioned this issue Oct 12, 2020: DLPerf summary of test experience Oneflow-Inc/DLPerf#75 (Open)
monacv mentioned this issue Jun 8, 2021: How to get any tensorflow version to work with CUDA 10.2 in CentOS 7? #50136 (Closed)
