Installation issue: undefined symbol: __cudaPopCallConfiguration #19

davidbau · 2018-12-17T22:56:48Z

On linux, when I try to install and use pytorch_scatter, I get undefined symbol: __cudaPopCallConfiguration immediately upon importing torch_scatter.

Using pytorch 1.0.0 and CUDA 9.0 is on the PATH (and include is on the CPATH):

$ python -c "import torch; print(torch.__version__)"
1.0.0
$ echo $CPATH
/usr/local/cuda-9.0/include
$ echo $PATH
/usr/local/cuda-9.0/bin:/afs/csail.mit.edu/u/d/davidbau/.conda/envs/p3t1/bin...

I've tried uninstalling and resintalling (without cache) on pip pip install --no-cache-dir torch_scatter, but the error remains. Any tips?

Details - ubuntu 16.04

$ lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description:    Ubuntu 16.04.5 LTS
Release:        16.04
Codename:       xenial

Environment installed via conda using the following env.yml

name: p3t1
channels:
  - pytorch
dependencies:
  - python=3.6
  - cudatoolkit=9.0
  - cudnn=7.1.2
  - pytorch=1.0
  - torchvision
  - mkl-include
  - numpy
  - scipy
  - scikit-learn
  - matplotlib
  - graphviz
  - numba
  - jupyter
  - pyyaml
  - mkl
  - setuptools
  - cmake
  - cffi
  - ujson
  - tqdm
  - pip
  - pip:
    - torch-scatter

The text was updated successfully, but these errors were encountered:

rayush7 · 2018-12-23T14:32:50Z

I am facing the same problem. Did you figure out how to solve the problem?

davidbau · 2018-12-24T01:28:14Z

Not yet.

rusty1s · 2018-12-24T06:44:26Z

Did you try to download the repo and run python setup.py install?

davidbau · 2018-12-24T12:05:12Z

Yes, same issue occurs. python setup.py install looks fine, running a lot of build steps, but then

$ python -c "import torch; from torch_scatter import scatter_max"
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/afs/csail.mit.edu/u/d/davidbau/git/pytorch_scatter/torch_scatter/__init__.py", line 3, in <module>
    from .mul import scatter_mul
  File "/afs/csail.mit.edu/u/d/davidbau/git/pytorch_scatter/torch_scatter/mul.py", line 3, in <module>
    from torch_scatter.utils.ext import get_func
  File "/afs/csail.mit.edu/u/d/davidbau/git/pytorch_scatter/torch_scatter/utils/ext.py", line 5, in <module>
    import torch_scatter.scatter_cuda
ImportError: /afs/csail.mit.edu/u/d/davidbau/git/pytorch_scatter/torch_scatter/scatter_cuda.cpython-36m-x86_64-linux-gnu.so: undefined symbol: __cudaPopCallConfiguration

rusty1s · 2018-12-24T12:46:52Z

Does the PyTorch CUDA version torch.version.cuda match the one of the system?

Can you make sure the official PyTorch extensions run on your machine?

davidbau · 2018-12-24T13:39:46Z

Thanks for the tip! The system has multiple nvcc and even though the version on the PATH matched torch.version.cuda, it looks like setup.py was picking up the wrong one. It looks like the torch extension API is looking for CUDA_HOME. So doing this before pip install or python setup.py install fixes the problem (it's not necessary for the right nvcc to show up on PATH or for the right include to be on CPATH - everything is keyed off of CUDA_HOME):

export CUDA_HOME=/usr/local/cuda-9.0

Another problem I was having while testing configurations was failing to sufficiently clean out binaries built with the wrong compiler. For others following along, I found this was enough to clean things:

pip uninstall torch-scatter
rm -rf build torch_scatter/*.so; python setup.py clean # within the torch_scatter sources

And then use --no-cache-dir when reinstalling with the proper environment variable.

export CUDA_HOME=/usr/local/cuda-9.0
pip install --no-cache-dir torch-scatter

Problem solved!

rayush7 · 2018-12-24T16:28:26Z

@rusty1s Thanks for pointing out to check the PyTorch CUDA version and the Cuda version installed on the system.

I had Cuda-9.2 installed on my system and with PyTorch 1.0, Cuda-9.0 was getting installed by default. Therefore there was a mismatch.

I am using Ubuntu 18.04, therefore I changed to CUDA-10 instead of CUDA-9 in order to avoid mismatch of nvcc compiler & gcc/g++ compilers and then installed Pytorch for CUDA-10.0.

After that I followed the same steps as mentioned by @davidbau (with change of cuda-10.0) and everything worked.

Thank you both of you.

rusty1s · 2018-12-25T02:28:28Z

Cool that it works now :)

zc-alexfan · 2019-06-12T21:01:44Z

Thanks for the tip! The system has multiple nvcc and even though the version on the PATH matched torch.version.cuda, it looks like setup.py was picking up the wrong one. It looks like the torch extension API is looking for CUDA_HOME. So doing this before pip install or python setup.py install fixes the problem (it's not necessary for the right nvcc to show up on PATH or for the right include to be on CPATH - everything is keyed off of CUDA_HOME):
export CUDA_HOME=/usr/local/cuda-9.0
Another problem I was having while testing configurations was failing to sufficiently clean out binaries built with the wrong compiler. For others following along, I found this was enough to clean things:
pip uninstall torch-scatter
rm -rf build torch_scatter/*.so; python setup.py clean # within the torch_scatter sources
And then use --no-cache-dir when reinstalling with the proper environment variable.
export CUDA_HOME=/usr/local/cuda-9.0
pip install --no-cache-dir torch-scatter
Problem solved!

Thanks for the great response! I have the same problem here, but I am still getting stuck. I am wondering if I could get some suggestions.

My PyTorch was installed on a Conda environment. Inside the environment, the CUDA version is:

>>> import torch
>>> torch.version.cuda
'9.0.176'

My cudatoolkit's gives:

➜  bin ./nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2018 NVIDIA Corporation
Built on Tue_Jun_12_23:07:04_CDT_2018
Cuda compilation tools, release 9.2, V9.2.148

Yes, there is a mismatch. I tried your suggestion with CUDA_HOME=/path/to/cudatoolkit9.2, but the same error occurs. I am wondering if I should let CUDA_HOME be the path of CUDA that came with conda install pytorch torchvision cudatoolkit=9.0 -c pytorch when I installed PyTorch?

However, I cannot find the path of that cudatoolkit.

Thanks in Advanced.

davidbau · 2019-06-24T21:49:19Z

No, you want nvcc for 9.0 not 9.2 since that's the version of cuda you're running inside your environment. Since nvcc doesn't get included in the conda cuda packages, you need to just install it separately. E.g., on ubuntu apt-get install cuda-9-0 will end up installing in /usr/local/cuda-9.0. This can happily coexist with your 9.2 installlation. Then you can do the following to put cuda-9.0 on CUDA_HOME automaticallly within your conda environment when it is activated:

# Set up CUDA_HOME to set itself up correctly on every source activate
# https://stackoverflow.com/questions/31598963
mkdir -p ~/.conda/envs/${ENV_NAME}/etc/conda/activate.d
echo "export CUDA_HOME=/usr/local/cuda-9.0" > \
    ~/.conda/envs/${ENV_NAME}/etc/conda/activate.d/CUDA_HOME.sh

rusty1s closed this as completed Dec 25, 2018

mcarilli mentioned this issue Mar 13, 2019

bugs after apex installation NVIDIA/apex#187

Open

irfanICMLL mentioned this issue Dec 9, 2019

undefined symbol: __cudaPopCallConfiguration irfanICMLL/structure_knowledge_distillation#12

Closed

sfzhang15 mentioned this issue Jan 8, 2020

ImportError: /root/atss/ATSS/atss_core/_C.cpython-37m-x86_64-linux-gnu.so: undefined symbol: __cudaPopCallConfiguration sfzhang15/ATSS#17

Closed

skamano mentioned this issue Feb 21, 2020

ImportError: numpy.core.multiarray failed to import xiumingzhang/GenRe-ShapeHD#54

Open

RenYurui mentioned this issue Mar 6, 2020

ImportError RenYurui/StructureFlow#23

Open

yuweihao mentioned this issue Mar 16, 2020

facing an issue in training yuweihao/KERN#17

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Installation issue: undefined symbol: __cudaPopCallConfiguration #19

Installation issue: undefined symbol: __cudaPopCallConfiguration #19

davidbau commented Dec 17, 2018

rayush7 commented Dec 23, 2018

davidbau commented Dec 24, 2018

rusty1s commented Dec 24, 2018 •

edited

davidbau commented Dec 24, 2018

rusty1s commented Dec 24, 2018

davidbau commented Dec 24, 2018 •

edited

rayush7 commented Dec 24, 2018

rusty1s commented Dec 25, 2018

zc-alexfan commented Jun 12, 2019

davidbau commented Jun 24, 2019

Installation issue: undefined symbol: __cudaPopCallConfiguration #19

Installation issue: undefined symbol: __cudaPopCallConfiguration #19

Comments

davidbau commented Dec 17, 2018

rayush7 commented Dec 23, 2018

davidbau commented Dec 24, 2018

rusty1s commented Dec 24, 2018 • edited

davidbau commented Dec 24, 2018

rusty1s commented Dec 24, 2018

davidbau commented Dec 24, 2018 • edited

rayush7 commented Dec 24, 2018

rusty1s commented Dec 25, 2018

zc-alexfan commented Jun 12, 2019

davidbau commented Jun 24, 2019

rusty1s commented Dec 24, 2018 •

edited

davidbau commented Dec 24, 2018 •

edited