Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ImportError: undefined symbol: __cudaPopCallConfiguration #48

Open
rajarshighoshal opened this issue Jun 2, 2019 · 10 comments
Open

ImportError: undefined symbol: __cudaPopCallConfiguration #48

rajarshighoshal opened this issue Jun 2, 2019 · 10 comments

Comments

@rajarshighoshal
Copy link

I'm new to deep learning stuff and I walk through the readme till the Execute step. Where I'm getting this error. Can anyone please guide me what I'm doing wrong?
image

@pfjaeger
Copy link
Member

pfjaeger commented Jun 2, 2019

See here #41 . At the moment you have to downgrade to torch=0.4.1 . Sorry for that.

@rajarshighoshal
Copy link
Author

rajarshighoshal commented Jun 3, 2019

I am using pytorch version 0.4.1 still getting this error!
image

@alexfoo-dw
Copy link

@rajarshighoshal

I downgraded from CUDA 10.0 to 9.0 and ran these commands from the medicaldetectiontoolkit directory again after downgrading to torch 0.4.1. Fixed my import error.

cd cuda_functions/nms_xD/src/cuda/
nvcc -c -o nms_kernel.cu.o nms_kernel.cu -x cu -Xcompiler -fPIC -arch=[arch]
cd ../../
python build.py
cd ../

cd cuda_functions/roi_align_xD/roi_align/src/cuda/
nvcc -c -o crop_and_resize_kernel.cu.o crop_and_resize_kernel.cu -x cu -Xcompiler -fPIC -arch=[arch]
cd ../../
python build.py
cd ../../

@rajarshighoshal
Copy link
Author

rajarshighoshal commented Jun 6, 2019 via email

@sophie-isobel
Copy link

@rajarshighoshal did changing to cuda 9 work for you?

I am getting a similar Import Error: undefined symbol: __cudaRegisterFatBinaryEnd

!nvidia-smi returns that I have cuda version 10 and tesla K80 GPU (note i am using google colab).

I have specified arch='sm_37' for the K80 gpu and although I haven't downgraded the cuda version from 10 to 9 as @alexfoodw suggested, I can see that the torch.version.cuda is '9.0.176' for the torch ==0.4.1 I have installed, so the torch should be using cuda 9.

Would you know why I am having this issue?

@rajarshighoshal
Copy link
Author

rajarshighoshal commented Aug 14, 2019 via email

@sophie-isobel
Copy link

@rajarshighoshal so I changed to cuda 9 and also changed the default gcc version using the following code, and I am no longer getting this error.

!update-alternatives --remove-all gcc 
!update-alternatives --remove-all g++

!update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-5 20
!update-alternatives --install /usr/bin/g++ g++ /usr/bin/g++-5 20

!update-alternatives --install /usr/bin/cc cc /usr/bin/gcc 30
!update-alternatives --set cc /usr/bin/gcc

!update-alternatives --install /usr/bin/c++ c++ /usr/bin/g++ 30
!update-alternatives --set c++ /usr/bin/g++

@rajarshighoshal
Copy link
Author

rajarshighoshal commented Aug 14, 2019 via email

@heyitsguay
Copy link

heyitsguay commented May 21, 2020

Does this mean medicaldetectiontoolkit will not work with CUDA 10.2 and g++ 7.5.0 with a GTX 1080 (sm_61)? I'm getting the same "undefined symbol: __cudaRegisterFatBinaryEnd" error running python exec.py --mode train on the included toy experiment.

Update: Switched to the torch1x branch and got it working.

@henbucuoshanghai
Copy link

pytorch is 1.1? 1.4?@h @heyitsguay

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants