
compile lib_kernel/lib_fast_nms/fast_nms using GPU V100 #35

Closed
emedinac opened this issue Jun 27, 2018 · 5 comments

emedinac commented Jun 27, 2018

Hi,
I tried to compile this code on two V100 GPUs using sm_70, and I get this warning during compilation and this error when I run test.py:

/usr/local/cuda-9.0/bin/../targets/x86_64-linux/include/sm_30_intrinsics.hpp(213): here was declared deprecated ("__shfl_down() is not valid on compute_70 and above, and should be replaced with __shfl_down_sync().To continue using __shfl_down(), specify virtual architecture compute_60 when targeting sm_70 and above, for example, using the pair of compiler options: -arch=compute_60 -code=sm_70.")
NotFoundError: /home/edgar/light_head_rcnn/lib/lib_kernel/lib_fast_nms/fast_nms.so: undefined symbol: _ZN10tensorflow7strings6StrCatERKNS0_8AlphaNumE

Also, when I use -arch=compute_60 -code=sm_70, I get this warning during compilation and the same error when I run test.py:

/usr/local/cuda/bin/../targets/x86_64-linux/include/sm_30_intrinsics.hpp(213): here was declared deprecated ("__shfl_down() is deprecated in favor of __shfl_down_sync() and may be removed in a future release (Use -Wno-deprecated-declarations to suppress this warning).")
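
For reference, the warning itself points at the proper fix on sm_70: replace the __shfl_down() calls in the kernel source with __shfl_down_sync(). A minimal warp-reduction sketch of the pattern (hypothetical code, not the actual fast_nms kernel):

// Hypothetical warp-sum reduction illustrating the replacement; adapt to the real kernel.
__device__ float warp_reduce_sum(float val) {
    // 0xffffffff = full-warp mask: all 32 lanes take part in the shuffle (CUDA 9+).
    for (int offset = warpSize / 2; offset > 0; offset /= 2)
        val += __shfl_down_sync(0xffffffff, val, offset);  // was: __shfl_down(val, offset)
    return val;
}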

The compile commands are:

CUDA_PATH=/usr/local/cuda-9.0/
nvcc -std=c++11 -c -o nms_op.cu.o nms_op.cu.cc \
	-I $TF_INC -D GOOGLE_CUDA=1 -x cu -Xcompiler -fPIC -arch=compute_60 -code=sm_70 --expt-relaxed-constexpr -Wno-deprecated-declarations

bl0 commented Jul 2, 2018

After a long time of debugging, I found the solution:
Edit the file /src/detection/lib/lib_kernel/lib_fast_nms/make.sh, replace -D_GLIBCXX_USE_CXX11_ABI=0 with -D_GLIBCXX_USE_CXX11_ABI=1, and recompile; the annoying problem disappears.

g++ -std=c++11 -shared -D_GLIBCXX_USE_CXX11_ABI=1 -o fast_nms.so nms_op.cc \
        nms_op.cu.o -I $TF_INC -fPIC -lcudart -L $CUDA_PATH/lib64 -L$TF_LIB -ltensorflow_framework -I$TF_INC/external/nsync/public
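
The undefined tensorflow::strings::StrCat symbol is the typical signature of a C++11 string ABI mismatch: a Conda or source-built TensorFlow is usually compiled with the new ABI, while the shipped make.sh passes -D_GLIBCXX_USE_CXX11_ABI=0, so the op references a differently mangled symbol than the one libtensorflow_framework.so exports. A standalone illustration of the mangling difference (hypothetical file name, not part of this repo):

// abi_demo.cc -- functions returning std::string get an extra [abi:cxx11] tag
// in their mangled name when built with the new ABI.
#include <string>

std::string make_greeting() { return "hi"; }

// Build twice and compare the exported symbol:
//   g++ -D_GLIBCXX_USE_CXX11_ABI=0 -c abi_demo.cc && nm abi_demo.o | grep make_greeting
//     -> _Z13make_greetingv
//   g++ -D_GLIBCXX_USE_CXX11_ABI=1 -c abi_demo.cc && nm abi_demo.o | grep make_greeting
//     -> _Z13make_greetingB5cxx11v
// The same happens to tensorflow::strings::StrCat, so both sides must agree on the flag.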

emedinac commented Jul 4, 2018

It worked, thanks. Also, I recommend working as root, because I initially installed TensorFlow using Conda.

fay0505 commented Sep 12, 2018

@bl0 Hello, your solution worked, thanks! Can you explain why it works?

bl0 commented Sep 12, 2018

I just searched the TensorFlow issue tracker. The following page may help:
tensorflow/tensorflow#20899 (comment)

fay0505 commented Sep 12, 2018

@bl0 Thanks!
