You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Just like the situation described in this commit message: a98c130, I am unable to link any code that uses parallel_reduce with Kokkos and CUDA, if and only if relocatable device code (RDC) is being used. The linker segfaults:
nvcc error : 'nvlink' died due to signal 11 (Invalid memory reference)
Several standalone example programs and one large library confirm that parallel_reduce+RDC is the problematic combination. Also replicated with many Kokkos commits going back to 2015, unable to find a commit without this behavior.
If this is supposed to work, here is more information to reproduce on Shannon:
This is a known bug in nvcc linker for all current versions of CUDA. A bug report has been filed with NVIDIA and a resolution should soon be available.
I see. Until then, I suppose my best option is to try not to rely on RDC. parallel_reduce is essential to my algorithms, so that has to stay. Thank you.
Just like the situation described in this commit message: a98c130, I am unable to link any code that uses
parallel_reduce
with Kokkos and CUDA, if and only if relocatable device code (RDC) is being used. The linker segfaults:Several standalone example programs and one large library confirm that
parallel_reduce
+RDC is the problematic combination. Also replicated with many Kokkos commits going back to 2015, unable to find a commit without this behavior.If this is supposed to work, here is more information to reproduce on Shannon:
The text was updated successfully, but these errors were encountered: