Build with CUDA support fails with GCC >= 10.3 #48890
Comments
@pwuertz |
Not really, sorry, since I'm building everything at a non-isolated system level and require Python 3.9 and CUDA 11.2 for other reasons. As I said before, building the configuration above with GCC 9 as the NVCC host compiler works fine. I just wanted to make anyone looking for this aware of the problem and point to the upstream bug report at GCC. The problem seems to be pretty much identified at this point, so I suppose we'll have to wait for an upstream fix.
@pwuertz In the meantime, can you build in our Docker devel container?
@bhack I'm fine with my GCC-9 based build, thanks.
I know this isn't strictly how bugs should be fixed, but if you're desperate and under a deadline like me, comment out that block of code. The segmentation error then disappears.
See also tensorflow/tensorflow#48890 and https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100102 git-svn-id: file:///srv/repos/svn-community/svn@936348 9fca08f4-af9d-4005-b8df-a31f2cc04f65
@pwuertz, this is purely a GCC issue, right? Or are you suggesting that there is some workaround possible in the TF source code?
@sanjoy Yes, probably a pure GCC issue. I have no suggestions on how to handle this on the TensorFlow end other than monitoring what's happening upstream. A warning emitted by the TensorFlow build for known-bad compiler versions would be nice, but I don't know how much work that would be.
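The warning suggested above could look roughly like the following Python sketch. It is not part of the TensorFlow build; the function names are hypothetical, and the "known-bad" ranges are an assumption inferred from this thread (regression starting with GCC 10.3, fixed in 10.4, 11.2 and 12), not an official list.

```python
import re
import subprocess
import sys


def parse_gcc_version(text):
    """Extract (major, minor, patch) from output such as `gcc -dumpfullversion`."""
    match = re.search(r"(\d+)\.(\d+)(?:\.(\d+))?", text)
    if match is None:
        raise ValueError(f"no version number found in {text!r}")
    major, minor, patch = match.groups()
    return int(major), int(minor), int(patch or 0)


def is_known_bad_gcc(version):
    """Return True for versions assumed to be affected by GCC bug 100102.

    Assumption from this thread: 10.3.x is affected (fixed in 10.4),
    and 11.x before 11.2 is affected. Everything else is treated as fine.
    """
    major, minor, _patch = version
    if major == 10:
        return minor == 3
    if major == 11:
        return minor < 2
    return False


def warn_if_known_bad(gcc="gcc"):
    """Query the given compiler and print a warning if it looks affected."""
    result = subprocess.run(
        [gcc, "-dumpfullversion"], capture_output=True, text=True, check=True
    )
    version = parse_gcc_version(result.stdout)
    if is_known_bad_gcc(version):
        print(
            f"WARNING: {gcc} {'.'.join(map(str, version))} may fail as an NVCC "
            "host compiler, see https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100102",
            file=sys.stderr,
        )
    return version

# Example (requires the compiler to be installed): warn_if_known_bad("gcc")
```

The check only looks at the version number, so a distribution that backported the fix into a 10.3 or 11.1 package would still trigger the warning.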
@pwuertz How do you change the NVCC host compiler to gcc-9? I installed gcc-9, but nvcc still uses GCC 10, I think. Thanks.
When you |
The GCC project has committed a patch: https://gcc.gnu.org/git/gitweb.cgi?p=gcc.git;h=5357ab75dedef403b0eebf9277d61d1cbeb5898f
This should be fixed for newer versions of GCC, as per the above bug report. "Fixed for GCC 10.4, 11.2 and 12."
This issue has been automatically marked as stale because it has no recent activity. It will be closed if no further activity occurs. Thank you.
Closing as stale. Please reopen if you'd like to work on this further.
System information
Describe the problem
Building TensorFlow with CUDA support using GCC 10.3 fails with the following error:
Apparently, this is a regression starting with GCC 10.3 (default compiler on Ubuntu 21.04) when using gcc in conjunction with nvcc. Here is the upstream bug report: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100102
Installing gcc-9 and using it as the NVCC host compiler in configure still works.
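For anyone wondering how to point the build at gcc-9, here is a minimal Python sketch. It assumes that TensorFlow's configure script reads the `GCC_HOST_COMPILER_PATH` environment variable for the NVCC host compiler and that the compiler binary is named `gcc-9` on your system; the helper name is hypothetical, and you should check both assumptions against your TensorFlow version.

```python
import os
import shutil


def env_with_host_compiler(compiler="gcc-9", base_env=None, which=shutil.which):
    """Return a copy of the environment with the NVCC host compiler set.

    `which` is injectable so the lookup can be tested without gcc-9 installed.
    """
    path = which(compiler)
    if path is None:
        raise FileNotFoundError(f"{compiler} not found on PATH")
    env = dict(os.environ if base_env is None else base_env)
    # Assumption: configure uses this variable to pick the host compiler.
    env["GCC_HOST_COMPILER_PATH"] = path
    return env

# Example, run from the TensorFlow source directory:
#   import subprocess
#   subprocess.run(["./configure"], env=env_with_host_compiler(), check=True)
```

When invoking nvcc directly rather than through the TensorFlow build, the equivalent knob is nvcc's `-ccbin` option, which takes the path to the host compiler.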