Compilation of custom operations failing on TF 2.15/CUDA 12 #1523
Comments
#1513 is also related. Can you post the full output?
Can you try to run on CPU only ( |
Please find here the full output of the compilation on CUDA. Answering your questions:
Should the |
I assume you tested that with |
Note, the main error is
I guess they moved/renamed that. I see in other TF code that it is Similarly, in our |
Hi, the compilation of NativeLstm2.cc is failing with TF 2.15/CUDA 12, whereas it did not fail with TF 2.13/CUDA 11. A colleague of mine is also having similar issues when compiling GetCtcFsaFastBwOp.cc.
The compiler throws many errors, but most of them are rather "silly", like:
error: expected a ";"
error: function "Ndarray_get_n_total_elements" has already been defined
error: name followed by "::" must be a class or namespace name
This leads me to think that the nvcc compiler might be doing weird stuff here, and consequently that the operations don't work with CUDA 12 as they are. I was also told that TF might play a role here, so I also posted the TF versions. Could there be a redundant file? Maybe incompatible CUDA versions?
nvcc version where the compilation works:
nvcc version where the compilation doesn't work:
Let me know if I can provide any further details. Thanks in advance!
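For anyone comparing a working and a failing environment, the following is a minimal sketch of how the relevant version information could be collected in one go. It is not part of the project. The helper names (nvcc_version, tf_build_info, collect_versions) are hypothetical, and it assumes that nvcc is on the PATH and that the installed TensorFlow provides tf.sysconfig.get_build_info().

```python
import shutil
import subprocess


def nvcc_version():
    """Return the output of `nvcc --version`, or None if nvcc is not on PATH."""
    nvcc = shutil.which("nvcc")
    if nvcc is None:
        return None
    result = subprocess.run(
        [nvcc, "--version"], capture_output=True, text=True, check=False
    )
    return result.stdout.strip()


def tf_build_info():
    """Return the TF version and the CUDA/cuDNN versions TF was built against.

    Returns None if TensorFlow cannot be imported in this environment.
    """
    try:
        import tensorflow as tf
    except Exception:  # missing or broken TF install
        return None
    info = tf.sysconfig.get_build_info()
    return {
        "tf_version": tf.__version__,
        # These keys are only present in GPU builds, hence .get().
        "cuda_version": info.get("cuda_version"),
        "cudnn_version": info.get("cudnn_version"),
    }


def collect_versions():
    """Gather both pieces of information into one dict for easy comparison."""
    return {"nvcc": nvcc_version(), "tensorflow": tf_build_info()}


if __name__ == "__main__":
    for key, value in collect_versions().items():
        print(f"{key}: {value}")
```

Running this in both environments and diffing the output would show whether the CUDA version reported by nvcc matches the one TensorFlow was built against.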