-
Notifications
You must be signed in to change notification settings - Fork 406
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cuda Clang spurious test failure in impl_view_accessible #1753
Comments
This is with Clang 6 Cuda 9.0 on Volta. |
Running this thing with |
Workaround looks similar to previous issue on nvcc with permanent block divergence, but can't detect permanent block divergence in this case with clang ... |
Might be that this is only an issue in Clang 6 with Cuda 9 for Volta (which is somewhat beta capability). Need to check again with Cuda 9.2 and Clang 7. |
Not fixed in Clang 7/Cuda 9.1 |
@crtrott is this fixed by your rewrite of the CUDA reduction? |
Yeah but am addressing more performance issues right now. |
I reduced it down to running:
If I remove any of the other 4 tests it doesn't fail reliably anymore.
The text was updated successfully, but these errors were encountered: