Simple graph invoking tf.complex() doesn't work on GPU, but works on CPU #38443
Comments
@isaacgerg |
@Saduf2019 |
As mentioned in the error message
This is why you are running into the error on Windows, @isaacgerg |
@gowthamkpr Why doesn't the driver perform the PTX compilation then? The operation is simple: a FOIL multiply of complex numbers. |
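For readers unfamiliar with the term, the FOIL (first, outer, inner, last) multiply mentioned above expands (a + bi)(c + di) into (ac - bd) + (ad + bc)i. A minimal pure-Python sketch of that arithmetic follows; the function name `foil_multiply` is an illustrative assumption and is not taken from the issue's code or from TensorFlow.

```python
def foil_multiply(a_re, a_im, b_re, b_im):
    """Multiply (a_re + a_im*i) by (b_re + b_im*i) via the FOIL expansion.

    Hypothetical helper for illustration only; returns (real, imag).
    """
    first = a_re * b_re   # real * real
    outer = a_re * b_im   # real * imag, contributes to the imaginary part
    inner = a_im * b_re   # imag * real, contributes to the imaginary part
    last = a_im * b_im    # imag * imag, and i*i = -1, so it is subtracted
    return first - last, outer + inner


# Cross-check against Python's built-in complex type.
re_part, im_part = foil_multiply(1.0, 2.0, 3.0, 4.0)
assert complex(re_part, im_part) == complex(1.0, 2.0) * complex(3.0, 4.0)
```

This only illustrates the arithmetic; it says nothing about how the GPU kernel for the operation is actually generated or compiled.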
I think the message from redzone_allocator.cc is a red herring and that the |
Hi @sanjoy, the full log is in the first post. Let me know if you need anything else. |
@sanjoy Any update on this? How can I help? |
Hi @isaacgerg, it is quite difficult to say much from
if that's all the logs say. Can you try running with |
@isaacgerg |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you. |
Closing as stale. Please reopen if you'd like to work on this further. |
Environment: Windows 10, Python 3.6, TensorFlow 2.1.0-rc2
The code below is a minimal example that reproduces the bug. It fails with CUDA_ERROR_LAUNCH_FAILED when run on the GPU, but runs without issue on the CPU. I suspect the problem lies in the tensor returned by tf.complex(): if I do not use that function, the issue seems to go away.
A small working example shows the error I get, along with working code to reproduce it on Windows 10.
EDIT 1: Simplified the code further.