-
-
Notifications
You must be signed in to change notification settings - Fork 213
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Grads algebra on GPU causes scalar getindex is disallowed
error
#971
Comments
Generally, it is better to use grads such that you extract the gradients explicitly. That is to say, doing |
Looks like the problem is the check to make sure the |
We can't guarantee the order in |
That was discussed in the original PR. The implementation does not rely on ordering. This is bug that can be fixed, and gradient algebra with many parameters gets verbose. |
We can always extend it to multiple parameters. I'd rather stick with a cleaner implementation. |
Hi! I was trying to do some gradient accumulation, but
Grads
algebra seems to cause error on GPU.Am I missing something or is there indeen an issue?
I was using it in the context of UNet model training, but below is MWE.:
Error:
The text was updated successfully, but these errors were encountered: