Misaligned Address / Lane User Stack Overflow in cunn_SpatialSoftmax
#56325
Comments
High priority for a crash
After a quick look, it seems the failure appears in the call to `ilpReduce` in pytorch/aten/src/ATen/native/cuda/SoftMax.cu (line 672 in 40483ac): `ilpReduce` should be called with `grad_output_shift`, and not `shift`.
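To illustrate why passing the wrong shift can produce a misaligned address, here is a minimal host-side sketch. It assumes that each shift is the number of elements between a buffer's start and the previous vector-aligned boundary, and that vectorized loads begin at that boundary. The names `kAlignBytes`, `element_shift`, and `vector_start_is_aligned` are hypothetical and are not taken from SoftMax.cu.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Assumed vector-load width in bytes (hypothetical value for this sketch).
constexpr std::size_t kAlignBytes = 16;

// Number of elements between the previous kAlignBytes boundary and ptr.
// Each buffer has its own shift, because each buffer can start at a
// different offset from an aligned address.
template <typename T>
std::size_t element_shift(const T* ptr) {
  static_assert(kAlignBytes % sizeof(T) == 0,
                "element size must divide the vector width");
  return (reinterpret_cast<std::uintptr_t>(ptr) % kAlignBytes) / sizeof(T);
}

// True when stepping back `shift` elements from ptr lands on an aligned
// address, i.e. when vectorized loads starting there would be legal.
// Only integer address arithmetic is performed; nothing is dereferenced.
template <typename T>
bool vector_start_is_aligned(const T* ptr, std::size_t shift) {
  const std::uintptr_t addr = reinterpret_cast<std::uintptr_t>(ptr);
  const std::size_t bytes = shift * sizeof(T);
  assert(bytes <= addr);
  return (addr - bytes) % kAlignBytes == 0;
}
```

Under these assumptions, a gradient buffer is only guaranteed an aligned vector start when it is paired with the shift computed from its own address. Pairing it with the shift of a different buffer (for example the forward output) is aligned only when both buffers happen to share the same offset, which would match the intermittent "misaligned address" symptom reported here.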
Yup, can confirm this fixes the issue on V100.
Fixed in #56304
Summary:
CC ngimel ptrblck
ref: pytorch#56325
Pull Request resolved: pytorch#56403
Reviewed By: mruberry
Differential Revision: D27866625
Pulled By: ngimel
fbshipit-source-id: 9dff0e9749f8de57fac6a653f685c14854611a02
🐛 Bug
Reported in the forum by cameronb (thanks for reporting this issue!)
To Reproduce
Original error message:
`$pc` info:
After rebuilding with `-g -G`, the error changes to:
Backtrace:
I'm currently unsure whether the stack overflow is caused by the debug flags or whether it's the real issue.
Anyway, both issues point to `cunn_SpatialSoftMax`.
Environment
@eqy would you like to take a shot at it?
cc @ezyang @gchanan @zou3519 @bdhirsh @jbschlosser @anjali411 @ngimel