Fix gradient for min and max #4136

Merged
merged 2 commits into master on Oct 26, 2020
Conversation

@tafsiri (Contributor) commented Oct 26, 2020

Closes #4135
Adds a regression test to guard against #4130 (which appears to already be fixed at HEAD as of when that issue was reported).
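For context, a minimal sketch of what such a regression check could look like (the tensor values, shape, axis, and expected output below are illustrative assumptions, not the PR's actual test case):

```ts
import * as tf from '@tensorflow/tfjs';

// Illustrative regression check: the gradient of a max reduction should route
// the upstream gradient (ones by default) to the positions that attained the max.
const x = tf.tensor2d([[1, 5], [3, 2]]);
const dx = tf.grad((t: tf.Tensor) => tf.max(t, 1))(x);
dx.print(); // expected: [[0, 1], [1, 0]]
```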




@lina128 (Collaborator) left a comment


Thank you Yannick! Can you provide some context here on why we don't need the transpose any more?


@lina128 merged commit 12c4bbf into master on Oct 26, 2020
@tafsiri (Contributor, Author) commented Oct 27, 2020

Good question. There were a few things I noticed.

  • The grad function would itself try to transpose the output if permutedAxes was not null. However, the gradForMinAndMax helper function would also do this internally, so the two transposes effectively cancelled each other out (and the outer transpose was called without permutedAxes, which just seemed odd).
  • The min and max grad functions seemed to be written against the earlier definition of the kernel in the forward func. For example, looking at how the code was at 1.7.4, you can see that the op would sometimes do an internal transpose before passing the input to the backend implementation, yet the original input is what gets passed to the gradient. So I believe the gradient was trying to match the shape of what gets passed to backend.max internally. However, in the new modular kernels there is no op-level transformation and thus no transpose to account for (any internal transposition done within the forward kernel is completely hidden), so the extra 'transposition handling' code becomes unnecessary (see the sketch after this list).
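To make the second point concrete, here is a minimal sketch of a transpose-free gradient written directly against the original input and the forward output. This illustrates the idea rather than the PR's actual gradient code: maxGrad and its parameters are hypothetical names, and axes is assumed to already be normalized to non-negative values. The min gradient would be the same sketch with the mask built from the min output.

```ts
import * as tf from '@tensorflow/tfjs';

// Sketch: gradient of max(x, axes) given the upstream gradient dy and the
// forward output y. No permutedAxes / transpose bookkeeping is needed because
// everything is expressed in terms of the unmodified input x.
function maxGrad(dy: tf.Tensor, x: tf.Tensor, y: tf.Tensor, axes: number[]): tf.Tensor {
  // Re-insert the reduced dimensions as size 1 so y and dy broadcast against x.
  const keptShape = x.shape.map((dim, i) => (axes.includes(i) ? 1 : dim));
  const yKept = tf.reshape(y, keptShape);
  const dyKept = tf.reshape(dy, keptShape);
  // Route the upstream gradient only to the positions that attained the max.
  const mask = tf.cast(tf.equal(x, yKept), 'float32');
  return tf.mul(dyKept, mask);
}
```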

@lina128 (Collaborator) commented Oct 27, 2020

Ah, makes sense!
