[MPS] Fixes for LSTM. #94889
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/94889. ✅ No failures as of commit 67edca8.
- The backward pass must pass an explicit bias tensor of zeros if none is given to the op; otherwise the bias gradient is not calculated.
- Fixed the bias tensor mistakenly being overwritten with zeros.
- Fixed a crash when the lstm op is called with `has_biases` set to `false` (see the sketch below). The change accounts for the changed shape of the input params `TensorList` depending on the bias flag.
- Raise an error for projections in LSTM, which are currently not supported.
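For context, a minimal sketch of the case this fixes: an LSTM built with `bias=False` run on the MPS backend, then backpropagated through. The tensor shapes and the device-availability check are illustrative assumptions, not part of the original report.

```python
import torch

# Assumes an Apple-silicon machine; fall back to CPU otherwise.
device = "mps" if torch.backends.mps.is_available() else "cpu"

# bias=False exercises the has_biases=false path that previously crashed,
# because the input params TensorList then has fewer entries per layer.
lstm = torch.nn.LSTM(input_size=8, hidden_size=16, num_layers=2, bias=False).to(device)

x = torch.randn(5, 3, 8, device=device, requires_grad=True)  # (seq, batch, feature)
out, (h, c) = lstm(x)

# The backward pass previously produced no bias gradient unless an explicit
# zero bias tensor was supplied to the op internally.
out.sum().backward()
print(x.grad.shape)
```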
Looks good
@pytorchbot merge -f "All tests are green."
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).
* [MPS] Fixes for LSTM. (#94889)
  - The backward pass must pass an explicit bias tensor of zeros if none is given to the op; otherwise the bias gradient is not calculated.
  - Fixed the bias tensor mistakenly being overwritten with zeros.
  - Fixed a crash when the lstm op is called with `has_biases` set to `false`. The change accounts for the changed shape of the input params `TensorList` depending on the bias flag.
  Fixes #ISSUE_NUMBER
  Pull Request resolved: #94889
  Approved by: https://github.com/DenisVieriu97

* [MPS] LogSoftmax numerical stability (#95091)
  Fixes #94043
  Calculations are now consistent with the numerically stable formula and with the CPU backend:
  $\mathrm{LogSoftmax}(X, \dim) = X - \max(X, \dim) - \log(\mathrm{sum}(\exp(X - \max(X, \dim)), \dim))$
  @malfet
  Pull Request resolved: #95091
  Approved by: https://github.com/malfet, https://github.com/kulinseth

* [MPS] Cast int64 to int32 for reduction ops (#95231)
  - Warn when converting int64 for reduction ops.
  - Use the cast tensor for reduction sum on trace.
  - Unblock trace from running.
  Pull Request resolved: #95231
  Approved by: https://github.com/razarmehr

* [MPS] Fix Float16 issue with Reduction ops for macOS 12 (#94952)
  This fixes the issue with `__rdiv__` with float16.
  Pull Request resolved: #94952
  Approved by: https://github.com/kulinseth

Co-authored-by: alexdremov <dremov.me@gmail.com>
Co-authored-by: Denis Vieriu <dvieriu@apple.com>
Co-authored-by: Ramin Azarmehr <razarmehr@apple.com>
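To illustrate the LogSoftmax stability fix, here is a minimal sketch of the max-subtraction trick the formula above describes, written with plain PyTorch ops rather than the MPS kernel itself; the function name and test values are illustrative assumptions.

```python
import torch

def log_softmax_stable(x: torch.Tensor, dim: int) -> torch.Tensor:
    # Subtracting the per-dim max before exponentiating keeps exp() from
    # overflowing, without changing the mathematical result.
    m = x.max(dim=dim, keepdim=True).values
    shifted = x - m
    return shifted - shifted.exp().sum(dim=dim, keepdim=True).log()

x = torch.tensor([[1000.0, 1001.0, 1002.0]])  # naive exp(x) would overflow to inf
print(log_softmax_stable(x, dim=-1))
print(torch.log_softmax(x, dim=-1))           # matches the stable version
```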
This reverts commit 54ebf25.