fix embedding_backward_dense decomp with broadcasting #95499

bdhirsh · 2023-02-24T20:29:08Z

cc @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @desertfire @ngimel for another decomp fix. For this one, I tried auditing the CPU and CUDA kernels for embedding_backward_dense and just could not figure out where the unsqueeze(1) was supposed to be coming from. In the failing example, our tensor shapes are (2, 4, 3) and (2, 4), and so I just assumed that the existing decomp had a typo - we should be unsqueezing the last dim, instead of dim index 1. That fixes the repro, and the existing decomp + meta tests appear to be passing.

Stack from ghstack (oldest at bottom):

cc @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @desertfire

[ghstack-poisoned]

pytorch-bot · 2023-02-24T20:29:11Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/95499

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 4919634:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 411189909c064677d8d2783af7b19bbb80577ee0 Pull Request resolved: #95499

ezyang

lgtm. It might be worth figuring out why the OpInfo inputs didn't exercise this.

ngimel

Yeah this is right, thanks for the fix @bdhirsh

Fixes #95182 cc ngimel for another decomp fix. For this one, I tried auditing the CPU and CUDA kernels for `embedding_backward_dense` and just could not figure out where the `unsqueeze(1)` was supposed to be coming from. In the failing example, our tensor shapes are `(2, 4, 3)` and `(2, 4)`, and so I just assumed that the existing decomp had a typo - we should be unsqueezing the last dim, instead of dim index 1. That fixes the repro, and the existing decomp + meta tests appear to be passing. cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire [ghstack-poisoned]

ghstack-source-id: dfea7c497fde1ad0c3030bc7bc6fe6e790de8954 Pull Request resolved: #95499

Fixes #95182 cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire ngimel for another decomp fix. For this one, I tried auditing the CPU and CUDA kernels for `embedding_backward_dense` and just could not figure out where the `unsqueeze(1)` was supposed to be coming from. In the failing example, our tensor shapes are `(2, 4, 3)` and `(2, 4)`, and so I just assumed that the existing decomp had a typo - we should be unsqueezing the last dim, instead of dim index 1. That fixes the repro, and the existing decomp + meta tests appear to be passing. cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire [ghstack-poisoned]

Fixes pytorch/pytorch#95182 Pull Request resolved: pytorch/pytorch#95499 Approved by: https://github.com/ezyang, https://github.com/ngimel

…h#95499)" This reverts commit ddd6b53.

Fixes pytorch#95182 Pull Request resolved: pytorch#95499 Approved by: https://github.com/ezyang, https://github.com/ngimel

fix embedding_backward_dense decomp with broadcasting

cec2d0f

[ghstack-poisoned]

bdhirsh mentioned this pull request Feb 24, 2023

fix primtorch handling for sub.scalar with alpha and float64 arg #95421

Closed

bdhirsh mentioned this pull request Feb 24, 2023

better error message when functionalization cant handle op #95392

Closed

bdhirsh added a commit that referenced this pull request Feb 24, 2023

fix embedding_backward_dense decomp with broadcasting

9bb23a8

ghstack-source-id: 411189909c064677d8d2783af7b19bbb80577ee0 Pull Request resolved: #95499

github-actions bot added the module: dynamo label Feb 24, 2023

github-actions bot requested review from albanD, antoniojkim, Chillee, ezyang, jbschlosser, miladm, SherlockNoMad, voznesenskym and wconstab February 24, 2023 20:29

bdhirsh mentioned this pull request Feb 24, 2023

[PT2.0][compile] embedding scale_grad_by_freq=True causes broadcast error #95182

Closed

ezyang approved these changes Feb 24, 2023

View reviewed changes

ngimel approved these changes Feb 24, 2023

View reviewed changes

bdhirsh mentioned this pull request Feb 24, 2023

fix spurious aot autograd warning #95521

Closed

bdhirsh added a commit that referenced this pull request Feb 24, 2023

fix embedding_backward_dense decomp with broadcasting

77cde89

ghstack-source-id: dfea7c497fde1ad0c3030bc7bc6fe6e790de8954 Pull Request resolved: #95499

bdhirsh added the release notes: composability release notes category label Feb 27, 2023

pytorchmergebot added the Merged label Feb 28, 2023

pytorchmergebot closed this in ddd6b53 Feb 28, 2023

msaroufim mentioned this pull request Mar 3, 2023

Remove mention of dynamo.optimize() in docs #96002

Closed

pruthvistony added a commit to ROCm/pytorch that referenced this pull request May 2, 2023

Revert "fix embedding_backward_dense decomp with broadcasting (pytorc…

52859b8

…h#95499)" This reverts commit ddd6b53.

facebook-github-bot deleted the gh/bdhirsh/384/head branch June 8, 2023 15:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix embedding_backward_dense decomp with broadcasting #95499

fix embedding_backward_dense decomp with broadcasting #95499

bdhirsh commented Feb 24, 2023 •

edited

pytorch-bot bot commented Feb 24, 2023 •

edited

ezyang left a comment

ngimel left a comment

fix embedding_backward_dense decomp with broadcasting #95499

fix embedding_backward_dense decomp with broadcasting #95499

Conversation

bdhirsh commented Feb 24, 2023 • edited

pytorch-bot bot commented Feb 24, 2023 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/95499

✅ No Failures

ezyang left a comment

Choose a reason for hiding this comment

ngimel left a comment

Choose a reason for hiding this comment

bdhirsh commented Feb 24, 2023 •

edited

pytorch-bot bot commented Feb 24, 2023 •

edited