New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Inductor][fx pass] Fuse pointwise operators in the post grad #114778
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/114778
Note: Links to docs will display an error until the docs builds have been completed. ✅ You can merge normally! (2 Unrelated Failures)As of commit 1cda761 with merge base a6736ac (): BROKEN TRUNK - The following job failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This pull request was exported from Phabricator. Differential Revision: D51332067 |
This PR needs a
|
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/19b3f641-782f-4f94-a953-3ff9ce2cfa7b Test UI: https://www.internalfb.com/intern/testinfra/testrun/1125900251953016 Network: Up: 67KiB Down: 32KiB (reSessionID-c2a80f26-8227-4f78-89fc-bcbda0ae8353) Jobs completed: 18. Time elapsed: 1:19.8s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 6. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P881792289 ### igctr ### dsnn ### icvr Reviewed By: xuzhao9 Differential Revision: D51332067
a014a7c
to
adb2046
Compare
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/19b3f641-782f-4f94-a953-3ff9ce2cfa7b Test UI: https://www.internalfb.com/intern/testinfra/testrun/1125900251953016 Network: Up: 67KiB Down: 32KiB (reSessionID-c2a80f26-8227-4f78-89fc-bcbda0ae8353) Jobs completed: 18. Time elapsed: 1:19.8s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 6. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P887605070 ### igctr P892987433 ### mai P893109069 ### icvr P893075846 Reviewed By: xuzhao9 Differential Revision: D51332067
This pull request was exported from Phabricator. Differential Revision: D51332067 |
adb2046
to
1adc1a0
Compare
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/19b3f641-782f-4f94-a953-3ff9ce2cfa7b Test UI: https://www.internalfb.com/intern/testinfra/testrun/1125900251953016 Network: Up: 67KiB Down: 32KiB (reSessionID-c2a80f26-8227-4f78-89fc-bcbda0ae8353) Jobs completed: 18. Time elapsed: 1:19.8s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 6. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P887605070 ### igctr P892987433 ### mai P893109069 ### icvr P893075846 ### oc P893109069 Reviewed By: xuzhao9 Differential Revision: D51332067
This pull request was exported from Phabricator. Differential Revision: D51332067 |
1adc1a0
to
f662602
Compare
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/19b3f641-782f-4f94-a953-3ff9ce2cfa7b Test UI: https://www.internalfb.com/intern/testinfra/testrun/1125900251953016 Network: Up: 67KiB Down: 32KiB (reSessionID-c2a80f26-8227-4f78-89fc-bcbda0ae8353) Jobs completed: 18. Time elapsed: 1:19.8s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 6. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P887605070 ### igctr P892987433 ### mai P893109069 ### icvr P893075846 ### oc P893109069 Reviewed By: xuzhao9 Differential Revision: D51332067
This pull request was exported from Phabricator. Differential Revision: D51332067 |
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/19b3f641-782f-4f94-a953-3ff9ce2cfa7b Test UI: https://www.internalfb.com/intern/testinfra/testrun/1125900251953016 Network: Up: 67KiB Down: 32KiB (reSessionID-c2a80f26-8227-4f78-89fc-bcbda0ae8353) Jobs completed: 18. Time elapsed: 1:19.8s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 6. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P887605070 ### igctr P892987433 ### mai P893109069 ### icvr P893075846 ### oc P893109069 Reviewed By: xuzhao9 Differential Revision: D51332067
f662602
to
be3b9d3
Compare
This pull request was exported from Phabricator. Differential Revision: D51332067 |
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/6c5d1d31-e4d1-4865-bf79-1e7ac3b6e051 Test UI: https://www.internalfb.com/intern/testinfra/testrun/1970325050015770 Network: Up: 72KiB Down: 22KiB (reSessionID-44adc8b2-54e9-453a-bd20-710cefefaed1) Jobs completed: 20. Time elapsed: 1:44.6s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 7. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P887605070 ### igctr P892987433 ### mai P893109069 ### icvr P893075846 ### oc P893109069 Reviewed By: xuzhao9 Differential Revision: D51332067
be3b9d3
to
33eccc7
Compare
This pull request was exported from Phabricator. Differential Revision: D51332067 |
From the error log:
It seems you modify fx graph and make the input args point to the node itself, you can check this by:
|
BTW, the PR and diff is out of sync. |
33eccc7
to
0b4c7fa
Compare
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/6c5d1d31-e4d1-4865-bf79-1e7ac3b6e051 Test UI: https://www.internalfb.com/intern/testinfra/testrun/1970325050015770 Network: Up: 72KiB Down: 22KiB (reSessionID-44adc8b2-54e9-453a-bd20-710cefefaed1) Jobs completed: 20. Time elapsed: 1:44.6s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 7. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P887605070 ### igctr P892987433 ### mai P893109069 ### icvr P893075846 ### oc P893109069 Reviewed By: xuzhao9 Differential Revision: D51332067
This pull request was exported from Phabricator. Differential Revision: D51332067 |
0b4c7fa
to
5d7ecfd
Compare
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/6c5d1d31-e4d1-4865-bf79-1e7ac3b6e051 Test UI: https://www.internalfb.com/intern/testinfra/testrun/1970325050015770 Network: Up: 72KiB Down: 22KiB (reSessionID-44adc8b2-54e9-453a-bd20-710cefefaed1) Jobs completed: 20. Time elapsed: 1:44.6s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 7. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P887605070 ### igctr P892987433 ### mai P893109069 ### icvr P893075846 ### oc P893109069 ### mixed precison training P898569125 # e2e test baseline f509792379 proposal f509792025 https://pxl.cl/3Xbcf Reviewed By: xuzhao9 Differential Revision: D51332067
This pull request was exported from Phabricator. Differential Revision: D51332067 |
5d7ecfd
to
4228fe8
Compare
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/6c5d1d31-e4d1-4865-bf79-1e7ac3b6e051 Test UI: https://www.internalfb.com/intern/testinfra/testrun/1970325050015770 Network: Up: 72KiB Down: 22KiB (reSessionID-44adc8b2-54e9-453a-bd20-710cefefaed1) Jobs completed: 20. Time elapsed: 1:44.6s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 7. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P887605070 ### igctr P892987433 ### mai P893109069 ### icvr P893075846 ### oc P893109069 ### mixed precison training P898569125 # e2e test baseline f509792379 proposal f509792025 https://pxl.cl/3Xbcf Reviewed By: xuzhao9 Differential Revision: D51332067
This pull request was exported from Phabricator. Differential Revision: D51332067 |
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/6c5d1d31-e4d1-4865-bf79-1e7ac3b6e051 Test UI: https://www.internalfb.com/intern/testinfra/testrun/1970325050015770 Network: Up: 72KiB Down: 22KiB (reSessionID-44adc8b2-54e9-453a-bd20-710cefefaed1) Jobs completed: 20. Time elapsed: 1:44.6s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 7. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P887605070 ### igctr P892987433 ### mai P893109069 ### icvr P893075846 ### oc P893109069 ### mixed precison training P898569125 # e2e test baseline f509792379 proposal f509792025 https://pxl.cl/3Xbcf Reviewed By: xuzhao9 Differential Revision: D51332067
4228fe8
to
acfeca2
Compare
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/6c5d1d31-e4d1-4865-bf79-1e7ac3b6e051 Test UI: https://www.internalfb.com/intern/testinfra/testrun/1970325050015770 Network: Up: 72KiB Down: 22KiB (reSessionID-44adc8b2-54e9-453a-bd20-710cefefaed1) Jobs completed: 20. Time elapsed: 1:44.6s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 7. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P887605070 ### igctr P892987433 ### mai P893109069 ### icvr P893075846 ### oc P893109069 ### mixed precison training P898569125 # e2e test baseline f509792379 proposal f509792025 https://pxl.cl/3Xbcf Reviewed By: xuzhao9 Differential Revision: D51332067
acfeca2
to
1cda761
Compare
This pull request was exported from Phabricator. Differential Revision: D51332067 |
1 similar comment
This pull request was exported from Phabricator. Differential Revision: D51332067 |
@pytorchbot merge (Initiating merge automatically since Phabricator Diff has merged) |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/19b3f641-782f-4f94-a953-3ff9ce2cfa7b Test UI: https://www.internalfb.com/intern/testinfra/testrun/1125900251953016 Network: Up: 67KiB Down: 32KiB (reSessionID-c2a80f26-8227-4f78-89fc-bcbda0ae8353) Jobs completed: 18. Time elapsed: 1:19.8s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 6. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P881792289 ### igctr ### dsnn ### icvr Reviewed By: xuzhao9 Differential Revision: D51332067 Pull Request resolved: pytorch#114778 Approved by: https://github.com/xuzhao9
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/19b3f641-782f-4f94-a953-3ff9ce2cfa7b Test UI: https://www.internalfb.com/intern/testinfra/testrun/1125900251953016 Network: Up: 67KiB Down: 32KiB (reSessionID-c2a80f26-8227-4f78-89fc-bcbda0ae8353) Jobs completed: 18. Time elapsed: 1:19.8s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 6. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P881792289 ### igctr ### dsnn ### icvr Reviewed By: xuzhao9 Differential Revision: D51332067 Pull Request resolved: pytorch#114778 Approved by: https://github.com/xuzhao9
…h#114778) Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad Test Plan: # unit test ``` buck2 test 'fbcode//mode/dev-nosan' fbcode//caffe2/test/inductor:group_batch_fusion ``` Buck UI: https://www.internalfb.com/buck2/19b3f641-782f-4f94-a953-3ff9ce2cfa7b Test UI: https://www.internalfb.com/intern/testinfra/testrun/1125900251953016 Network: Up: 67KiB Down: 32KiB (reSessionID-c2a80f26-8227-4f78-89fc-bcbda0ae8353) Jobs completed: 18. Time elapsed: 1:19.8s. Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2) Tests finished: Pass 6. Fail 0. Fatal 0. Skip 0. Build failure 0 # local reproduce ### cmf P881792289 ### igctr ### dsnn ### icvr Reviewed By: xuzhao9 Differential Revision: D51332067 Pull Request resolved: pytorch#114778 Approved by: https://github.com/xuzhao9
Summary: We construct a unified API that can be easily add pointwise ops to be batched in the post grad
Test Plan:
unit test
Buck UI: https://www.internalfb.com/buck2/19b3f641-782f-4f94-a953-3ff9ce2cfa7b
Test UI: https://www.internalfb.com/intern/testinfra/testrun/1125900251953016
Network: Up: 67KiB Down: 32KiB (reSessionID-c2a80f26-8227-4f78-89fc-bcbda0ae8353)
Jobs completed: 18. Time elapsed: 1:19.8s.
Cache hits: 0%. Commands: 2 (cached: 0, remote: 0, local: 2)
Tests finished: Pass 6. Fail 0. Fatal 0. Skip 0. Build failure 0
local reproduce
cmf
P881792289
igctr
dsnn
icvr
Reviewed By: xuzhao9
Differential Revision: D51332067
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler