inductor: support conv+binary folding for freezing path #105048
Conversation
@eellison, I wonder if we could remove the conv_bn folding pass by directly calling binary folding several times, to avoid re-tracing the aot_model.
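To illustrate the idea, here is a minimal sketch (under assumed shapes, not the actual inductor trace): inference-mode batch norm decomposes into a chain of elementwise binary ops against constant tensors, so applying conv+binary folding once per op would absorb the whole normalization, matching what the dedicated conv_bn pass does:

```python
import torch

conv = torch.nn.Conv2d(3, 8, 3, bias=True).eval()
bn = torch.nn.BatchNorm2d(8).eval()
with torch.no_grad():  # give the BN non-trivial constants
    bn.running_mean.uniform_(-1, 1)
    bn.running_var.uniform_(0.5, 1.5)
    bn.weight.uniform_(0.5, 1.5)
    bn.bias.uniform_(-1, 1)

x = torch.randn(1, 3, 16, 16)

# In eval mode, bn(y) = (y - mean) * rsqrt(var + eps) * gamma + beta:
# a sub, two muls, and an add, each against a constant tensor, so each
# step is itself a foldable conv+binary pattern.
mean = bn.running_mean.view(1, -1, 1, 1)
inv_std = (bn.running_var.view(1, -1, 1, 1) + bn.eps).rsqrt()
gamma = bn.weight.view(1, -1, 1, 1)
beta = bn.bias.view(1, -1, 1, 1)

decomposed = (conv(x) - mean) * inv_std * gamma + beta
torch.testing.assert_close(decomposed, bn(conv(x)), rtol=1e-4, atol=1e-5)
```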
Still need to take a look at this, but FYI there's a similar pass in JIT; it may be worth checking that all of the same conditions are checked.
Can we reuse these tests?
6971149#diff-533101d7d55b7513c4966fcda7f2554d9a7e19b82d54c6b00c6db59686d0e18cR1428-R1510
```python
_binary_ops = [aten.add.Tensor, aten.sub.Tensor, aten.mul.Tensor, aten.div.Tensor]
_computation_calls = [CallFunction(aten.convolution.default, *_conv_args)]


def _is_constant_node(node):
```
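For readers unfamiliar with the helper, here is a plausible sketch of what a constant-node predicate checks on a frozen FX graph; the name matches the excerpt above, but the body is illustrative, not the PR's exact implementation:

```python
import torch.fx as fx

def _is_constant_node(node: fx.Node) -> bool:
    # After freezing, parameters/buffers are lifted into the graph as
    # get_attr nodes, so their tensor values are known at compile time
    # and a binary op against them can be folded into the conv weights.
    return node.op == "get_attr"
```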
Yes, the checks follow the JIT pass.
Cool! Would you mind copying over some of the comments and reusing the tests? And maybe moving this to a separate file?
I think it'd be cool to remove the batch norm pass and replace it with this.
Done: I copied over some of the comments and reused the tests. The scalar case isn't supported yet, because it needs `aten.full` to create a new tensor, and that tensor can't be folded (https://github.com/pytorch/pytorch/blob/main/torch/_inductor/freezing.py#L105). I will open another PR to remove the batch norm pass.
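To make the scalar limitation concrete, a hedged sketch (module and shapes are illustrative): folding `conv(x) + 2.0` would first require materializing the scalar as a tensor via `aten.full`, and `aten.full` outputs are exactly what the freezing constant folder at the linked line refuses to fold; a constant-tensor operand needs no such materialization:

```python
import torch

conv = torch.nn.Conv2d(3, 8, 3, bias=True)
x = torch.randn(1, 3, 16, 16)

# Scalar case the pass skips: 2.0 would have to become a tensor first
# (conceptually aten.full), and that tensor can't be constant-folded
# on the freezing path.
y_scalar = conv(x) + 2.0

# Tensor case the pass handles: `other` is already a constant tensor,
# so it can be folded straight into conv.bias.
other = torch.full((1, 8, 1, 1), 2.0)
y_tensor = conv(x) + other
torch.testing.assert_close(y_scalar, y_tensor)
```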
Yeah, sounds good!
Merge started: your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Merge failed. Reason: 1 mandatory check(s) failed. Dig deeper by viewing the failures on hud.
test/inductor/test_binary_folding.py (outdated)
```python
inp = torch.rand(inps).to(self.device)
out_eager = mod_eager(inp)
out_optimized = out_optimized(inp)
self.assertEqual(out_optimized, out_eager, atol=2e-04, rtol=1e-5)
```
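For context, a self-contained sketch of the test pattern above (the module, shapes, and the `torch._inductor.config.freezing` switch are assumptions; the actual harness in test/inductor/test_binary_folding.py parameterizes ops, dtypes, and devices):

```python
import torch
import torch._inductor.config as inductor_config

class ConvAdd(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = torch.nn.Conv2d(3, 8, 3)
        self.other = torch.nn.Parameter(torch.randn(1, 8, 1, 1))

    def forward(self, x):
        return self.conv(x) + self.other

inductor_config.freezing = True  # enable the freezing path
mod_eager = ConvAdd().eval()
with torch.no_grad():  # freezing treats params as constants under no_grad
    out_optimized = torch.compile(mod_eager)
    inp = torch.rand(1, 3, 16, 16)
    out_eager = mod_eager(inp)
    out_optimized = out_optimized(inp)  # compiled callable -> output
torch.testing.assert_close(out_optimized, out_eager, atol=2e-4, rtol=1e-5)
```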
The CPU path works fine even with the default tolerance, but the GPU path seems to have a large variance. My local machine (A10) can't reproduce it; I need to find another machine that can.
Skipped the test when cuDNN is enabled; it passes when the cuDNN convolution is disabled.
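A hedged sketch of that workaround (illustrative only, requires a CUDA device; the real test uses the suite's own helpers): `torch.backends.cudnn.flags` is a context manager that toggles cuDNN, forcing ATen's native fallback convolution kernels, whose numerics the comparison then tracks within the default tolerance:

```python
import torch

conv = torch.nn.Conv2d(3, 8, 3, device="cuda")
x = torch.randn(1, 3, 16, 16, device="cuda")

with torch.backends.cudnn.flags(enabled=False):
    out_fallback = conv(x)  # ATen's native (non-cuDNN) conv kernel
out_cudnn = conv(x)

# The two can differ slightly; that gap is the variance the comment
# above is working around.
print((out_fallback - out_cudnn).abs().max())
```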
@pytorchbot merge

Merge failed. Reason: Not merging any PRs at the moment because there is a merge blocking https://github.com/pytorch/pytorch/labels/ci:%20sev issue open. Details for Dev Infra team: raised by workflow job.

@pytorchbot merge -f "unrelated failures"

Merge started: your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @ngimel @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov