Conversation

@XiaobingSuper (Collaborator) commented Jul 20, 2023

This PR uses binary folding to perform the conv+bn folding, avoiding `make_fx`, which hits tracing errors on the dynamic-shape path of some models.
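For context, conv+bn folding absorbs the batch norm's affine transform into the convolution's weight and bias. A minimal standalone sketch of the algebra (illustrative names and shapes, not the Inductor pass itself):

```python
import torch

def fold_conv_bn(conv_w, conv_b, bn_mean, bn_var, bn_gamma, bn_beta, eps=1e-5):
    # BN computes y = (x - mean) / sqrt(var + eps) * gamma + beta, which is an
    # affine transform per output channel and thus folds into the conv params.
    scale = bn_gamma / torch.sqrt(bn_var + eps)
    folded_w = conv_w * scale.reshape(-1, 1, 1, 1)  # scale each output channel
    folded_b = (conv_b - bn_mean) * scale + bn_beta
    return folded_w, folded_b

# Quick numerical check of the algebra:
conv = torch.nn.Conv2d(3, 8, 3)
bn = torch.nn.BatchNorm2d(8).eval()
x = torch.randn(1, 3, 16, 16)
w, b = fold_conv_bn(conv.weight, conv.bias, bn.running_mean, bn.running_var,
                    bn.weight, bn.bias, bn.eps)
torch.testing.assert_close(torch.nn.functional.conv2d(x, w, b), bn(conv(x)))
```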

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ipiszy ngimel yf225 chenyang78 kadeng muchulee8 aakhundov

[ghstack-poisoned]
@pytorch-bot bot commented Jul 20, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/105650

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

✅ 3 Unrelated Failures

As of commit 42a29db:

UNSTABLE - The following jobs failed but were likely due to flakiness present on trunk and have been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@eellison (Contributor) left a comment:

cool!



```diff
-def freezing_passes(gm: torch.fx.GraphModule):
+def freezing_passes(gm: torch.fx.GraphModule, aot_example_inputs):
```
Contributor commented:

Can we special-case this and just run the folding pass for a number of iterations first? It feels unnecessary to run each pass 4 times.

We should also be able to short-circuit if the binary folding pass didn't find any additional matches.

@XiaobingSuper (Collaborator, Author) replied:

Yes, I separated out the binary folding pass and added a short-circuit for when it finds no additional matches.

```python
self.assertNotIn(
    "aten._native_batch_norm_legit_no_training(",
    code[0],
)
```
@XiaobingSuper (Collaborator, Author) commented:

Changed this check because the generated code contains the comment `# Source Nodes: [l__mod___bn], Original ATen: [aten._native_batch_norm_legit_no_training]`; matching the call form with the trailing `(` avoids a false positive on that comment.
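For readers outside the test file, `code` is the list of source strings Inductor generated; a minimal sketch of how such an assertion is typically driven (the module, shapes, and config toggle here are illustrative assumptions):

```python
import torch
import torch._inductor.config as inductor_config
from torch._inductor.utils import run_and_get_code

inductor_config.freezing = True  # conv+bn folding runs in the freezing passes

mod = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.BatchNorm2d(8)).eval()
x = torch.randn(1, 3, 16, 16)

with torch.no_grad():
    # Returns the call's result plus the generated source strings.
    _, code = run_and_get_code(torch.compile(mod), x)

# After folding, the BN op should no longer be *called*, although its name can
# still appear in "Source Nodes:" comments, hence matching on the trailing "(".
assert "aten._native_batch_norm_legit_no_training(" not in code[0]
```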

@eellison (Contributor) left a comment:

🚀

```python
# We need a few rounds of binary folding to get rid of all the unnecessary
# nodes, but we may need a better method to choose the number of rounds;
# patterns can look like conv + binary + binary.
binary_folding = 0
```
Contributor commented:

I'm not 100% sure the counters get reset, so I would initialize `binary_folding = counters["inductor"]["binary_folding"]`.

@XiaobingSuper (Collaborator, Author) replied:

Changed.

```python
for _ in range(4):
    constant_fold(gm)
    # Make sure meta['val'] is properly set for all nodes.
    fake_tensor_prop(gm, aot_example_inputs, True)
```
Contributor commented:

TODO: remove the need to run fake_tensor_prop on the whole model.

@XiaobingSuper (Collaborator, Author) replied:

Added.

@XiaobingSuper added the `ciflow/trunk` label (Trigger trunk jobs on your pull request) on Jul 26, 2023.
@XiaobingSuper (Collaborator, Author) commented:

@pytorchbot merge

@pytorchmergebot (Collaborator) commented:
Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team


```python
# TODO: remove the need to run fake_tensor_prop on the whole model.
if counters["inductor"]["binary_folding"] == binary_folding:
    break  # no additional folds this round
# The counter is cumulative, so record its current value for the next
# round's comparison (assignment, not `+=`).
binary_folding = counters["inductor"]["binary_folding"]
```
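Pieced together, the short-circuit discussed above looks roughly like this (a sketch only: `fuse_binary` is a stand-in name for the binary folding pass, and `constant_fold`/`fake_tensor_prop` are the helpers already used in the diff, assumed in scope):

```python
from torch._dynamo.utils import counters

def run_binary_folding(gm, aot_example_inputs, max_rounds=4):
    # Seed from the counter rather than 0, since counters may not be reset
    # between compilations (the point raised above).
    binary_folding = counters["inductor"]["binary_folding"]
    for _ in range(max_rounds):
        fuse_binary(gm)  # stand-in for the folding pass
        if counters["inductor"]["binary_folding"] == binary_folding:
            break  # no new matches this round; more rounds won't help
        binary_folding = counters["inductor"]["binary_folding"]
        constant_fold(gm)
        # Refresh meta['val'] so the next round's pattern matching sees
        # correct fake tensors.
        fake_tensor_prop(gm, aot_example_inputs, True)
```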
Contributor commented:

From looking at res2next50, I think we still need to add support for convs with no bias.
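A sketch of what no-bias support could look like, extending the folding algebra from the description (hypothetical, standalone tensors; not the PR's code):

```python
import torch

def fold_bias_for_biasless_conv(conv_w, conv_b, bn_mean, bn_var,
                                bn_gamma, bn_beta, eps):
    scale = bn_gamma / torch.sqrt(bn_var + eps)
    # When the conv was created with bias=False, synthesize a zero bias so the
    # batch-norm shift term still has somewhere to fold (illustrative only).
    if conv_b is None:
        conv_b = torch.zeros(conv_w.shape[0], dtype=conv_w.dtype,
                             device=conv_w.device)
    return (conv_b - bn_mean) * scale + bn_beta
```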

@facebook-github-bot deleted the gh/XiaobingSuper/147/head branch on July 29, 2023 14:16.