@chaekit (Contributor) commented Oct 13, 2023

Summary:
Without the `all` in the fix:

```
node.kwargs.get("beta", 1.0) == 1.0
node.kwargs.get("alpha", 1.0) == 1.0
and len(input_shape) == 2
and len(weight_shape) == 2
and all(x % 2 == 0 for x in input_shape + weight_shape)
and shape <= MAX_FUSE_TENSOR_SIZE_GROUP_LINEAR # <----- HERE
for shape in input_shape + weight_shape
```

the trailing `for` clause turns the whole condition into a generator expression. A generator object is always truthy, so the check always passes. One consequence is that the shapes can be odd, which forces GMM to load element by element rather than using vectorized loads. In the VDDv3 torchbench example (posted in the test plan), you can see a 37 ms GMM call that swamps any gain from fusion. Overall this change makes the GMM fusion 24% faster.
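The failure mode is easy to reproduce in isolation. A minimal sketch follows; the `MAX_FUSE_TENSOR_SIZE_GROUP_LINEAR` value and the shapes are made up for illustration and are not the real values from the PyTorch source:

```python
# Made-up constant and shapes for illustration only.
MAX_FUSE_TENSOR_SIZE_GROUP_LINEAR = 4096

input_shape = [1023, 8192]   # odd dim AND oversized dim: fusion should be rejected
weight_shape = [8192, 1024]

# Buggy form: the trailing `for` clause makes the WHOLE conjunction the body of
# a generator expression. The generator is never consumed, and any generator
# object is truthy, so this "check" always passes.
buggy = bool(
    len(input_shape) == 2
    and len(weight_shape) == 2
    and all(x % 2 == 0 for x in input_shape + weight_shape)
    and shape <= MAX_FUSE_TENSOR_SIZE_GROUP_LINEAR
    for shape in input_shape + weight_shape
)
print(buggy)  # True, even though the shapes violate both size checks

# Fixed form: wrapping the size comparison in all() actually evaluates it,
# and the evenness check is no longer swallowed by the generator.
fixed = (
    len(input_shape) == 2
    and len(weight_shape) == 2
    and all(x % 2 == 0 for x in input_shape + weight_shape)
    and all(
        shape <= MAX_FUSE_TENSOR_SIZE_GROUP_LINEAR
        for shape in input_shape + weight_shape
    )
)
print(fixed)  # False: odd/oversized shapes now correctly block fusion
```

Note that the bug disables every clause of the conjunction, not just the size check: the entire `A and B and ... and shape <= MAX` expression becomes the element expression of the generator, which is why odd shapes could slip past the `x % 2 == 0` check too.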

Differential Revision: D48696572

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler

@pytorch-bot bot commented Oct 13, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/111174

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 228cbdf with merge base 898482f:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot (Contributor) commented:

This pull request was exported from Phabricator. Differential Revision: D48696572


…1174)

Reviewed By: davidberard98

Differential Revision: D48696572

@facebook-github-bot (Contributor) commented:

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 13, 2023
@pytorchmergebot (Collaborator) commented:

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@facebook-github-bot facebook-github-bot deleted the export-D48696572 branch October 17, 2023 14:24