[Quant][Inductor] Enable quantization conv_unary(relu) pattern fusion inside inductor #105455

leslie-fang-intel · 2023-07-18T09:45:56Z

Stack from ghstack (oldest at bottom):

Summary
Enable the dequant-conv2d-unary_postop(relu)-quant pattern fusion and lowering inside inductor.

Test Plan

clear && python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d_unary

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @ngimel @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov

pytorch-bot · 2023-07-18T09:45:59Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/105455

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit d9d6588 with merge base 97a291f ():

UNSTABLE - The following jobs failed but were likely due to flakiness present on trunk and has been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

… inside inductor [ghstack-poisoned]

…tern fusion inside inductor" **Summary** Enable the `dequant-conv2d-unary_postop(relu)-quant` pattern fusion and lowering inside inductor. **Test Plan** ``` clear && python -u -m pytest -s -v test_mkldnn_pattern_matcher.py -k test_qconv2d_unary ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ipiszy ngimel yf225 chenyang78 kadeng muchulee8 aakhundov [ghstack-poisoned]

… inside inductor ghstack-source-id: 1fe7343b23a99c19e1bb2a2418209a880828ae60 Pull Request resolved: pytorch#105455

…tern fusion inside inductor" **Summary** Enable the `dequant-conv2d-unary_postop(relu)-quant` pattern fusion and lowering inside inductor. **Test Plan** ``` clear && python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d_unary ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ipiszy ngimel yf225 chenyang78 kadeng muchulee8 aakhundov [ghstack-poisoned]

leslie-fang-intel · 2023-07-25T04:34:02Z

Hi @eellison @jansel, could you kindly help to take a look of this PR?

… inside inductor ghstack-source-id: df11a2e91fe22278fc6707e0a484abafc84128a4 Pull Request resolved: pytorch#105455

…tern fusion inside inductor" **Summary** Enable the `dequant-conv2d-unary_postop(relu)-quant` pattern fusion and lowering inside inductor. **Test Plan** ``` clear && python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d_unary ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ipiszy ngimel yf225 chenyang78 kadeng muchulee8 aakhundov [ghstack-poisoned]

… inside inductor ghstack-source-id: a1ff1ebd4ba0faf5f663863281e81f9fa362a1d8 Pull Request resolved: pytorch#105455

…tern fusion inside inductor" **Summary** Enable the `dequant-conv2d-unary_postop(relu)-quant` pattern fusion and lowering inside inductor. **Test Plan** ``` clear && python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d_unary ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ipiszy ngimel yf225 chenyang78 kadeng muchulee8 aakhundov [ghstack-poisoned]

leslie-fang-intel · 2023-08-25T18:02:40Z

@pytorchbot merge

pytorchmergebot · 2023-08-25T18:04:26Z

Merge failed

Reason: This PR needs a release notes: label
If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Details for Dev Infra team

Raised by workflow job

leslie-fang-intel · 2023-08-25T18:04:58Z

@pytorchbot merge

pytorchmergebot · 2023-08-25T18:07:16Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…rn fusion inside inductor (#105456) **Summary** Enable the `dequant-conv2d-binary_postop(add)-unary_postop(relu)-quant` pattern fusion and lowering inside inductor. **Test Plan** ``` clear && python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d_binary ``` Pull Request resolved: #105456 Approved by: https://github.com/jgong5, https://github.com/eellison ghstack dependencies: #104580, #104581, #104588, #104590, #105455

…ol2d) (#105639) **Summary** In this PR, we mainly enable 2 things. - Enable the skeleton of quantization recipe for single quantizable operators in `X86InductorQuantizer`. - Add quantization recipe of `maxpool2d` and annotate it as input./output share observer. **Test Plan** ``` python -m pytest test_x86inductor_quantizer.py -k test_maxpool2d_recipe ``` Pull Request resolved: #105639 Approved by: https://github.com/jgong5, https://github.com/jerryzh168 ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456

**Summary** Enable the `dq-maxpool2d-q` pattern match and lower into `torch.ops.quantized.max_pool2d`. **Test Plan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k test_qmaxpool2d python -m pytest test_quantized_op.py -k test_max_pool2d_pt2e ``` Pull Request resolved: #105906 Approved by: https://github.com/jgong5, https://github.com/eellison ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456, #105639

**Summary** After oneDNN 3.1 upgrade, we don't need to do the weight scale reciprocal calculation. So, remove the redundant reciprocal calculation to optimize QConv performance and using IDeep version API to implement it in this PR: - This QConv implementation expects to work functionally both with current IDeep version and the following IDeep upgrade in PR: #107565. - With the following IDeep upgrade in PR: #107565, the QConv has better performance since the redundant reciprocal calculation are removed. Pull Request resolved: #105996 Approved by: https://github.com/jgong5, https://github.com/jerryzh168 ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456, #105639, #105906

…ht scale reciprocal calculation (#107565) **Summary** Upgrade IDeep which includes 1 IDeep change as IDeep PR: intel/ideep#226 - For IDeep PR: intel/ideep#226 which has done 2 things: - Remove the redundant QConv weight scale reciprocal calculation. - Pump IDEEP_VERSION_REVISION version from 0 to 1. So only QConv related calculation will be impacted and we already use IDeep version API in #105996 to make the corresponding change in PyTorch. Pull Request resolved: #107565 Approved by: https://github.com/jgong5, https://github.com/jerryzh168 ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456, #105639, #105906, #105996

… inside inductor (#105455) **Summary** Enable the `dequant-conv2d-unary_postop(relu)-quant` pattern fusion and lowering inside inductor. **Test Plan** ``` clear && python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d_unary ``` Pull Request resolved: #105455 Approved by: https://github.com/jgong5, https://github.com/eellison ghstack dependencies: #104580, #104581, #104588, #104590

…rn fusion inside inductor (#105456) **Summary** Enable the `dequant-conv2d-binary_postop(add)-unary_postop(relu)-quant` pattern fusion and lowering inside inductor. **Test Plan** ``` clear && python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d_binary ``` Pull Request resolved: #105456 Approved by: https://github.com/jgong5, https://github.com/eellison ghstack dependencies: #104580, #104581, #104588, #104590, #105455

…ol2d) (#105639) **Summary** In this PR, we mainly enable 2 things. - Enable the skeleton of quantization recipe for single quantizable operators in `X86InductorQuantizer`. - Add quantization recipe of `maxpool2d` and annotate it as input./output share observer. **Test Plan** ``` python -m pytest test_x86inductor_quantizer.py -k test_maxpool2d_recipe ``` Pull Request resolved: #105639 Approved by: https://github.com/jgong5, https://github.com/jerryzh168 ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456

**Summary** Enable the `dq-maxpool2d-q` pattern match and lower into `torch.ops.quantized.max_pool2d`. **Test Plan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k test_qmaxpool2d python -m pytest test_quantized_op.py -k test_max_pool2d_pt2e ``` Pull Request resolved: #105906 Approved by: https://github.com/jgong5, https://github.com/eellison ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456, #105639

**Summary** After oneDNN 3.1 upgrade, we don't need to do the weight scale reciprocal calculation. So, remove the redundant reciprocal calculation to optimize QConv performance and using IDeep version API to implement it in this PR: - This QConv implementation expects to work functionally both with current IDeep version and the following IDeep upgrade in PR: #107565. - With the following IDeep upgrade in PR: #107565, the QConv has better performance since the redundant reciprocal calculation are removed. Pull Request resolved: #105996 Approved by: https://github.com/jgong5, https://github.com/jerryzh168 ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456, #105639, #105906

…ht scale reciprocal calculation (#107565) **Summary** Upgrade IDeep which includes 1 IDeep change as IDeep PR: intel/ideep#226 - For IDeep PR: intel/ideep#226 which has done 2 things: - Remove the redundant QConv weight scale reciprocal calculation. - Pump IDEEP_VERSION_REVISION version from 0 to 1. So only QConv related calculation will be impacted and we already use IDeep version API in #105996 to make the corresponding change in PyTorch. Pull Request resolved: #107565 Approved by: https://github.com/jgong5, https://github.com/jerryzh168 ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456, #105639, #105906, #105996

This was referenced Jul 18, 2023

[Quant][PT2E] Remove x86 inductor pt2e backend config #105039

Closed

[Quant][Inductor] Use truncate instead of default rounding round when convert float to uint8 #105109

Closed

github-actions bot added module: inductor ciflow/inductor labels Jul 18, 2023

pytorchbot added the open source label Jul 18, 2023

leslie-fang-intel marked this pull request as draft July 18, 2023 09:46

[Quant][Inductor] Enable quantization conv_unary(relu) pattern fusion…

ce2ce82

… inside inductor [ghstack-poisoned]

leslie-fang-intel added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 18, 2023

leslie-fang-intel requested a review from jgong5 July 19, 2023 00:00

leslie-fang-intel added 2 commits July 19, 2023 09:01

This was referenced Jul 20, 2023

[Quant][PT2E] Re-enable test case of conv add/add_relu recipe for x86inductorquantizer #105638

Closed

[Quant][PT2E] Enable X86InductorQuantizer single quantizable op(maxpool2d) #105639

Closed

leslie-fang-intel added 3 commits July 24, 2023 13:57

jgong5 approved these changes Jul 25, 2023

View reviewed changes

leslie-fang-intel marked this pull request as ready for review July 25, 2023 04:32

leslie-fang-intel requested review from eellison and jansel July 25, 2023 04:33

leslie-fang-intel mentioned this pull request Aug 25, 2023

[Quant][PT2E]Make _fuse_conv_bn_ support graph capture by torch._dynamo.export #107951

Closed

pytorchmergebot added the merging label Aug 25, 2023

pytorchmergebot removed the merging label Aug 25, 2023

leslie-fang-intel added the topic: not user facing topic category label Aug 25, 2023

pytorchmergebot added the merging label Aug 25, 2023

pytorchmergebot added Merged and removed merging labels Aug 25, 2023

pytorchmergebot closed this in c1e0fb7 Aug 25, 2023

facebook-github-bot deleted the gh/leslie-fang-intel/58/head branch August 29, 2023 14:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Quant][Inductor] Enable quantization conv_unary(relu) pattern fusion inside inductor #105455

[Quant][Inductor] Enable quantization conv_unary(relu) pattern fusion inside inductor #105455

leslie-fang-intel commented Jul 18, 2023 •

edited

pytorch-bot bot commented Jul 18, 2023 •

edited

leslie-fang-intel commented Jul 25, 2023

leslie-fang-intel commented Aug 25, 2023

pytorchmergebot commented Aug 25, 2023

leslie-fang-intel commented Aug 25, 2023

pytorchmergebot commented Aug 25, 2023

[Quant][Inductor] Enable quantization conv_unary(relu) pattern fusion inside inductor #105455

[Quant][Inductor] Enable quantization conv_unary(relu) pattern fusion inside inductor #105455

Conversation

leslie-fang-intel commented Jul 18, 2023 • edited

pytorch-bot bot commented Jul 18, 2023 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/105455

✅ You can merge normally! (2 Unrelated Failures)

leslie-fang-intel commented Jul 25, 2023

leslie-fang-intel commented Aug 25, 2023

pytorchmergebot commented Aug 25, 2023

Merge failed

leslie-fang-intel commented Aug 25, 2023

pytorchmergebot commented Aug 25, 2023

Merge started

leslie-fang-intel commented Jul 18, 2023 •

edited

pytorch-bot bot commented Jul 18, 2023 •

edited