[Inductor] [Quant] Enable QLinear int8-mixed-bf16 Lowering #112486
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/112486
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 08aceee with merge base eb15340.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
ghstack-source-id: 2dda7766d50b605fb2f636dca6c4826065a459d5
Pull Request resolved: #112486

**Summary**
- PR 6 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor #111640.
- Enable the QLinear int8-mixed-bf16 weight prepack and post-grad lowering inside Inductor.

**TestPlan**
```
python -m pytest test_mkldnn_pattern_matcher.py -k test_qlinear
```

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler
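The int8-mixed-bf16 lowering fuses a dequantize chain (subtract zero point, multiply by scale, cast to bfloat16) into the prepacked QLinear op. As a rough illustration of the numerics involved — a minimal sketch in pure Python, where `bf16_round` and `dequantize_to_bf16` are hypothetical helper names, not Inductor or ATen APIs:

```python
import struct

def bf16_round(x: float) -> float:
    """Round to bfloat16 precision by keeping only the top 16 bits
    of the float32 representation (truncating the mantissa to 7 bits)."""
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFF0000))[0]

def dequantize_to_bf16(q: int, scale: float, zero_point: int) -> float:
    """int8 -> float dequantize, (q - zero_point) * scale, then cast to bf16.
    This mirrors the sub -> mul -> to_bf16 chain that the lowering fuses."""
    return bf16_round((q - zero_point) * scale)

print(dequantize_to_bf16(100, 0.1, 0))   # 10.0
print(bf16_round(1.00390625))            # 1.0 — 2**-8 falls below bf16 precision
```

The real lowering keeps the weight prepacked and performs this dequantization inside the fused kernel rather than materializing intermediate fp32 tensors.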
Hi @jgong5, please take a look at this PR as well.
```
if (
    len(list(to_fp32_node.users)) != 1
    or len(list(sub_node.users)) != 1
    or len(list(mul_node.users)) != 1
):
```
I think you should be able to match this in the pattern.
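The suggestion is to encode the "exactly one user" constraint in the pattern itself rather than in a separate post-match check. A toy sketch of the idea — the `Node` class and `matches_chain` helper are illustrative inventions, not the Inductor pattern-matcher API:

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    """Minimal stand-in for an FX graph node: an op name plus its users."""
    op: str
    users: list = field(default_factory=list)

def matches_chain(node: Node, ops: list) -> bool:
    """Walk a chain like to_fp32 -> sub -> mul, requiring each node to have
    exactly one user, so the whole chain is safe to fuse away in one match."""
    for op in ops:
        if node.op != op or len(node.users) != 1:
            return False
        node = node.users[0]
    return True

linear = Node("linear")
mul = Node("mul", [linear])
sub = Node("sub", [mul])
to_fp32 = Node("to_fp32", [sub])
print(matches_chain(to_fp32, ["to_fp32", "sub", "mul"]))  # True
```

Folding the user-count requirement into the match keeps the filter logic in one place and avoids matching chains whose intermediates are shared with other consumers.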
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Merge failed
Reason: Command
Details for Dev Infra team: raised by workflow job.
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
…12486)

**Summary**
- PR 7 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor pytorch#111640.
- Enable the QLinear int8-mixed-bf16 weight prepack and post-grad lowering inside Inductor.

**TestPlan**
```
python -m pytest test_mkldnn_pattern_matcher.py -k test_qlinear
```

Pull Request resolved: pytorch#112486
Approved by: https://github.com/jgong5, https://github.com/eellison, https://github.com/jerryzh168
Stack from ghstack (oldest at bottom):

**Summary**
- PR 7 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor #111640.
- Enable the QLinear int8-mixed-bf16 weight prepack and post-grad lowering inside Inductor.

**TestPlan**
```
python -m pytest test_mkldnn_pattern_matcher.py -k test_qlinear
```

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler