[Quant] [PT2] Enable QLinear input with multi dims #113733

leslie-fang-intel · 2023-11-15T02:49:46Z

Stack from ghstack (oldest at bottom):

[Quant] [Inductor] Enable QLinear weight prepack when input dimension size exceeds 2 #113928
[Quant] [Inductor] Enable Dequant Promotion when Linear input dimension size exceeds 2 #113912
-> [Quant] [PT2] Enable QLinear input with multi dims #113733

Summary
In the previous QLinear implementation, it was assumed that inputs have a dimension of 2. In this update, we have modified QLinear to accept inputs with a dimension greater than 2, incorporating input and output reshaping accordingly.

Test Plan

python -u -m pytest -s -v test_quantized_op.py -k test_qlinear_pt2e

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @voznesenskym @penguinwu @EikanWang @Guobing-Chen @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler

[ghstack-poisoned]

pytorch-bot · 2023-11-15T02:49:49Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/113733

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 29479ce with merge base 7bbc19a ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 3950269 Pull Request resolved: #113733

leslie-fang-intel · 2023-11-15T02:51:45Z

cc @jianan-gu

aten/src/ATen/native/quantized/cpu/qlinear.cpp

**Summary** Previously QLinear implementation assumes inputs is with dim of 2. In this diff, we make QLinear accepts input of dim more than 2 with input and output reshape. **Test Plan** ``` python -u -m pytest -s -v test_quantized_op.py -k test_qlinear_pt2e ``` cc jgong5 mingfeima XiaobingSuper sanchitintel ashokei jingxu10 [ghstack-poisoned]

ghstack-source-id: 72b2d8d Pull Request resolved: #113733

aten/src/ATen/native/quantized/cpu/qlinear.cpp

**Summary** In the previous QLinear implementation, it was assumed that inputs have a dimension of 2. In this update, we have modified QLinear to accept inputs with a dimension greater than 2, incorporating input and output reshaping accordingly. **Test Plan** ``` python -u -m pytest -s -v test_quantized_op.py -k test_qlinear_pt2e ``` cc jgong5 mingfeima XiaobingSuper sanchitintel ashokei jingxu10 voznesenskym penguinwu EikanWang Guobing-Chen zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler [ghstack-poisoned]

ghstack-source-id: b1796d3 Pull Request resolved: pytorch#113733

**Summary** In the previous QLinear implementation, it was assumed that inputs have a dimension of 2. In this update, we have modified QLinear to accept inputs with a dimension greater than 2, incorporating input and output reshaping accordingly. **Test Plan** ``` python -u -m pytest -s -v test_quantized_op.py -k test_qlinear_pt2e ``` cc jgong5 mingfeima XiaobingSuper sanchitintel ashokei jingxu10 voznesenskym penguinwu EikanWang Guobing-Chen zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler [ghstack-poisoned]

ghstack-source-id: 7fd173b Pull Request resolved: pytorch#113733

leslie-fang-intel · 2023-11-21T04:38:12Z

Hi @jerryzh168, could you kindly help to take a look of this PR?

**Summary** In the previous QLinear implementation, it was assumed that inputs have a dimension of 2. In this update, we have modified QLinear to accept inputs with a dimension greater than 2, incorporating input and output reshaping accordingly. **Test Plan** ``` python -u -m pytest -s -v test_quantized_op.py -k test_qlinear_pt2e ``` cc jgong5 mingfeima XiaobingSuper sanchitintel ashokei jingxu10 voznesenskym penguinwu EikanWang Guobing-Chen zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler [ghstack-poisoned]

leslie-fang-intel · 2023-12-06T01:14:18Z

@pytorchbot merge

pytorchmergebot · 2023-12-06T01:16:11Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…on size exceeds 2 (#113912) **Summary** When decomposing `Linear` to `addmm` or `mm` within Inductor, if the input dimension size exceeds 2, `reshape` nodes are introduced to convert the input into a 2-dimensional form before and after the `addmm` or `mm` node. It is essential to identify and match this pattern during quantization for dequantization promotion. For instance, ``` # quant # + - - - | - - - + # | dequant | # | | | # | reshape | # | / \ | # | node1 node2 | # + - | - - - | - + # reshape reshape # + - | - - - | - + # quant quant ``` In this PR, we mainly do 2 things: - Extend support for the dequantization pattern in QLinear when the input dimension size exceeds 2. - Revise the implementation of the dequant promotion pass, as it now needs to accommodate the matching of four different patterns. **Test Plan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k input_dim_exceeds_2 ``` Pull Request resolved: #113912 Approved by: https://github.com/jgong5, https://github.com/eellison ghstack dependencies: #113733

… size exceeds 2 (#113928) **Summary** Enable the qlinear weight prepack when input dimension size exceeds 2. There are extra reshape node before and after the `addmm` or `mm` node if input dimension size exceeds 2. **Test Plan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k input_dim_exceeds_2 ``` Pull Request resolved: #113928 Approved by: https://github.com/jgong5, https://github.com/eellison ghstack dependencies: #113733, #113912

**Summary** In the previous QLinear implementation, it was assumed that inputs have a dimension of 2. In this update, we have modified QLinear to accept inputs with a dimension greater than 2, incorporating input and output reshaping accordingly. **Test Plan** ``` python -u -m pytest -s -v test_quantized_op.py -k test_qlinear_pt2e ``` Pull Request resolved: pytorch#113733 Approved by: https://github.com/jgong5, https://github.com/eellison

…on size exceeds 2 (pytorch#113912) **Summary** When decomposing `Linear` to `addmm` or `mm` within Inductor, if the input dimension size exceeds 2, `reshape` nodes are introduced to convert the input into a 2-dimensional form before and after the `addmm` or `mm` node. It is essential to identify and match this pattern during quantization for dequantization promotion. For instance, ``` # quant # + - - - | - - - + # | dequant | # | | | # | reshape | # | / \ | # | node1 node2 | # + - | - - - | - + # reshape reshape # + - | - - - | - + # quant quant ``` In this PR, we mainly do 2 things: - Extend support for the dequantization pattern in QLinear when the input dimension size exceeds 2. - Revise the implementation of the dequant promotion pass, as it now needs to accommodate the matching of four different patterns. **Test Plan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k input_dim_exceeds_2 ``` Pull Request resolved: pytorch#113912 Approved by: https://github.com/jgong5, https://github.com/eellison ghstack dependencies: pytorch#113733

… size exceeds 2 (pytorch#113928) **Summary** Enable the qlinear weight prepack when input dimension size exceeds 2. There are extra reshape node before and after the `addmm` or `mm` node if input dimension size exceeds 2. **Test Plan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k input_dim_exceeds_2 ``` Pull Request resolved: pytorch#113928 Approved by: https://github.com/jgong5, https://github.com/eellison ghstack dependencies: pytorch#113733, pytorch#113912

Enable QLinear input with multi dims

c4edda4

[ghstack-poisoned]

leslie-fang-intel requested review from digantdesai, jerryzh168, jianyuh, kimishpatel and salilsdesai as code owners November 15, 2023 02:49

pytorch-bot bot added the release notes: quantization release notes category label Nov 15, 2023

github-actions bot added the module: cpu CPU specific problem (e.g., perf, algorithm) label Nov 15, 2023

leslie-fang-intel added a commit that referenced this pull request Nov 15, 2023

Enable QLinear input with multi dims

b8b25a1

ghstack-source-id: 3950269 Pull Request resolved: #113733

leslie-fang-intel changed the title ~~Enable QLinear input with multi dims~~ [Quant] [PT2] Enable QLinear input with multi dims Nov 15, 2023

leslie-fang-intel requested review from Xia-Weiwen and jgong5 November 15, 2023 02:51

leslie-fang-intel added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 15, 2023

pytorchbot added the open source label Nov 15, 2023

jgong5 requested changes Nov 16, 2023

View reviewed changes

aten/src/ATen/native/quantized/cpu/qlinear.cpp Outdated Show resolved Hide resolved

leslie-fang-intel added a commit that referenced this pull request Nov 16, 2023

Enable QLinear input with multi dims

eb11638

ghstack-source-id: 72b2d8d Pull Request resolved: #113733

github-actions bot added module: inductor ciflow/inductor labels Nov 16, 2023

leslie-fang-intel requested a review from jgong5 November 16, 2023 04:37

leslie-fang-intel mentioned this pull request Nov 17, 2023

[Quant] [Inductor] Enable Dequant Promotion when Linear input dimension size exceeds 2 #113912

Closed

jgong5 approved these changes Nov 17, 2023

View reviewed changes

aten/src/ATen/native/quantized/cpu/qlinear.cpp Outdated Show resolved Hide resolved

leslie-fang-intel mentioned this pull request Nov 17, 2023

[Quant] [Inductor] Enable QLinear weight prepack when input dimension size exceeds 2 #113928

Closed

leslie-fang-intel added a commit to leslie-fang-intel/pytorch that referenced this pull request Nov 17, 2023

Enable QLinear input with multi dims

998611f

ghstack-source-id: b1796d3 Pull Request resolved: pytorch#113733

leslie-fang-intel added a commit to leslie-fang-intel/pytorch that referenced this pull request Nov 20, 2023

Enable QLinear input with multi dims

d37c726

ghstack-source-id: 7fd173b Pull Request resolved: pytorch#113733

leslie-fang-intel requested a review from eellison November 21, 2023 04:37

eellison approved these changes Nov 27, 2023

View reviewed changes

pytorchmergebot added the merging label Dec 6, 2023

pytorchmergebot added the Merged label Dec 6, 2023

pytorchmergebot closed this in 4a624d1 Dec 6, 2023

pytorchmergebot removed the merging label Dec 6, 2023

facebook-github-bot deleted the gh/leslie-fang-intel/41/head branch December 9, 2023 15:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Quant] [PT2] Enable QLinear input with multi dims #113733

[Quant] [PT2] Enable QLinear input with multi dims #113733

Uh oh!

leslie-fang-intel commented Nov 15, 2023 •

edited

Loading

Uh oh!

pytorch-bot bot commented Nov 15, 2023 •

edited

Loading

Uh oh!

leslie-fang-intel commented Nov 15, 2023

Uh oh!

Uh oh!

Uh oh!

leslie-fang-intel commented Nov 21, 2023

Uh oh!

leslie-fang-intel commented Dec 6, 2023

Uh oh!

pytorchmergebot commented Dec 6, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[Quant] [PT2] Enable QLinear input with multi dims #113733

[Quant] [PT2] Enable QLinear input with multi dims #113733

Uh oh!

Conversation

leslie-fang-intel commented Nov 15, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 15, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/113733

✅ No Failures

Uh oh!

leslie-fang-intel commented Nov 15, 2023

Uh oh!

Uh oh!

Uh oh!

leslie-fang-intel commented Nov 21, 2023

Uh oh!

leslie-fang-intel commented Dec 6, 2023

Uh oh!

pytorchmergebot commented Dec 6, 2023

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

leslie-fang-intel commented Nov 15, 2023 •

edited

Loading

pytorch-bot bot commented Nov 15, 2023 •

edited

Loading