[Quant] [PT2] Enable Decomposed quant per tensor/channel to accept bfloat16 input #112225
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/112225
Note: Links to docs will display an error until the docs builds have been completed.
✅ You can merge normally! (1 Unrelated Failure)
As of commit 89133ba with merge base 871e27a: UNSTABLE - the following job failed, but it was likely due to flakiness present on trunk and has been marked as unstable.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
ghstack-source-id: 5ffd0718d48982432c98ff852d90d65021b83ec9 Pull Request resolved: #112225
…float16 input" **Summary** Enable `decomposed quant_per_tensor` and `quant_per_channel` accepts bfloat16 input. **TestPlan** ``` python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_tensor_bfloat16_input python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_channel_bfloat16_input ``` [ghstack-poisoned]
ghstack-source-id: 68b017b5a02de389cc8e238c4b71089c8051ac18 Pull Request resolved: #112225
…o accept bfloat16 input" **Summary** - PR 4 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor #111640. - Enable `decomposed quant_per_tensor` and `quant_per_channel` accepts bfloat16 input. **TestPlan** ``` python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_tensor_bfloat16_input python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_channel_bfloat16_input ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler [ghstack-poisoned]
ghstack-source-id: dfcf5ff1baa403210254d5b5f9a1cd1b92b51816 Pull Request resolved: pytorch#112225
…o accept bfloat16 input" **Summary** - PR 4 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor #111640. - Enable `decomposed quant_per_tensor` and `quant_per_channel` accepts bfloat16 input. **TestPlan** ``` python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_tensor_bfloat16_input python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_channel_bfloat16_input ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler [ghstack-poisoned]
ghstack-source-id: 938f4dc617b28a2e66eac36cec326596ed7e50c9 Pull Request resolved: pytorch#112225
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Merge failed
Reason: 1 job has failed, first few of them are: trunk / macos-12-py3-arm64 / test (default, 1, 3, macos-m1-12)
Details for Dev Infra team: raised by workflow job.
…o accept bfloat16 input" **Summary** - PR 4 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor #111640. - Enable `decomposed quant_per_tensor` and `quant_per_channel` accepts bfloat16 input. **TestPlan** ``` python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_tensor_bfloat16_input python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_channel_bfloat16_input ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler [ghstack-poisoned]
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
…loat16 input (pytorch#112225)

**Summary**
- PR 4 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor pytorch#111640.
- Enable `decomposed quant_per_tensor` and `quant_per_channel` to accept bfloat16 input.

**TestPlan**
```
python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_tensor_bfloat16_input
python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_channel_bfloat16_input
```

Pull Request resolved: pytorch#112225
Approved by: https://github.com/jgong5, https://github.com/jerryzh168
Stack from ghstack (oldest at bottom):
Summary

Enable `decomposed quant_per_tensor` and `quant_per_channel` to accept bfloat16 input.

TestPlan

```
python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_tensor_bfloat16_input
python -m pytest test_quantized_tensor.py -k test_decomposed_quantize_per_channel_bfloat16_input
```
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler
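For context, a minimal sketch of what the new tests exercise, assuming the decomposed ops registered by `torch/ao/quantization/fx/_decomposed.py` (their location at the time of this PR); the shapes, scales, and zero points below are illustrative, not taken from the PR:

```python
import torch
# Importing this module registers the quantized_decomposed ops
# (assumed registration path; it may differ across PyTorch versions).
import torch.ao.quantization.fx._decomposed  # noqa: F401

# A bfloat16 activation; before this PR the decomposed ops only
# accepted float32 input, so callers had to upcast first.
x = torch.randn(2, 4, dtype=torch.bfloat16)

# Per-tensor quantization: one scale/zero_point for the whole tensor.
q = torch.ops.quantized_decomposed.quantize_per_tensor(
    x, 0.05, 0, -128, 127, torch.int8
)

# Per-channel quantization: one scale/zero_point per slice along axis=1.
scales = torch.full((4,), 0.05)
zero_points = torch.zeros(4, dtype=torch.int64)
qc = torch.ops.quantized_decomposed.quantize_per_channel(
    x, scales, zero_points, 1, -128, 127, torch.int8
)

print(q.dtype, qc.dtype)  # torch.int8 torch.int8
```

Both ops still return plain int8 tensors; the observable change is only that the bfloat16-to-int8 path no longer requires an explicit upcast of the input to float32 before quantizing.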