[quant][pt2][be] Rewrite QAT annotations using subgraph matcher #113709

andrewor14 · 2023-11-14T23:03:52Z

Stack from ghstack (oldest at bottom):

Summary: This is the recommended way to write quantizers according
to https://pytorch.org/tutorials/prototype/pt2e_quantizer.html#a-note-on-ir-for-pt2e-quantization-flow.
It is agnostic to changes in the aten IR and can be easily extended
to support conv1d-bn and conv3d-bn fusion patterns in the future.
This is the first step towards rewriting XNNPACKQuantizer using
this subgraph matcher.

Test Plan:
python test/test_quantization.py TestQuantizePT2EQAT_ConvBn2d

Reviewers: jerryzh168, kimishpatel

Subscribers: jerryzh168, kimishpatel, supriyar

Differential Revision: D51366525

Summary: This is the recommended way to write quantizers according to https://pytorch.org/tutorials/prototype/pt2e_quantizer.html#a-note-on-ir-for-pt2e-quantization-flow. It is agnostic to changes in the aten IR and can be easily extended to support conv1d-bn and conv3d-bn fusion patterns in the future. This is the first step towards rewriting XNNPACKQuantizer using this subgraph matcher. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT_ConvBn2d Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel, supriyar [ghstack-poisoned]

pytorch-bot · 2023-11-14T23:03:55Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/113709

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 89df0e4 with merge base 05d9492 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary: This is the recommended way to write quantizers according to https://pytorch.org/tutorials/prototype/pt2e_quantizer.html#a-note-on-ir-for-pt2e-quantization-flow. It is agnostic to changes in the aten IR and can be easily extended to support conv1d-bn and conv3d-bn fusion patterns in the future. This is the first step towards rewriting XNNPACKQuantizer using this subgraph matcher. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT_ConvBn2d Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel, supriyar ghstack-source-id: de45e114d20c21064b5d0869aed8336c70a423e8 Pull Request resolved: #113709

…tcher" Summary: This is the recommended way to write quantizers according to https://pytorch.org/tutorials/prototype/pt2e_quantizer.html#a-note-on-ir-for-pt2e-quantization-flow. It is agnostic to changes in the aten IR and can be easily extended to support conv1d-bn and conv3d-bn fusion patterns in the future. This is the first step towards rewriting XNNPACKQuantizer using this subgraph matcher. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT_ConvBn2d Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel, supriyar [ghstack-poisoned]

Summary: This is the recommended way to write quantizers according to https://pytorch.org/tutorials/prototype/pt2e_quantizer.html#a-note-on-ir-for-pt2e-quantization-flow. It is agnostic to changes in the aten IR and can be easily extended to support conv1d-bn and conv3d-bn fusion patterns in the future. This is the first step towards rewriting XNNPACKQuantizer using this subgraph matcher. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT_ConvBn2d Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel, supriyar ghstack-source-id: d6797c849f4a2ceda94e3d95b4d908d9c746be99 Pull Request resolved: #113709

torch/ao/quantization/quantizer/xnnpack_quantizer_utils.py

…tcher" Summary: This is the recommended way to write quantizers according to https://pytorch.org/tutorials/prototype/pt2e_quantizer.html#a-note-on-ir-for-pt2e-quantization-flow. It is agnostic to changes in the aten IR and can be easily extended to support conv1d-bn and conv3d-bn fusion patterns in the future. This is the first step towards rewriting XNNPACKQuantizer using this subgraph matcher. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT_ConvBn2d Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel, supriyar [ghstack-poisoned]

jerryzh168 · 2023-11-15T18:43:45Z

torch/ao/quantization/quantizer/xnnpack_quantizer_utils.py

+        weight_user = list(weight_node.users.keys())[0]
+        if weight_user is not input_user:
+            raise ValueError("Expected weight user to be the same as input user")
+        conv_node = input_user


This still assumes conv_node is not decomposed. (when it's decomposed, input_user, weight_user and bias_user could be different nodes, e.g. linear is a good example)

I think you'll need to annotate input_user, weight_user and bias_user separately to be 100% robust to decompositions of the conv op

As discussed offline, we will just use the conv_node returned from the subgraph matcher here and just validate the conv args for now. This is because we can't always rely on the first user of the input node, for example in resnet18 where there are skip connections

…tcher" Summary: This is the recommended way to write quantizers according to https://pytorch.org/tutorials/prototype/pt2e_quantizer.html#a-note-on-ir-for-pt2e-quantization-flow. It is agnostic to changes in the aten IR and can be easily extended to support conv1d-bn and conv3d-bn fusion patterns in the future. This is the first step towards rewriting XNNPACKQuantizer using this subgraph matcher. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT_ConvBn2d Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel, supriyar [ghstack-poisoned]

andrewor14 · 2023-11-15T20:15:19Z

@andrewor14 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

…tcher" Summary: This is the recommended way to write quantizers according to https://pytorch.org/tutorials/prototype/pt2e_quantizer.html#a-note-on-ir-for-pt2e-quantization-flow. It is agnostic to changes in the aten IR and can be easily extended to support conv1d-bn and conv3d-bn fusion patterns in the future. This is the first step towards rewriting XNNPACKQuantizer using this subgraph matcher. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT_ConvBn2d Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel, supriyar Differential Revision: [D51366525](https://our.internmc.facebook.com/intern/diff/D51366525) [ghstack-poisoned]

andrewor14 · 2023-11-15T21:44:46Z

@andrewor14 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

jerryzh168

looks good, thanks for addressing the comments!

andrewor14 · 2023-11-16T01:23:25Z

@pytorchbot merge

pytorchmergebot · 2023-11-16T01:25:27Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

andrewor14 mentioned this pull request Nov 14, 2023

[quant][pt2][be] Refactor QAT tests for future patterns #113658

Closed

pytorch-bot bot added the release notes: AO frontend label Nov 14, 2023

github-actions bot added the release notes: quantization release notes category label Nov 14, 2023

andrewor14 requested a review from jerryzh168 November 14, 2023 23:21

andrewor14 mentioned this pull request Nov 14, 2023

[quant][pt2] Support conv1d-bn QAT fusion #113714

Closed

jerryzh168 reviewed Nov 14, 2023

View reviewed changes

torch/ao/quantization/quantizer/xnnpack_quantizer_utils.py Outdated Show resolved Hide resolved

jerryzh168 reviewed Nov 14, 2023

View reviewed changes

torch/ao/quantization/quantizer/xnnpack_quantizer_utils.py Show resolved Hide resolved

andrewor14 added 2 commits November 14, 2023 15:48

jerryzh168 reviewed Nov 15, 2023

View reviewed changes

andrewor14 added 2 commits November 15, 2023 11:58

jerryzh168 approved these changes Nov 16, 2023

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 16, 2023

pytorchmergebot added the merging label Nov 16, 2023

pytorchmergebot added Merged and removed merging labels Nov 16, 2023

pytorchmergebot closed this in 8241fe6 Nov 16, 2023

facebook-github-bot deleted the gh/andrewor14/43/head branch November 19, 2023 15:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[quant][pt2][be] Rewrite QAT annotations using subgraph matcher #113709

[quant][pt2][be] Rewrite QAT annotations using subgraph matcher #113709

andrewor14 commented Nov 14, 2023 •

edited

pytorch-bot bot commented Nov 14, 2023 •

edited

jerryzh168 Nov 15, 2023

andrewor14 Nov 15, 2023

andrewor14 commented Nov 15, 2023

andrewor14 commented Nov 15, 2023

jerryzh168 left a comment

andrewor14 commented Nov 16, 2023

pytorchmergebot commented Nov 16, 2023

[quant][pt2][be] Rewrite QAT annotations using subgraph matcher #113709

[quant][pt2][be] Rewrite QAT annotations using subgraph matcher #113709

Conversation

andrewor14 commented Nov 14, 2023 • edited

pytorch-bot bot commented Nov 14, 2023 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/113709

✅ No Failures

jerryzh168 Nov 15, 2023

Choose a reason for hiding this comment

andrewor14 Nov 15, 2023

Choose a reason for hiding this comment

andrewor14 commented Nov 15, 2023

andrewor14 commented Nov 15, 2023

jerryzh168 left a comment

Choose a reason for hiding this comment

andrewor14 commented Nov 16, 2023

pytorchmergebot commented Nov 16, 2023

Merge started

andrewor14 commented Nov 14, 2023 •

edited

pytorch-bot bot commented Nov 14, 2023 •

edited