[Quant] Bug fix #107899

kimishpatel · 2023-08-24T21:45:04Z

Stack from ghstack (oldest at bottom):

Summary:
When two layers are quantized with different configs, the observer map
update uses the key (observed_node, node), whereas it should use
(original_input, node).

Test Plan:
The next diff in the stack adds a test that fails without this fix.


Differential Revision: D48663145
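
The keying issue described in the summary can be illustrated with a toy model. This is a minimal sketch, not the code in `prepare.py`: the `Node` class, the `strip_observers` and `edge_key` helpers, the node names, and the string standing in for an observer are all hypothetical. It assumes the map is keyed by the edge (original_input, node), so a lookup that uses the already-observed input misses the entry.

```python
from dataclasses import dataclass


@dataclass(eq=False)  # eq=False keeps identity-based hashing, so nodes can be dict keys
class Node:
    """Hypothetical stand-in for a graph node."""
    name: str
    args: tuple = ()
    is_observer: bool = False


def strip_observers(arg: Node) -> Node:
    """Walk back through observer nodes to reach the original producer."""
    while arg.is_observer:
        arg = arg.args[0]
    return arg


def edge_key(arg: Node, user: Node) -> tuple:
    """Key for the edge into `user`, expressed in terms of the original input."""
    return (strip_observers(arg), user)


# Layer 1 already has an output observer; layer 2 uses a different config.
conv = Node("conv2d")
conv_out_obs = Node("conv2d_output_observer", args=(conv,), is_observer=True)
pool = Node("avg_pool", args=(conv_out_obs,))

# The edge is recorded against the original input, not the observer.
obs_map = {(conv, pool): "observer_for_pool_input"}

naive_key = (conv_out_obs, pool)          # (observed_node, node)
fixed_key = edge_key(conv_out_obs, pool)  # (original_input, node)

print(naive_key in obs_map)  # the observed-node key misses the entry
print(fixed_key in obs_map)  # the original-input key finds it
```

Keying on the original input keeps the map consistent with how the edge was first recorded, regardless of how many observers have since been inserted in front of the consumer.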


pytorch-bot · 2023-08-24T21:45:06Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/107899

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit b25f1b6 with merge base 0f1a225:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

kimishpatel · 2023-08-24T21:50:17Z

@kimishpatel has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

jerryzh168 · 2023-08-24T22:11:05Z

torch/ao/quantization/pt2e/prepare.py

@@ -92,6 +92,16 @@ def _maybe_insert_input_observer_for_arg_or_kwarg(
            new_obs_node = _insert_obs_or_fq(
                arg, arg_as_input_act_obs_or_fq, model, named_modules, model.graph)  # type: ignore[arg-type]
            new_arg = new_obs_node
+            # When quantizing two layers with different configs we can have


jerryzh168: Can you add a test for this? Surprised that this was not caught earlier.

kimishpatel: I didn't add a separate test because the PR after this one fails for exactly this reason. Also, we haven't tested quantizing different layers with different configs (which is what the next diff does), hence we did not catch it.

jerryzh168: OK, sounds good.


kimishpatel · 2023-08-31T16:03:07Z

@kimishpatel has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

kimishpatel · 2023-08-31T17:28:33Z

@kimishpatel has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

kimishpatel · 2023-08-31T23:30:46Z

@kimishpatel has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

kimishpatel · 2023-09-01T00:49:34Z

@kimishpatel has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.


kimishpatel · 2023-09-01T15:40:54Z

@kimishpatel has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

kimishpatel · 2023-09-01T20:20:35Z

@kimishpatel has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Summary:
During the convert step, observers are first replaced by a Q-DQ pair. In some
scenarios, such as the following, the output DQ has a fan-out:

                 ---> OP2 -> Q -> DQ
                /
OP -> Q -> DQ -
                \
                 ---> OP3 -> Q -> DQ

If either OP2 or OP3 is configured to be quantized, then its input is expected
to be quantized. In this case the quantized equivalent of a pattern that the
quantizer asked to be quantized should look like [DQ -> {pattern} -> Q].
However, in a scenario like the one above, where a DQ node is shared between
multiple "quantized" patterns, the boundary of each "quantized" pattern is
unclear because the DQ now belongs to multiple quantized patterns. This poses
a challenge for:
- Porting metadata: which "quantized" partition does this DQ node belong to?
- Quantized representation, which likewise needs to identify a self-contained
  quantized pattern that is replaced by an equivalent pattern that captures
  the compute in quantized precision.

Test Plan:
test_duplicate_dq_pass

Differential Revision: [D48663147](https://our.internmc.facebook.com/intern/diff/D48663147)
Pull Request resolved: #107900
Approved by: https://github.com/jerryzh168, https://github.com/andrewor14, https://github.com/leslie-fang-intel
ghstack dependencies: #107105, #107106, #107899
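
The shared-DQ problem in the commit message above can be sketched in a few lines. This is a toy model in plain Python, not the actual pass: the `Node` class and the `users_of` and `duplicate_shared_dq` helpers are hypothetical, and the approach of giving each consumer its own DQ copy is an assumption inferred from the test name `test_duplicate_dq_pass`.

```python
from dataclasses import dataclass, field


@dataclass(eq=False)  # identity-based equality, so `in` compares node identity
class Node:
    """Hypothetical stand-in for a graph node."""
    name: str
    op: str  # "op", "q", or "dq"
    inputs: list = field(default_factory=list)


def users_of(graph, node):
    """Return every node in `graph` that consumes `node`."""
    return [n for n in graph if node in n.inputs]


def duplicate_shared_dq(graph):
    """Give each consumer of a shared DQ node its own private copy."""
    result = list(graph)
    for node in graph:
        if node.op != "dq":
            continue
        users = users_of(result, node)
        # The first user keeps the original DQ; the others get clones.
        for i, user in enumerate(users[1:], start=1):
            clone = Node(f"{node.name}_{i}", "dq", list(node.inputs))
            user.inputs = [clone if inp is node else inp for inp in user.inputs]
            result.insert(result.index(user), clone)  # keep topological order
    return result


# OP -> Q -> DQ, with the DQ fanning out to OP2 and OP3.
op = Node("op", "op")
q = Node("q", "q", [op])
dq = Node("dq", "dq", [q])
op2 = Node("op2", "op", [dq])
op3 = Node("op3", "op", [dq])

new_graph = duplicate_shared_dq([op, q, dq, op2, op3])
print([n.name for n in new_graph])
```

After the rewrite, each DQ feeds exactly one consumer, so a [DQ -> {pattern} -> Q] region can be identified without any node belonging to two patterns.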

…107107)

Summary:
This diff adds metadata to Q/DQ nodes by inferring the quantization intent
from node annotations. Annotations on a node are a way for the user to specify
how a node or subgraph is supposed to be quantized. We continue to use that
information to copy metadata onto Q/DQ nodes from the appropriate nodes.

Differential Revision: [D48488416](https://our.internmc.facebook.com/intern/diff/D48488416)
Pull Request resolved: #107107
Approved by: https://github.com/jerryzh168
ghstack dependencies: #107105, #107106, #107899, #107900

pytorch-bot bot added the release notes: AO frontend label Aug 24, 2023

github-actions bot added the release notes: quantization release notes category label Aug 24, 2023

jerryzh168 reviewed Aug 24, 2023

View reviewed changes

jerryzh168 approved these changes Aug 25, 2023

View reviewed changes

kimishpatel added 3 commits August 30, 2023 11:29

pytorchmergebot added the Merged label Sep 2, 2023

pytorchmergebot closed this in f8d1ca9 Sep 2, 2023

facebook-github-bot deleted the gh/kimishpatel/176/head branch September 5, 2023 14:22
