[quant][graphmode][fx] Merge all quantization modes #45292
Conversation
Summary: This PR merges all quantization modes and will expose only the following top-level functions:
```
prepare_fx
prepare_qat_fx
convert_fx
```
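For orientation, here is a minimal sketch of post-training quantization through the merged surface. The `torch.quantization.quantize_fx` module path and the `qconfig_dict` format are assumptions based on the FX graph mode API of this period, and the model and calibration loop are placeholders, not code from this PR.

```python
import torch
from torch.quantization import get_default_qconfig
from torch.quantization.quantize_fx import convert_fx, prepare_fx

# Placeholder model; any FX-traceable torch.nn.Module works here.
model = torch.nn.Sequential(torch.nn.Linear(16, 16), torch.nn.ReLU()).eval()

# A single qconfig_dict drives every quantization mode; "" is the global key.
qconfig_dict = {"": get_default_qconfig("fbgemm")}

prepared = prepare_fx(model, qconfig_dict)  # insert observers

# Calibrate with representative inputs (placeholder loop).
with torch.no_grad():
    for _ in range(8):
        prepared(torch.randn(4, 16))

quantized = convert_fx(prepared)  # swap in quantized ops
```

As the summary suggests, dynamic and weight-only modes would go through the same two calls, differing only in the qconfig.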
💊 CI failures summary and remediations. As of commit 1d191e0 (more details on the Dr. CI page): commit 1d191e0 was recently pushed; waiting for builds. This comment was automatically generated by Dr. CI.
looks awesome, the new names are much easier to read!
torch/quantization/fx/utils.py
```
@@ -138,3 +138,38 @@ def get_next_qparams_idx(module, qparams):
         qparam_full_path = key + str(idx)
         inputs.append(graph.create_node('get_attr', qparam_full_path))
     return graph.create_node('call_function', quantize_op, tuple(inputs), {})
+
+def activation_is_dynamically_quantized(qconfig):
```
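For readers without the full diff, here is a hedged sketch of the helper family this hunk belongs to; only `activation_is_dynamically_quantized` appears in the hunk above, and the other names are reconstructed from the review discussion below, not copied from the PR.

```python
import torch

def activation_dtype(qconfig):
    # The qconfig holds an observer constructor; instantiating it exposes
    # the dtype the activation will be quantized to.
    assert qconfig is not None
    return qconfig.activation().dtype

def activation_is_statically_quantized(qconfig):
    # Static quantization: activations are observed and quantized to an
    # integer dtype ahead of time.
    return activation_dtype(qconfig) in [torch.quint8, torch.qint8]

def activation_is_dynamically_quantized(qconfig):
    # Sketch only: float32 activations are what dynamic qconfigs report,
    # but, as the review below notes, weight-only qconfigs report the same.
    return activation_dtype(qconfig) == torch.float32
```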
looks super clean, thanks!
```
if self.is_dynamic_quant:
    # don't need to insert observer for output if activation does not
    # need to be statically quantized
    if not activation_is_statically_quantized(qconfig):
```
Can we use activation_is_dynamically_quantized instead here for better readability?
I'm planning to remove that check, since it matches both dynamic and weight-only dtypes.
Not sure I follow. What is the check we will use for dynamic quant in that case?
activation_is_dynamically_quantized checks for a float32 activation dtype, which can indicate either dynamic or weight-only quantization.
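To make the ambiguity concrete, here are two hypothetical qconfigs (illustrative, not from the PR) that both report float32 activations, so a dtype check alone cannot separate dynamic from weight-only quantization:

```python
import torch
from torch.quantization.qconfig import QConfig
from torch.quantization.observer import (
    PlaceholderObserver,
    default_dynamic_quant_observer,
    default_weight_observer,
)

# Dynamic: activations stay float32 and are quantized on the fly at runtime.
dynamic_qconfig = QConfig(activation=default_dynamic_quant_observer,
                          weight=default_weight_observer)

# Weight-only: activations also stay float32, permanently.
weight_only_qconfig = QConfig(activation=PlaceholderObserver.with_args(dtype=torch.float32),
                              weight=default_weight_observer)

for qc in (dynamic_qconfig, weight_only_qconfig):
    print(qc.activation().dtype)  # torch.float32 in both cases
```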
Will we also be updating the top-level APIs to remove the …
Yes, will do that. I'm waiting for the previous PRs to land to avoid conflicts.
This pull request has been merged in ffcb098.
This seems to break CI on ROCm with an error that looks like it comes from this patch:
It also appears to break pytorch_linux_xenial_cuda9_2_cudnn7_py3_gcc7_test with the same error.
Unlanding. @t-vi nailed it.
…#45292)" Summary: This PR merges all quantization mode and will only expose the following top level functions: ``` prepare_fx prepare_qat_fx convert_fx ``` Test Plan: Imported from OSS Reviewed By: vkuzo Differential Revision: [D24053439](https://our.internmc.facebook.com/intern/diff/D24053439) [ghstack-poisoned]
…45672)
Summary: Pull Request resolved: #45672
This PR merges all quantization modes and will expose only the following top-level functions:
```
prepare_fx
prepare_qat_fx
convert_fx
```
Test Plan: Imported from OSS
Reviewed By: z-a-f
Differential Revision: D24053439
fbshipit-source-id: 03d545e26a36bc22a73349061b751eeb35171e64
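With the reland in, quantization-aware training goes through the same surface. A hedged sketch follows; the model, optimizer settings, and training loop are placeholders, and `get_default_qat_qconfig` is the standard QAT qconfig helper rather than anything introduced by this PR.

```python
import torch
from torch.quantization import get_default_qat_qconfig
from torch.quantization.quantize_fx import convert_fx, prepare_qat_fx

model = torch.nn.Sequential(torch.nn.Linear(16, 16), torch.nn.ReLU()).train()
qconfig_dict = {"": get_default_qat_qconfig("fbgemm")}

prepared = prepare_qat_fx(model, qconfig_dict)  # insert fake-quant modules
optimizer = torch.optim.SGD(prepared.parameters(), lr=1e-3)

for _ in range(8):  # placeholder fine-tuning loop
    loss = prepared(torch.randn(4, 16)).sum()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

quantized = convert_fx(prepared.eval())  # same convert_fx as post-training
```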
Stack from ghstack:
Summary:
This PR merges all quantization modes and will expose only the following top-level functions: `prepare_fx`, `prepare_qat_fx`, and `convert_fx`.
Differential Revision: D23913105