fx quant: do not insert observers at quantized inputs #49239
Conversation
Summary:

Context: the existing implementation of `quantized_input_idxs` is convert-only, so observers are inserted between the input and the first quantized node. This is a problem during QAT, because the initial input observer is a fake_quant that starts with scale=1 and zero_point=0. This does not match the quantization parameters of the already-quantized graph input, which can lead to incorrect numerics.

Fix: do not insert an observer for a quantized input.

Test Plan:

```
python test/test_quantization.py TestQuantizeFx
```
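To illustrate the numerics problem, here is a minimal pure-Python sketch of affine fake-quantization (the `fake_quant` helper and the example scale/zero_point values are hypothetical, not from the PR). A freshly initialized fake_quant with scale=1, zero_point=0 distorts an input that was quantized upstream with different parameters, while re-quantizing with the matching parameters is a no-op:

```python
def fake_quant(x, scale, zero_point, qmin=0, qmax=255):
    """Affine fake-quantize: round onto the quantized grid, clamp, dequantize."""
    q = round(x / scale) + zero_point
    q = max(qmin, min(qmax, q))          # clamp to the quantized range
    return (q - zero_point) * scale

# Input already quantized upstream with (assumed) scale=0.05, zero_point=128.
scale, zp = 0.05, 128
x = fake_quant(-1.23, scale, zp)         # lands on the quantized grid: -1.25

# Re-observing with a freshly initialized fake_quant (scale=1, zero_point=0):
bad = fake_quant(x, 1.0, 0)              # negative values clamp to 0 -> 0.0
good = fake_quant(x, scale, zp)          # matching params round-trip exactly

print(bad, good)
```

With mismatched parameters the negative input collapses to 0.0, exactly the kind of incorrect numerics the fix avoids by not inserting an observer at a quantized input.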
LG
Codecov Report

```
@@             Coverage Diff             @@
##   gh/vkuzo/187/base   #49239   +/-   ##
==========================================
  Coverage      80.62%    80.62%
==========================================
  Files           1875      1875
  Lines         202790    202797      +7
==========================================
+ Hits          163501    163510      +9
+ Misses         39289     39287      -2
```
This pull request has been merged in 7542076.
Summary: Pull Request resolved: pytorch#49239

Context: the existing implementation of `quantized_input_idxs` is convert-only. Therefore, observers are inserted between the input and the first quantized node. This is a problem during QAT, because the initial input is a fake_quant, and it starts with scale=1 and zp=0. This does not match the quantization parameters of the graph input, which can lead to incorrect numerics.

Fix: do not insert an observer for a quantized input.

Test Plan:

```
python test/test_quantization.py TestQuantizeFx
```

Imported from OSS

Reviewed By: jerryzh168

Differential Revision: D25499486

fbshipit-source-id: 303b49cc9d95a9fd06fef3b0859c08be34e19d8a
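For context, a caller declares which graph inputs arrive pre-quantized through the prepare-time custom config, and this change makes that declaration take effect at prepare time (including QAT prepare) rather than only at convert. A minimal sketch of such a config; the exact key spelling is an assumption (the PR text calls the feature `quantized_input_idxs`, and some torch versions spell the config key "input_quantized_idxs"), so check your torch version:

```python
# Hypothetical prepare-time custom config (key name assumed, not
# confirmed by this PR's text -- consult your torch version):
prepare_custom_config_dict = {
    # graph input 0 arrives already quantized, so with this fix no
    # observer/fake_quant is inserted between it and the first
    # quantized node during prepare
    "input_quantized_idxs": [0],
}
```

This dict would be passed alongside the qconfig when preparing the model with FX graph mode quantization.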