[quant] Make PerChannelMinMaxObserver scriptable using torch.jit.ignore
#29416
Closed
Conversation
[quant] Make PerChannelMinMaxObserver scriptable using `torch.jit.ignore`

Summary: att

Test Plan: python test/test_quantization.py

Reviewers: pt1quant

[ghstack-poisoned]
jerryzh168 added a commit that referenced this pull request on Nov 9, 2019
Summary: Previously, graph mode quantization only worked for per-tensor quantization; this PR adds support for per-channel quantization as well. Changes include:
- insert per-channel quantization calls
- add support for folding prepacked per-channel quantized weights

Test Plan: a test is not possible until we can script PerChannelObserver, which comes in #29416; we'll add a test in a separate PR after that.

Reviewers: mvz

[ghstack-poisoned]
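For context on the calls being inserted: per-channel quantization in eager mode goes through `torch.quantize_per_channel`, which takes one scale and one zero point per slice along a chosen axis. A minimal sketch (tensor values are illustrative):

```python
import torch

# One (scale, zero_point) pair per output channel along axis 0.
weight = torch.randn(4, 8)                      # e.g. a linear weight with 4 output channels
scales = torch.tensor([0.1, 0.2, 0.1, 0.05])    # one scale per channel
zero_points = torch.zeros(4, dtype=torch.long)  # symmetric: zero point 0 everywhere

qweight = torch.quantize_per_channel(
    weight, scales, zero_points, axis=0, dtype=torch.qint8
)
print(qweight.q_per_channel_scales())  # tensor([0.1000, 0.2000, 0.1000, 0.0500], ...)
```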
ZolotukhinM approved these changes on Nov 13, 2019
Looks good!
facebook-github-bot pushed a commit that referenced this pull request on Nov 19, 2019
[quant] Support per channel quantization in insert_quant_dequant and fold_prepack (#29492)

Summary: Pull Request resolved: #29492

Previously, graph mode quantization only worked for per-tensor quantization; this PR adds support for per-channel quantization as well. Changes include:
- insert per-channel quantization calls (insert_quant_dequant)
- add support for folding prepacked per-channel quantized weights (fold_prepack)

Test Plan: a test is not possible until we can script PerChannelObserver, which comes in #29416; we'll add a test in a separate PR after that.

Imported from OSS

Differential Revision: D18580444

fbshipit-source-id: 347c07f201648ec49f070523642a9170278f8aa4
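For a sense of what the folding operates on: quantized modules run their weights through a prepack op before the quantized kernel consumes them, and fold_prepack hoists that work out of the forward pass. A rough sketch of the prepacking step itself (the choice of `quantized::linear_prepack` and the values here are assumptions for illustration, not the pass):

```python
import torch

# Per-channel quantize a weight, then prepack it for the quantized linear kernel.
weight = torch.randn(4, 8)
scales = torch.full((4,), 0.1)
zero_points = torch.zeros(4, dtype=torch.long)
qweight = torch.quantize_per_channel(
    weight, scales, zero_points, axis=0, dtype=torch.qint8
)

# fold_prepack's goal is to run a call like this once, ahead of time, and store
# the packed result on the module instead of redoing it on every forward.
packed = torch.ops.quantized.linear_prepack(qweight, None)  # bias is optional
```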
This pull request has been merged in b2291d4.
Stack from ghstack:

#29416 [quant] Make PerChannelMinMaxObserver scriptable using `torch.jit.ignore`
Summary:
att
Test Plan:
python test/test_quantization.py
Reviewers:
pt1quant
Differential Revision: D18580906
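The fix named in the title: decorate Python-only helper methods with `@torch.jit.ignore` so that `torch.jit.script` leaves them to the Python interpreter and compiles the rest of the observer. A minimal sketch of the pattern with a toy module (illustrative only; this is not the actual `PerChannelMinMaxObserver` implementation, and which methods get the decorator there is in the diff):

```python
import torch
import torch.nn as nn

class ToyPerChannelObserver(nn.Module):
    """Toy observer recording per-channel min/max (illustrative only)."""

    def __init__(self, ch_axis: int = 0, verbose: bool = False):
        super().__init__()
        self.ch_axis = ch_axis
        self.verbose = verbose
        self.register_buffer("min_val", torch.zeros(0))
        self.register_buffer("max_val", torch.zeros(0))

    @torch.jit.ignore
    def log_state(self) -> None:
        # Python-only code that TorchScript may not compile (f-string over
        # .tolist()). @torch.jit.ignore leaves it as a Python call: scripting
        # succeeds, and the call falls back to the Python interpreter.
        print(f"min={self.min_val.tolist()}, max={self.max_val.tolist()}")

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Record this batch's per-channel min/max over every dimension except
        # the channel axis (a real observer keeps running values).
        dims = [d for d in range(x.dim()) if d != self.ch_axis]
        min_val = torch.amin(x, dim=dims)
        max_val = torch.amax(x, dim=dims)
        self.min_val.resize_(min_val.shape).copy_(min_val)
        self.max_val.resize_(max_val.shape).copy_(max_val)
        if self.verbose:
            self.log_state()
        return x

obs = torch.jit.script(ToyPerChannelObserver(verbose=True))  # scripts despite the ignored method
obs(torch.randn(4, 8))  # prints the per-channel min/max via the ignored method
```

Note the trade-off with this pattern: a scripted module that calls into ignored methods still runs from Python, but it cannot be saved with `torch.jit.save`, so the ignored surface is best kept to debugging and serialization helpers.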