onnx export of per channel fake quantize functions (#42835) #52430

SplitInfinity · 2021-02-18T06:31:46Z

Summary:
Fixes #39502

This PR adds support for exporting `fake_quantize_per_channel_affine` to a pair of QuantizeLinear and DequantizeLinear nodes. Per-tensor support was added by PR #39738.

The `axis` attribute of QuantizeLinear and DequantizeLinear, which per-channel support requires, was introduced in opset 13 by onnx/onnx#2772.
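To illustrate what the QuantizeLinear/DequantizeLinear pair computes when each channel has its own scale and zero point, here is a minimal pure-Python sketch. It assumes a 2-D input whose rows are the channels (that is, `axis=0`) and an int8 range of [-128, 127]; the function names are illustrative and are not part of PyTorch or ONNX.

```python
def quantize_linear(x, scales, zero_points, qmin=-128, qmax=127):
    """Quantize each row of x with its own scale and zero point.

    Python's round() rounds halves to even, which matches the rounding
    mode that the ONNX QuantizeLinear operator specifies.
    """
    return [
        [min(qmax, max(qmin, round(v / s) + zp)) for v in row]
        for row, s, zp in zip(x, scales, zero_points)
    ]


def dequantize_linear(q, scales, zero_points):
    """Map quantized integers back to floats, again one row per channel."""
    return [
        [(v - zp) * s for v in row]
        for row, s, zp in zip(q, scales, zero_points)
    ]


# Two channels holding the same values but using different scales.
x = [[0.5, -1.0, 3.0], [0.5, -1.0, 3.0]]
scales = [0.5, 0.01]
zero_points = [0, 0]

q = quantize_linear(x, scales, zero_points)
fake_quantized = dequantize_linear(q, scales, zero_points)
print(q)               # the last value of the second channel saturates at 127
print(fake_quantized)
```

With a single shared scale both rows would quantize identically; the per-channel scales are what make the second row saturate while the first does not.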

[update 1/20/2021]: opset 13 is now supported on master, so the added function is properly tested. The code has also been rebased onto the new master.

The function was also tested offline with the following code:

```python
import torch
from torch import quantization
from torchvision import models

# Start from a pretrained ResNet-18 on the GPU.
qat_resnet18 = models.resnet18(pretrained=True).eval().cuda()

# Per-tensor fake quant for activations, per-channel fake quant for weights.
qat_resnet18.qconfig = quantization.QConfig(
    activation=quantization.default_fake_quant,
    weight=quantization.default_per_channel_weight_fake_quant)
quantization.prepare_qat(qat_resnet18, inplace=True)
qat_resnet18.apply(quantization.enable_observer)
qat_resnet18.apply(quantization.enable_fake_quant)

# Run one forward pass so the observers record ranges, then freeze them.
dummy_input = torch.randn(16, 3, 224, 224).cuda()
_ = qat_resnet18(dummy_input)
for module in qat_resnet18.modules():
    if isinstance(module, quantization.FakeQuantize):
        module.calculate_qparams()
qat_resnet18.apply(quantization.disable_observer)

input_names = ["actual_input_1"]
output_names = ["output1"]

# opset 13 is needed for the axis attribute used by per-channel export.
torch.onnx.export(qat_resnet18, dummy_input, "quant_model.onnx", verbose=True,
                  opset_version=13, input_names=input_names,
                  output_names=output_names)
```

This script generates the desired graph.
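One way to check the exported graph is to count the QuantizeLinear and DequantizeLinear nodes and see how many carry an explicit `axis` attribute. The sketch below is an assumption about how such a check could look and is not part of this PR. The helper names `summarize_qdq_nodes` and `summarize_file` are hypothetical, and `summarize_file` assumes the `onnx` Python package is installed.

```python
from collections import Counter

QDQ_OPS = ("QuantizeLinear", "DequantizeLinear")


def summarize_qdq_nodes(graph):
    """Count Q/DQ nodes and how many of them set an `axis` attribute.

    `graph` is anything shaped like onnx.GraphProto: it exposes `.node`,
    and each node has `.op_type` and `.attribute` (items with `.name`).
    """
    counts = Counter()
    for node in graph.node:
        if node.op_type in QDQ_OPS:
            counts[node.op_type] += 1
            if any(attr.name == "axis" for attr in node.attribute):
                counts[node.op_type + " with axis"] += 1
    return dict(counts)


def summarize_file(path):
    """Load an ONNX file and summarize its Q/DQ nodes (requires `onnx`)."""
    import onnx  # imported lazily so the helper above works without it

    return summarize_qdq_nodes(onnx.load(path).graph)
```

Calling `summarize_file("quant_model.onnx")` on the file written by the script above should report Q/DQ nodes for both activations and weights. The expectation is that the per-channel weight nodes are the ones with an explicit `axis`.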

Pull Request resolved: #42835

Reviewed By: houseroad

Differential Revision: D26293823

Pulled By: SplitInfinity

fbshipit-source-id: 300498a2e24b7731b12fa2fbdea4e73dde80e7ea


facebook-github-bot · 2021-02-18T06:31:59Z

💊 CI failures summary and remediations

As of commit 1cf3ff6 (more details on the Dr. CI page):

2/2 failures possibly* introduced in this PR
- 2/2 non-CircleCI failure(s)

ci.pytorch.org: 1 failed

Failed: pr/pytorch-linux-bionic-rocm4.0.1-py3.6

This comment was automatically generated by Dr. CI (expand for details).

codecov · 2021-02-18T11:32:02Z

Codecov Report

Merging #52430 (1cf3ff6) into release/1.8 (f7c4afc) will increase coverage by 0.33%.
The diff coverage is 41.93%.

@@               Coverage Diff               @@
##           release/1.8   #52430      +/-   ##
===============================================
+ Coverage        80.48%   80.82%   +0.33%     
===============================================
  Files             1948     1948              
  Lines           213198   213229      +31     
===============================================
+ Hits            171600   172335     +735     
+ Misses           41598    40894     -704

facebook-github-bot added the cla signed label Feb 18, 2021

SplitInfinity mentioned this pull request Feb 18, 2021

[v.1.8.0] Release Tracker #51886

Closed

malfet approved these changes Feb 18, 2021

malfet merged commit 32758d3 into release/1.8 Feb 18, 2021

github-actions bot deleted the 42835-cherry-pick branch February 10, 2024 01:57
