
Quantized Convolution error on Mobile with some scale values. #33466

Closed
supriyar opened this issue Feb 18, 2020 · 5 comments

Comments


supriyar commented Feb 18, 2020

🐛 Bug

QNNPACK throws an error for certain combinations of input and weight scale values. The error occurs when the convolution scale, computed as input_scale * kernel_scale / output_scale, is greater than or equal to 1.0.
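
Plugging in the scales from the repro script below confirms the failing case:

input_scale = 0.052
kernel_scale = 2.39
output_scale = 0.112
conv_scale = input_scale * kernel_scale / output_scale
print(conv_scale)  # ~1.109643, which QNNPACK rejects as >= 1.0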

This problem arises when a model is trained with QAT and run on the QNNPACK mobile backend.

To Reproduce

Script to reproduce the error:

import torch

qconv = torch.ops.quantized.conv2d
qconv_prepack = torch.ops.quantized.conv2d_prepack

strides = (1, 1)
pads = (0, 0)
dilations = (1, 1)
groups = 1


for name in ["fbgemm", "qnnpack"]:
    torch.backends.quantized.engine = name
    print("Running on backend ", name)
    # Scales chosen so that input_scale * kernel_scale / output_scale
    # (0.052 * 2.39 / 0.112 ~= 1.1096) exceeds 1.0.
    x = torch.randn(1, 4, 4, 4)
    qx = torch.quantize_per_tensor(x, scale=0.052, zero_point=0, dtype=torch.quint8)
    weight = torch.randn(2, 4, 2, 2)
    qweight = torch.quantize_per_tensor(weight, scale=2.39, zero_point=0, dtype=torch.qint8)
    # Prepack the quantized weight (bias=None), then run the quantized conv
    # with output_scale=0.112 and output_zero_point=0.
    w_prepack = qconv_prepack(qweight, None, strides, pads, dilations, groups)
    print(qconv(qx, w_prepack, strides, pads, dilations, groups, 0.112, 0))

Expected behavior

Output from FBGEMM:

tensor([[[[0.0000, 0.0000, 0.0000],
          [0.0000, 0.0000, 0.0000],
          [0.0000, 0.0000, 0.0000]],

         [[1.2320, 0.2240, 0.0000],
          [0.0000, 0.0000, 2.6880],
          [0.4480, 0.0000, 0.0000]]]], size=(1, 2, 3, 3), dtype=torch.quint8,
       quantization_scheme=torch.per_tensor_affine, scale=0.112, zero_point=0)

Output from QNNPACK:

Error in QNNPACK: failed to create convolution with 0.052 input scale, 2.39 kernel scale, and 0.112 output scale: convolution scale 1.109643 is greater or equal to 1.0

cc @jerryzh168 @jianyuh @dzhulgakov @raghuramank100 @jamesr66a, @kimishpatel

@kimishpatel

@raghuramank100 @supriyar, isn't it strange to have a kernel scale of 2.39? What does the weight tensor look like?

@supriyar

I believe the scale value is computed as (max - min) / (qmax - qmin). Given that, I wouldn't consider a scale of 2.39 strange. Also, the convolution scale here is about 1.11, only slightly over 1.0. Since FBGEMM supports this, I expect it to be an issue when running models on mobile. Would it be possible to relax the 1.0 requirement a little?
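
For reference, a minimal sketch of that computation (the helper name compute_scale and the [0, 255] quint8 range are my assumptions, not PyTorch's observer code):

import torch

def compute_scale(t, qmin=0, qmax=255):
    # scale = (max - min) / (qmax - qmin); a tensor with a wide enough
    # value range can legitimately produce a scale above 1.0.
    return (t.max().item() - t.min().item()) / (qmax - qmin)

print(compute_scale(torch.tensor([-300.0, 309.45])))  # ~2.39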

@kimishpatel

@supriyar, my bad, I misunderstood this. Let's discuss in person with Raghu why we are running into this. As for fixing this in PyTorch QNNPACK, it needs to be looked at carefully, because it may affect the assumptions made in the requantization logic. (Probably doable if we follow requantization semantics similar to FBGEMM's.)
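
For context, FBGEMM-style requantization can accommodate multipliers at or above 1.0 by splitting the float multiplier into a fixed-point significand and a power-of-two shift. The sketch below (using math.frexp) is an illustration of that idea, not actual QNNPACK or FBGEMM code:

import math

def decompose_multiplier(m):
    # Split m into significand * 2**exponent with significand in [0.5, 1).
    # A multiplier >= 1.0 simply yields a positive exponent (a left shift)
    # rather than a right shift, so the >= 1.0 case is representable.
    significand, exponent = math.frexp(m)
    q31 = round(significand * (1 << 31))  # Q31 fixed-point significand
    return q31, exponent

print(decompose_multiplier(1.109643))  # the convolution scale from this issue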

yf225 added labels on Feb 20, 2020: oncall: quantization (Quantization support in PyTorch), triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module), oncall: mobile (Related to mobile support, including iOS and Android)
@kimishpatel

#37683 and #35856 should resolve this issue.


vkuzo commented Jul 8, 2020

Closing, since this was fixed by @kimishpatel.

vkuzo closed this as completed on Jul 8, 2020