
QuantTensor quantization invariant op on per tensor vs per channel #728

Open · volcacius opened this issue Oct 18, 2023 · 1 comment
Labels: good first issue (Good for newcomers)

@volcacius (Contributor)

Currently we don't check whether a QuantTensor is per-channel or per-tensor when we apply certain ops, like flatten or shuffle/unshuffle, that are sensitive to quantization granularity.

@ScXfjiang

A PyTorch Tensor has a qscheme attribute that specifies how the tensor is quantized (demonstrated in the snippet after this list), which includes:

  • torch.per_tensor_affine
  • torch.per_tensor_symmetric
  • torch.per_channel_affine
  • torch.per_channel_symmetric

In Brevitas, if granularity is our only concern so far, maybe we can assess it by examining the shape of the quantization parameters (scale, zero point), as in the sketch after this list:

  • shape [1] --> per_tensor quantization
  • otherwise --> per_channel quantization

@ScXfjiang mentioned this issue in #891, Apr 21, 2024