
Conversation

@rahul-tuli (Member) commented Feb 3, 2022
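
Example usage from this PR: apply the `QuantizationModifier` with 4-bit activations to a pretrained ResNet-50, then sanity-check that the quantized model still produces finite outputs and weights.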

import torch
from sparseml.pytorch.models import resnet50
from sparseml.pytorch.optim import QuantizationModifier


# Load a pretrained ResNet-50
model = resnet50(pretrained=True)

# Initialize the modifier to quantize activations to 4 bits, starting at epoch 0
modifier = QuantizationModifier(start_epoch=0, activation_bits=4)

# Apply the modifier to the model
modifier.apply(module=model)

# Sanity check: the quantized model's outputs and weights should remain finite
print(model(torch.rand(1, 3, 224, 224)))

for param in model.parameters():
    print(param.data.min(), param.data.max())

@rahul-tuli rahul-tuli requested a review from bfineran February 4, 2022 19:26
@rahul-tuli rahul-tuli self-assigned this Feb 4, 2022
@rahul-tuli rahul-tuli marked this pull request as draft February 4, 2022 19:34
@rahul-tuli rahul-tuli requested a review from mgoin February 4, 2022 19:34
@bfineran (Contributor) left a comment

As discussed, we need modifier properties to control both the weight and activation qconfig kwargs.
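
For illustration only, a minimal sketch of what such properties might look like on the modifier. The parameter names `activation_qconfig_kwargs` and `weight_qconfig_kwargs` and the observer kwargs shown are assumptions based on this discussion, not the final merged API:

# Illustrative sketch -- assumes the modifier exposes `activation_qconfig_kwargs`
# and `weight_qconfig_kwargs` as discussed; the exact names and values here are
# hypothetical, not the merged implementation.
from sparseml.pytorch.optim import QuantizationModifier

modifier = QuantizationModifier(
    start_epoch=0,
    activation_bits=4,
    # forwarded to the activation observer / fake-quantize construction
    activation_qconfig_kwargs={"reduce_range": False},
    # forwarded to the weight observer / fake-quantize construction
    weight_qconfig_kwargs={"quant_min": -8, "quant_max": 7},
)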

@bfineran (Contributor) commented:

@rahul-tuli pushed the discussed fix for propagating observer quant range

@rahul-tuli (Member, Author) replied:

> @rahul-tuli pushed the discussed fix for propagating observer quant range

Thanks, incorporated the requested changes!

Fix: tests, test helper names, style
Fix - Merge Conflicts
Renamed - `enable_in4_activations` to `int4_activations`
Use: A copy of `activation_qconfig_kwargs` for overriding quant_max, quant_min values
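
Roughly, the idea (a sketch under assumptions, not the merged code; the helper name is hypothetical) is to derive `quant_min`/`quant_max` from the requested activation bit width and write them into a copy of `activation_qconfig_kwargs`, leaving the caller's dict untouched:

# Hypothetical helper illustrating the described behavior.
def _activation_observer_kwargs(activation_bits, activation_qconfig_kwargs=None):
    # work on a copy so the user-supplied kwargs are never mutated
    kwargs = dict(activation_qconfig_kwargs or {})
    if activation_bits is not None:
        # unsigned activation range, e.g. 4 bits -> [0, 15]
        kwargs["quant_min"] = 0
        kwargs["quant_max"] = 2 ** activation_bits - 1
    return kwargs
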
@bfineran (Contributor) left a comment

Looks close! Left a few comments, then good from my end.

@bfineran bfineran marked this pull request as ready for review February 18, 2022 14:58
@bfineran bfineran changed the title [WIP] int4-quant int4 quantized activations support + QuantizationModifier qconfig kwargs support Feb 18, 2022
@bfineran (Contributor) left a comment

One more comment, then good to go.

@rahul-tuli rahul-tuli requested a review from bfineran February 18, 2022 15:34
bfineran previously approved these changes Feb 18, 2022
@rahul-tuli rahul-tuli changed the title int4 quantized activations support + QuantizationModifier qconfig kwargs support Activation Bits Support + qconfig kwargs for QuantizationModifier Feb 18, 2022
KSGulin previously approved these changes Feb 18, 2022
@KSGulin (Contributor) left a comment

Good stuff

@KSGulin KSGulin dismissed stale reviews from bfineran and themself via 82e94d3 February 18, 2022 16:27
KSGulin previously approved these changes Feb 18, 2022
bfineran previously approved these changes Feb 18, 2022
natuan previously requested changes Feb 18, 2022
@rahul-tuli rahul-tuli requested a review from natuan February 23, 2022 08:23
@rahul-tuli rahul-tuli dismissed natuan’s stale review February 23, 2022 08:27

synced offline

@rahul-tuli rahul-tuli merged commit a78f340 into main Feb 23, 2022
@rahul-tuli rahul-tuli deleted the feature-int4-quant branch February 23, 2022 15:29
