Fix for ONNX export for quantized BERT models #935

anmarques · 2022-07-08T21:37:23Z

Remove quantization of identity branch on BERT models.
Replace array quantization in NumPy for torch.quantize_per_tensor.

NOTE: This PR does NOT remove quantization of the identity branch for ResNet models. This will require fixes on the model side as well. Will be addressed in a future PR.

github-actions · 2022-07-08T21:37:39Z

@kylesayrs assigned for review

bfineran

LGTM pending comments

src/sparseml/pytorch/sparsification/quantization/quantize_qat_export.py

…export.py Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com>

natuan

LGTM, pending Ben's suggestion to delete the unused func

* Remove quantization of identity branch on BERT models * Style and quality fixes. * Update src/sparseml/pytorch/sparsification/quantization/quantize_qat_export.py Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com> * Removed unused function Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com>

* Bump up version id * Fix for ONNX export for quantized BERT models (#935) * Remove quantization of identity branch on BERT models * Style and quality fixes. * Update src/sparseml/pytorch/sparsification/quantization/quantize_qat_export.py Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com> * Removed unused function Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com> Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com>

Remove quantization of identity branch on BERT models

31cd5cb

github-actions bot assigned anmarques Jul 8, 2022

github-actions bot requested a review from kylesayrs July 8, 2022 21:37

anmarques requested review from a team, bfineran and natuan July 8, 2022 21:39

Style and quality fixes.

07726f4

bfineran reviewed Jul 8, 2022

View reviewed changes

Update src/sparseml/pytorch/sparsification/quantization/quantize_qat_…

6a75e8e

…export.py Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com>

natuan previously approved these changes Jul 8, 2022

View reviewed changes

Removed unused function

425562f

anmarques dismissed natuan’s stale review via 425562f July 8, 2022 21:49

bfineran approved these changes Jul 8, 2022

View reviewed changes

anmarques requested a review from natuan July 8, 2022 21:51

natuan approved these changes Jul 8, 2022

View reviewed changes

natuan merged commit adb8429 into main Jul 8, 2022

natuan deleted the fix_onnx_export_bert branch July 8, 2022 22:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix for ONNX export for quantized BERT models #935

Fix for ONNX export for quantized BERT models #935

Uh oh!

anmarques commented Jul 8, 2022

Uh oh!

github-actions bot commented Jul 8, 2022

Uh oh!

bfineran left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

natuan left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix for ONNX export for quantized BERT models #935

Fix for ONNX export for quantized BERT models #935

Uh oh!

Conversation

anmarques commented Jul 8, 2022

Uh oh!

github-actions bot commented Jul 8, 2022

Uh oh!

bfineran left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

natuan left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants