Switch off fp16 on QAT start #703

rahul-tuli · 2022-04-12T15:36:10Z

This PR updates ImageClassificationTrainer to switch off fp16 on QAT start

Noting that torch > 1.9. does not crash if mixed precision is not switched off before QAT start thus the additional check

src/sparseml/pytorch/image_classification/utils/trainer.py

src/sparseml/pytorch/image_classification/utils/helpers.py

KSGulin

LGTM! Just one small comment/question

src/sparseml/pytorch/image_classification/utils/trainer.py

@KSGulin

* Avoid numerically unstable log (#694) * fix QAT->Quant conversion of repeated Gemm layers with no activation QDQ (#698) * Revert rn residual quant (#691) * Revert ResNet definition to not quantize input to add op in residual branches. * Correct typo. Co-authored-by: Mark Kurtz <mark@neuralmagic.com> * Fix: Add linebreak before 'Supplied' for better readability (#701) * Bump notebook in /research/information_retrieval/doc2query (#679) Bumps [notebook](http://jupyter.org) from 6.4.1 to 6.4.10. --- updated-dependencies: - dependency-name: notebook dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Mark Kurtz <mark@neuralmagic.com> Co-authored-by: Michael Goin <michael@neuralmagic.com> * Added integration to masked_language_modeling training command (#707) * Switch off fp16 on QAT start (#703) * Switch off fp16 on QAT start * address: review comments * Disable fp16 when torch version is lesser than `1.9` * Fix transformer prediction step (#716) * Fix for prediction step when teacher model has more inputs than student. * Updated signature of prediction_step method. * Style and quality fixes. * bump main to 0.13 (#696) Co-authored-by: dhuang <dhuang@dhuangs-MacBook-Pro.local> * Fix: default python log calls to debug level (#719) * Feature/integrations (#688) * added tutorials to root readme split by domain * readme update * edited text/structure * grammar edits * fix QATWrapper not properly overwritting qconfig properties for symmetric activations (#724) * re-add fix symmetric zero points for unit8 quantization (#604) (#725) * Fix 'self' and 'disable' not working for transformers distillation (#731) * Click refactor for SparseML-PyTorch integration with Image Classification models (#711) * Click refactor for SparseML-PyTorch integration * Click refactor for `Pruning Sensitivity` analysis (#714) * Click refactor for SparseML-PyTorch pr_sensitivity analysis integration * Review comments from @KSGulin * Click refactor for SparseML-PyTorch `lr-analysis` integration (#713) * Click refactor for SparseML-PyTorch lr-analysis integration * Review comments from @KSGulin * Click refactor for SparseML PyTorch `export` integration (#712) * Click refactor for SparseML-PyTorch export integration * Review comments from @KSGulin * Addressed all review comments from @bfineran, @dbogunowicz and @KSGulin * Regenerate and Update the train-cli docstring due to changes in a few cli-args * `nm_argparser.py` not needed anymore * removed `nm_argparser.py` from init * Remove All CLI args aliases and updated doctrings accordingly * [Fix] Follow-up fix for #731 (Fix 'self' and 'disable' not working for transformers distillation) (#737) * initial commit * added more files and fixed quality * Update trainer.py * Added flag to exclude quantization of embedding activations. (#738) * Added flag to exclude quantization of embedding activations. * Updated testing to contemplate quantize_embedding_activations flag. * Updated testing to contemplate quantize_embedding_activations flag. * Updated debugging * Revert "Updated debugging" This reverts commit 449703d. * Corrected order of arguments to pass assertion. * Update src/sparseml/version.py Co-authored-by: Eldar Kurtic <eldar.ciki@gmail.com> Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com> Co-authored-by: Alexandre Marques <alexandre@neuralmagic.com> Co-authored-by: Konstantin Gulin <66528950+KSGulin@users.noreply.github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Michael Goin <michael@neuralmagic.com> Co-authored-by: Rahul Tuli <rahul@neuralmagic.com> Co-authored-by: dhuangnm <74931910+dhuangnm@users.noreply.github.com> Co-authored-by: dhuang <dhuang@dhuangs-MacBook-Pro.local> Co-authored-by: Ricky Costa <79061523+InquestGeronimo@users.noreply.github.com> Co-authored-by: dbogunowicz <97082108+dbogunowicz@users.noreply.github.com>

rahul-tuli requested review from a team, KSGulin, bfineran and markurtz and removed request for a team April 12, 2022 15:36

rahul-tuli self-assigned this Apr 12, 2022

rahul-tuli added the 0.13 release A label for release sparseml release 0.13 label Apr 12, 2022

Switch off fp16 on QAT start

1f27177

rahul-tuli force-pushed the disable-fp16-on-quant-start branch from e3239c7 to 1f27177 Compare April 12, 2022 15:38

bfineran reviewed Apr 12, 2022

View reviewed changes

src/sparseml/pytorch/image_classification/utils/trainer.py Outdated Show resolved Hide resolved

address: review comments

cf0641c

rahul-tuli requested a review from bfineran April 12, 2022 16:17

Disable fp16 when torch version is lesser than 1.9

755b91c

bfineran marked this pull request as ready for review April 12, 2022 18:29

bfineran reviewed Apr 12, 2022

View reviewed changes

src/sparseml/pytorch/image_classification/utils/helpers.py Show resolved Hide resolved

rahul-tuli requested a review from bfineran April 12, 2022 20:07

bfineran approved these changes Apr 12, 2022

View reviewed changes

KSGulin reviewed Apr 14, 2022

View reviewed changes

src/sparseml/pytorch/image_classification/utils/trainer.py Show resolved Hide resolved

Merge branch 'main' into disable-fp16-on-quant-start

28d71b6

KSGulin approved these changes Apr 15, 2022

View reviewed changes

rahul-tuli merged commit 9c9e217 into main Apr 15, 2022

rahul-tuli deleted the disable-fp16-on-quant-start branch April 15, 2022 14:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Switch off fp16 on QAT start #703

Switch off fp16 on QAT start #703

Uh oh!

rahul-tuli commented Apr 12, 2022 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

KSGulin left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Switch off fp16 on QAT start #703

Switch off fp16 on QAT start #703

Uh oh!

Conversation

rahul-tuli commented Apr 12, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

KSGulin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rahul-tuli commented Apr 12, 2022 •

edited

Loading