Post-train quantization based on stats + additional modules quantized #136
Conversation
* Expose command line arguments for collecting and loading stats
* Integrate in image classification sample
* range_linear.py:
  * Load stats from YAML file or dict in post-train quantizer
  * Refactor layer wrappers to handle both dynamic and stats-based (aka "static") cases
* collector.py:
  * Add dedicated collector for quantization stats
  * Allow None classes filter, in which case collect stats for all types
  * Expose abstract 'save' function instead of existing 'to_xlsx'
* Fixes to typos and some comments
* Extract code for setting deterministic execution into a function, move to utils.py
* Add ability to shuffle test dataset (required since we don't collect stats on the whole test set)
* Move YAML loading functionality to utils.py
* Add unit tests for quantization based on stats
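The stats-based (aka "static") path described above amounts to deriving quantization parameters from pre-collected min/max statistics, instead of computing ranges dynamically at runtime. A minimal, dependency-free sketch of that idea (illustrative only; the function names are hypothetical and this is not Distiller's actual range_linear.py implementation):

```python
# Sketch: asymmetric range-based linear quantization parameters derived
# from pre-collected activation statistics (min/max per layer).
# Illustrative only -- not Distiller's actual implementation.

def linear_quant_params(stats_min, stats_max, num_bits=8):
    """Compute scale and zero-point from collected min/max stats."""
    # Make sure the representable range always covers zero, so that
    # zero-valued activations map to an exact quantized level.
    stats_min = min(stats_min, 0.0)
    stats_max = max(stats_max, 0.0)
    n_levels = 2 ** num_bits - 1
    scale = n_levels / (stats_max - stats_min)
    zero_point = round(-stats_min * scale)
    return scale, zero_point

def quantize(x, scale, zero_point, num_bits=8):
    """Quantize a single float value and clamp to [0, 2^bits - 1]."""
    q = round(x * scale) + zero_point
    return max(0, min(2 ** num_bits - 1, q))
```

With stats_min = -1.0 and stats_max = 3.0 at 8 bits, zero maps to quantized level 64, and the range endpoints map to 0 and 255.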
…d control
* Command line argument to configure post-train quantizer from file
* Integrated in image classification sample
* Add ability to load specific "component" from YAML file, without a full-blown scheduler
* Minor change in Quantizer initialization - change default of bits_overrides to None
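Loading a single "component" from a YAML file without building a full scheduler can be sketched as follows (assuming PyYAML is available; the section layout and the load_component helper are hypothetical illustrations, not Distiller's actual API):

```python
import yaml

# Example scheduler-style YAML with a 'quantizers' section. The keys and
# class name below are illustrative, not an exact Distiller schema.
CONFIG = """
quantizers:
  post_train_quantizer:
    class: PostTrainLinearQuantizer
    bits_activations: 8
    bits_parameters: 8
"""

def load_component(yaml_text, section):
    """Parse the YAML document and return just one named section,
    without instantiating a scheduler around it."""
    doc = yaml.safe_load(yaml_text)
    if section not in doc:
        raise KeyError("YAML document has no '%s' section" % section)
    return doc[section]
```

A caller can then pass the returned dict straight to whatever consumes that component's configuration.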
* Concat + element-wise add / mult supported in post-training
* Embeddings supported in quantization-aware training
* Wrapped PyTorch concat + element-wise add/mult ops in Modules so they could be recognized by the Quantizer
* Modified our ResNet implementation to use element-wise add modules instead of the operator
* Unit tests for concat, add and mult
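The reason for wrapping these ops in Modules: a quantizer that traverses the module tree can only find and replace ops that exist as named submodules, while bare operators used inside forward() are invisible to it. A minimal sketch of the idea (class names here are illustrative, not necessarily the ones Distiller defines):

```python
import torch
import torch.nn as nn

class EltwiseAdd(nn.Module):
    """Element-wise addition wrapped as a Module, so module-level
    traversal (e.g. named_modules()) can discover and replace it."""
    def forward(self, *inputs):
        res = inputs[0]
        for t in inputs[1:]:
            res = res + t
        return res

class BasicBlock(nn.Module):
    """Toy residual block using the wrapper instead of the '+' operator."""
    def __init__(self, channels):
        super().__init__()
        self.conv = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.add = EltwiseAdd()  # now visible as a named submodule

    def forward(self, x):
        return self.add(x, self.conv(x))
```

Because `add` appears in `named_modules()`, a quantizer can swap it for a quantized equivalent the same way it swaps Conv2d layers.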
All in all looks good.
Commented on a few typos.
One thing to make sure of before merging is that post-training quantization works well on a CPU-only machine.
Let's discuss controlling the amount of data used for training/validation/test - there are small changes I'd prefer you make (details in the specific remarks).
Thanks
Neta
        return dict_config(model, optimizer, sched_dict, scheduler)
    except yaml.YAMLError as exc:
        print("\nFATAL parsing error while parsing the schedule configuration file %s" % filename)
        raise
        raise
This raise is not needed...
This is a weird merge artifact in GitHub, the code on the branch doesn't actually have the redundant statement. So I can't fix it now. If it sticks after the merge I'll remove it then.
* Change semantics of 'qe_calibration' argument to match the new implementation of using part of dataset
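Using only part of the (shuffled) dataset for calibration can be sketched as follows. This is an illustrative helper, not the actual implementation behind qe_calibration; shuffling matters because sequential datasets are often ordered by class, and a head slice would then cover only a few classes:

```python
import random

def calibration_indices(dataset_len, fraction, seed=0):
    """Pick a reproducible random subset of sample indices on which to
    collect quantization stats. 'fraction' is e.g. 0.1 for 10%."""
    rng = random.Random(seed)        # fixed seed -> deterministic subset
    indices = list(range(dataset_len))
    rng.shuffle(indices)
    n = max(1, int(dataset_len * fraction))
    return indices[:n]
```

The returned index list can be fed to a sampler or subset wrapper so the stats-collection pass only touches that portion of the data.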
Summary of changes:
(1) Post-train quantization based on pre-collected statistics
(2) Quantized concat, element-wise addition / multiplication and embeddings
(3) Move post-train quantization command line args out of sample code
(4) Configure post-train quantization from YAML for more fine-grained control
(See PR #136 for more detailed descriptions of the changes)
This is a big PR; it will be easiest to review commit-by-commit. Not exactly best practice, but as it is, I prefer this to separate PRs.