Skip to content

v0.1.1: Intel Neural Compressor's dynamic, post-training and aware-training quantization support

Compare
Choose a tag to compare
@echarlaix echarlaix released this 10 Nov 17:54
· 996 commits to main since this release

With this release, we enable Intel Neural Compressor v1.7 PyTorch dynamic, post-training and aware-training quantization for a variety of NLP tasks. This support includes the overall process, from quantization application to the loading of the resulting quantized model. The latter being enabled by the introduction of the IncQuantizedModel class.