v0.1.1: Intel Neural Compressor's dynamic, post-training and aware-training quantization support

echarlaix released this 10 Nov 17:54

v0.1.1

ff6a10a

With this release, we enable dynamic quantization, post-training quantization and quantization-aware training of PyTorch models with Intel Neural Compressor v1.7 for a variety of NLP tasks. This support covers the whole process, from applying quantization to loading the resulting quantized model. The latter is enabled by the new IncQuantizedModel class.
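
For readers new to quantization, the following is a minimal pure-Python sketch of the 8-bit affine mapping that these approaches generally build on. It is not the Intel Neural Compressor or IncQuantizedModel API. The names `quantize_int8` and `dequantize_int8` are hypothetical and used only for illustration.

```python
from typing import List, Tuple


def quantize_int8(values: List[float]) -> Tuple[List[int], float, int]:
    """Map floats to int8 using an affine (scale, zero_point) scheme.

    Illustrative only: real toolkits calibrate ranges per tensor or per
    channel and run the arithmetic in optimized kernels.
    """
    # The representable range must contain 0.0 so that zero maps exactly.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    # 255 steps span the int8 range [-128, 127]; fall back to 1.0 if all zeros.
    scale = (hi - lo) / 255.0 or 1.0
    zero_point = round(-128 - lo / scale)
    # Round to the nearest step, shift by the zero point, clamp to int8.
    q = [max(-128, min(127, round(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point


def dequantize_int8(q: List[int], scale: float, zero_point: int) -> List[float]:
    """Recover approximate floats from the int8 representation."""
    return [(x - zero_point) * scale for x in q]


weights = [-1.0, -0.25, 0.0, 0.5, 1.0]
q, scale, zero_point = quantize_int8(weights)
restored = dequantize_int8(q, scale, zero_point)
```

In this sketch the range is computed directly from the values being quantized. Dynamic quantization typically does this for activations at inference time, post-training quantization typically estimates the ranges beforehand from calibration data, and quantization-aware training simulates the rounding during training so the model can adapt to it.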
