Version 2.14.0
-
New Feature
- ONNX
- Add support for FP16 in
QuantizationSimModel(2494d90)
- Add support for FP16 in
- ONNX
-
Bug fixes and Improvements
-
ONNX
- Add sequential MSE support for
onnx >= 1.18.0. (754d030) - Improve histogram granularity during TFE calibration (91109af)
- Improve runtime for
QuantizationSimModelcreation for large models like LLMs (f7e700f) - Improve runtime for setting quantizers in a
QuantizationSimModelfor use cases like tying KV Cache input and output quantizers. (c0bdb46) - Add a check for None values in the
groupattribute ofConvlayers and fix improper handling of Nonegroupattribute inConvTransposewithin :func:fold_all_batch_norms_to_weight(374e8db)
- Add sequential MSE support for
-
PyTorch
- Address QAT convergence issue: Add a fix for cases where
quantizer.minbecomes equal toquantizer.maxduring training, leading to NaN values (51f8990)
- Address QAT convergence issue: Add a fix for cases where
-
Keras
- Fix accuracy drop issue for GPU wheel by excluding
libpython*.so*from the aimet wheel packages (22cac5c)
- Fix accuracy drop issue for GPU wheel by excluding
-
Common
-