Release Version 2.14.0 · qualcomm/aimet

New Feature
- ONNX
  - Add support for FP16 in QuantizationSimModel (2494d90)
Bug fixes and Improvements
- ONNX
  - Add sequential MSE support for onnx >= 1.18.0. (754d030)
  - Improve histogram granularity during TFE calibration (91109af)
  - Improve runtime for QuantizationSimModel creation for large models like LLMs (f7e700f)
  - Improve runtime for setting quantizers in a QuantizationSimModel for use cases like tying KV Cache input and output quantizers. (c0bdb46)
  - Add a check for None values in the group attribute of Conv layers and fix improper handling of None group attribute in ConvTranspose within :func:fold_all_batch_norms_to_weight (374e8db)
- PyTorch
  - Address QAT convergence issue: Add a fix for cases where quantizer.min becomes equal to quantizer.max during training, leading to NaN values (51f8990)
- Keras
  - Fix accuracy drop issue for GPU wheel by excluding libpython*.so* from the aimet wheel packages (22cac5c)
- Common
  - Remove Conv3d, Conv3dTranspose, and DepthwiseConv ops followed by activation from the supergroup until HTP support is available. (05f6810)
  - Fix color theme issue in documentation causing code snippets to render incorrectly (2c64eac)

Provide feedback

No results found