Version 2.16.0
New Features
- ONNX
- Experimental - Added Adascale, a post-training quantization technique (5e23ceb)
Bug fixes and Improvements
- ONNX
- Skip tying Concat input/output quantizers with conflicting encoding constraints (b924107)
- Small updates to FPTQuant for improved accuracy (ba10947)
- Implement partial encoding freezing mechanism in aimet-onnx (658ec3c)
- Add Relu partial encoding constraints to HTP config files (dc8d978)
- Clear encoding analyzer stats after computing param encodings (3d4725f)
- Remove wasted computation/memory in FPTQuant local optimizer (59350af)
- Torch