Version 2.16.0

aimetci released this 07 Oct 04:36

New Features

  • ONNX
    • Experimental: added AdaScale, a post-training quantization technique (5e23ceb)

Bug fixes and Improvements

  • ONNX
    • Skip tying Concat input/output quantizers with conflicting encoding constraints (b924107)
    • Small updates to FPTQuant for improved accuracy (ba10947)
    • Implement partial encoding freezing mechanism in aimet-onnx (658ec3c)
    • Add Relu partial encoding constraints to HTP config files (dc8d978)
    • Clear encoding analyzer stats after computing param encodings (3d4725f)
    • Remove wasted computation/memory in FPTQuant local optimizer (59350af)
  • Torch
    • Allow boolean type casting of QuantizedTensors (7d63e66)
    • Implement partial encoding freezing mechanism in aimet-torch (1b99a39)
    • Improve scale post-processing to prevent scale freezing during QAT (6fe56b0)