Skip to content

Version 2.8.0

Latest
Compare
Choose a tag to compare
@aimetci aimetci released this 18 Jun 16:14

What's Changed

New Features

  • ONNX
    • Update aimet_onnx QuantizationSimModel.__init__ function signature (cbe67ae)
    • Defined new AdaRound API aimet_onnx.apply_adaround (84edcf5)
    • Defined new sequential MSE API aimet_onnx.apply_seq_mse (836ab1e)
    • Defined new per-layer sensitivity analysis API aimet_onnx.analyze_per_layer_sensitivity (dc34fa4)
    • Allowed onnx QuantizationSimModel.compute_encodings to take iterables (2c8ae88)
  • PyTorch
    • Added native support for huggingface Phi-3 (80cd141)

Bug Fixes and Improvements

  • ONNX
    • Made dynamic weights of Conv, ConvTranspose, Gemm, and MatMul follow the symmetry of static weights (ce68e75)
    • aimet-onnx on PyPI is now compatible with onnxruntime-gpu (6d3aa97)
    • Unpinned onnx version (abe8782)
    • Changed default execution provider to CPUExecutionProvider (e7d10c7)
    • Made QcQuantizeOp's data_type attribute always consistent without additional reconfiguration (8009871)
    • Made delta/offset and min/max always consistent (88706ef)
  • PyTorch
    • Made input quantizers always get enabled whenever the input wasn't already quantized (a2adae2)
    • Deprecated saving PyTorch model object during QuantizationsimModel.export (b5521f3)