What's Changed
New Features
- ONNX
- Updated aimet_onnx QuantizationSimModel.__init__ function signature (cbe67ae)
- Defined new AdaRound API aimet_onnx.apply_adaround (84edcf5)
- Defined new sequential MSE API aimet_onnx.apply_seq_mse (836ab1e)
- Defined new per-layer sensitivity analysis API aimet_onnx.analyze_per_layer_sensitivity (dc34fa4)
- Allowed onnx QuantizationSimModel.compute_encodings to take iterables (2c8ae88)
- PyTorch
- Added native support for huggingface Phi-3 (80cd141)
Bug Fixes and Improvements
- ONNX
- Made dynamic weights of Conv, ConvTranspose, Gemm, and MatMul follow the symmetry of static weights (ce68e75)
- aimet-onnx on PyPI is now compatible with onnxruntime-gpu (6d3aa97)
- Unpinned onnx version (abe8782)
- Changed default execution provider to CPUExecutionProvider (e7d10c7)
- Made QcQuantizeOp's data_type attribute always consistent without additional reconfiguration (8009871)
- Made delta/offset and min/max always consistent (88706ef)
- PyTorch