Version 2.17.0

aimetci released this 20 Oct 21:38

0e679b7

Bug fixes and Improvements
- ONNX
  - Optimize SeqMSE latency and CPU memory usage (434ac6b)
  - Support excluding nodes from SeqMSE optimization (6a37239)
  - Support exporting large models (> 2GB) to ONNX QDQ (b1dafe6, 1bf8b82)
  - Support exporting float16 ONNX models to ONNX QDQ (66ccb45)
  - Allow disabling MatMul-Add supergroup via config file (e49660c)
  - Fix bug where on-disk tensor data is deleted before the InferenceSession is created (d57a934)
- Torch
  - Fix sim.export bug when using Python >= 3.12 (ee949a2)
  - Allow export for back-to-back quantizers which share the same encodings (28a7382)
  - Fix numerical issue in FPTQuant (f0bc6c9)
- Common
  - Remove Conv-Relu supergroup from HTP < V73 config files (19e5a4e)
  - Fix LayerNorm and InstanceNorm weight symmetry in HTP < V73 config files (eb1ac5c, ce1ea63)
