Version 2.17.0

@aimetci released this 20 Oct 21:38
  • Bug fixes and improvements
    • ONNX
      • Optimize SeqMSE latency and CPU memory usage (434ac6b)
      • Support excluding nodes from SeqMSE optimization (6a37239)
      • Support exporting large models (> 2GB) to ONNX QDQ (b1dafe6, 1bf8b82)
      • Support exporting float16 ONNX models to ONNX QDQ (66ccb45)
      • Allow disabling MatMul-Add supergroup via config file (e49660c)
      • Fix bug where on-disk tensor data is deleted before InferenceSession (d57a934)
    • Torch
      • Fix sim.export bug when using Python >= 3.12 (ee949a2)
      • Allow export for back-to-back quantizers which share the same encodings (28a7382)
      • Fix numerical issue in FPTQuant (f0bc6c9)
    • Common
      • Remove Conv-Relu supergroup from HTP < V73 config files (19e5a4e)
      • Fix LayerNorm and InstanceNorm weight symmetry in HTP < V73 config files (eb1ac5c, ce1ea63)