Skip to content

Version 2.27.0

Choose a tag to compare

@aimetci aimetci released this 26 Mar 05:40
  • Bug fixes and Improvements
    • ONNX

      • Add force_activation_as option to export APIs to control activation signedness (3583462)
    • Torch

      • Reduce quantize-dequantize latency overhead (9ca3bf4, 525e993, b3de9a2)
      • Optimize inference speed for GenAITests models (cacd5cc, b6ea5bd, 30ab60a)
      • Allow checkpointing and loading during SeqMSE optimization (4eb97f0)
      • Fix SeqMSE error when model contains unquantized Conv/Linear layers (3dd4ca9)
      • Populate scalar constant Mul/Div output encodings at export (1228394, 169952d, ca2a324)
      • Propagate tensor encodings through scalar Mul/Div operations (54c7462, 2cfd07e)
    • Common

      • Propagate concat input quantizers to output when possible (5ee0f13)