Version 2.27.0

aimetci released this 26 Mar 05:40

f87b96a

Bug fixes and improvements
- ONNX
  - Add force_activation_as option to export APIs to control activation signedness (3583462)
- Torch
  - Reduce quantize-dequantize latency overhead (9ca3bf4, 525e993, b3de9a2)
  - Optimize inference speed for GenAITests models (cacd5cc, b6ea5bd, 30ab60a)
  - Allow checkpointing and loading during SeqMSE optimization (4eb97f0)
  - Fix SeqMSE error when model contains unquantized Conv/Linear layers (3dd4ca9)
  - Populate scalar constant Mul/Div output encodings at export (1228394, 169952d, ca2a324)
  - Propagate tensor encodings through scalar Mul/Div operations (54c7462, 2cfd07e)
- Common
  - Propagate concat input quantizers to output when possible (5ee0f13)
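
The scalar Mul/Div items above rest on a simple property of affine quantization ranges: multiplying a tensor by a constant scalar rescales its range by the same constant, so the output encoding can be derived from the input encoding without separate calibration. The following is a minimal sketch of that arithmetic only, not AIMET's implementation. The `Encoding` class and the `propagate_scalar_mul` and `propagate_scalar_div` helpers are hypothetical names introduced for illustration and are not part of the AIMET API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Encoding:
    """Hypothetical affine encoding: a real-valued range plus a bitwidth."""
    vmin: float
    vmax: float
    bitwidth: int = 8

    @property
    def scale(self) -> float:
        # Step size between adjacent quantized levels.
        return (self.vmax - self.vmin) / (2 ** self.bitwidth - 1)


def propagate_scalar_mul(enc: Encoding, scalar: float) -> Encoding:
    """Derive the encoding of `x * scalar` from the encoding of `x`."""
    if scalar == 0:
        raise ValueError("a zero scalar collapses the range to a single point")
    a, b = enc.vmin * scalar, enc.vmax * scalar
    # A negative scalar flips the range, so reorder the bounds.
    return Encoding(min(a, b), max(a, b), enc.bitwidth)


def propagate_scalar_div(enc: Encoding, scalar: float) -> Encoding:
    """Derive the encoding of `x / scalar`; division is multiplication by 1/scalar."""
    if scalar == 0:
        raise ZeroDivisionError("cannot divide by a zero scalar")
    return propagate_scalar_mul(enc, 1.0 / scalar)


if __name__ == "__main__":
    x_enc = Encoding(vmin=-1.0, vmax=3.0)
    print(propagate_scalar_mul(x_enc, 2.0))   # range doubles
    print(propagate_scalar_mul(x_enc, -0.5))  # range halves and flips
    print(propagate_scalar_div(x_enc, 2.0))   # range halves
```

In this sketch the step size of the output encoding is the input step size multiplied by the absolute value of the scalar, which is why no extra calibration pass is needed for such nodes.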