Version 2.27.0
- Bug fixes and Improvements
-
ONNX
- Add
force_activation_asoption to export APIs to control activation signedness (3583462)
- Add
-
Torch
- Reduce quantize-dequantize latency overhead (9ca3bf4, 525e993, b3de9a2)
- Optimize inference speed for GenAITests models (cacd5cc, b6ea5bd, 30ab60a)
- Allow checkpointing and loading during SeqMSE optimization (4eb97f0)
- Fix SeqMSE error when model contains unquantized Conv/Linear layers (3dd4ca9)
- Populate scalar constant Mul/Div output encodings at export (1228394, 169952d, ca2a324)
- Propagate tensor encodings through scalar Mul/Div operations (54c7462, 2cfd07e)
-
Common
- Propagate concat input quantizers to output when possible (5ee0f13)
-