Version 2.10.0
What's Changed
New Feature
- Promote to_onnx_qdq to a public API (f333188). Note: this is currently a beta feature.
Bug fixes and Improvements
- Common
- Add a hover tooltip to the per-layer sensitivity plot and change the x-axis to show layer indices instead of names (c96894f)
- PyTorch
- ONNX
- Allow AdaRound and SeqMSE to take uncalibrated sims (31ca7fd)
- Modify bias quantizer setting based on weight quantizer (b47a97e)
- Fix cnt overflow issue (70029c5)
- Make the memory-saving optimization the default in build_session and _infer_activation_dtypes (4b94ca9)
- Implement two-phase AMP API (1603c17)
- Work around an ONNX version converter issue for models with external weights (22f0f23)
- Common
Documentation
- Update SeqMSE feature guide (fefd504)
- Fix links in example notebooks (fe66376)
- Modify docs for CLE (f9d0d6c)
- Edit automatic mixed precision feature guide (22b5c94)
- Polish BQ user guide (f547a49)
- Polish QAT user guide (339a225)
- Update Quick start example, PTQ section (example) and notebook (03fdd64)
- Add missing workflow images (010898a)
- Add reference to mixed precision page in docs (188d401)