Version 2.24.0
- Bug fixes and Improvements
-
ONNX
-
Torch
- Expose block-level AdaScale API (72246db)
- Improve numerical stability of the zero-point shifting ([-1.5, -0.5, 0.5, 1.5]) implementation (489f7df)
- Fix :func:`replace_lora_layers_with_quantizable_layers` to inherit train/eval flag (af5a82d)
- Fix SpinQuant evaluation by untying lm_head and embed_tokens prior to loading the state_dict (47f574d)
- Experimental - Implement Progressive Gradient Scaling (PGS) support for Triton-based quantization kernels (b58b00b)
-
Common
-