v0.6.0: Mixtral, StableLM, DeciLM, Yi support, Transformers 4.36 compatibility
What's Changed
- Specify the PyTorch version by @fxmarty in #421
- Fix triton unexpected keyword by @LaaZa in #423
- Add support for Yi models by @LaaZa in #413
- Add support for Xverse models by @LaaZa in #417
- Allow fp32 input to GPTQ linear by @fxmarty in #437
- Fix typos in tests by @fxmarty in #438
- Fix loading of remote (.bin) models in _base.py by @Shades-en in #465
- Make the build succeed on Jetson devices (L4T) by @mikeshi80 in #470
- Add option to disable qigen at build by @fxmarty in #471
- Stop converting a list to int in setup.py when retrieving cores_info by @wemoveon2 in #474
- Only apply make_quant to inside_layer_modules by @LaaZa in #479
- Add support for DeciLM models by @LaaZa in #481
- Add support for StableLM Epoch models by @LaaZa in #444
- Add support for Mixtral models by @LaaZa in #480
- Fix compatibility with transformers 4.36 by @fxmarty in #483
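
For anyone upgrading alongside the Transformers 4.36 compatibility fix, here is a minimal sketch of checking the installed `transformers` version before relying on this release. It assumes you want 4.36 or newer, which these notes do not state as a hard requirement. The helper names `parse_version`, `meets_minimum`, and `installed_version` are hypothetical and not part of AutoGPTQ, and the parsing deliberately ignores pre-release suffixes.

```python
from importlib import metadata


def parse_version(text):
    """Turn '4.36.2' or '4.36.0.dev0' into a tuple of its leading integer parts."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if ch.isdigit():
                digits += ch
            else:
                break
        if not digits:
            # Stop at the first component with no leading digits (e.g. 'dev0').
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Return True if the installed version is at least the minimum version."""
    return parse_version(installed) >= parse_version(minimum)


def installed_version(package):
    """Return the installed version string of a package, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Assumption: 4.36 is used here as the threshold purely for illustration.
    found = installed_version("transformers")
    if found is None:
        print("transformers is not installed")
    elif meets_minimum(found, "4.36"):
        print(f"transformers {found} is 4.36 or newer")
    else:
        print(f"transformers {found} is older than 4.36")
```

Comparing integer tuples avoids the string-comparison pitfall where "4.4" would sort after "4.36".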
New Contributors
- @Shades-en made their first contribution in #465
- @mikeshi80 made their first contribution in #470
- @wemoveon2 made their first contribution in #474
Full Changelog: v0.5.1...v0.6.0