Releases · turboderp/exllamav2
0.0.21
- Support for Granite architecture
- Support for GPT2 architecture
- Support for banned strings in the streaming generator (see the sketch below)
- A bit more work on multimodal support (still unfinished)
- A few bugfixes and other minor changes
- Windows wheels for PyTorch 2.2.0 are included below to work around an apparent (likely temporary) issue in PyTorch. See #434 and pytorch/pytorch#125109
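A minimal sketch of the banned-strings feature. The model path is a placeholder, and the `banned_strings` argument to `begin_stream_ex()` is an assumption based on the repo's streaming examples, not a confirmed signature:

```python
# Sketch: streaming generation with banned strings. When a banned string
# starts to appear in the output, the generator is expected to rewind and
# resample rather than emit it. Model path is hypothetical; banned_strings
# on begin_stream_ex() is assumed from the repo's examples.
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2StreamingGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
config.model_dir = "/path/to/exl2-model"  # hypothetical path
config.prepare()

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy = True)
model.load_autosplit(cache)
tokenizer = ExLlamaV2Tokenizer(config)

generator = ExLlamaV2StreamingGenerator(model, cache, tokenizer)
settings = ExLlamaV2Sampler.Settings()

input_ids = tokenizer.encode("Write a short product description.")
generator.begin_stream_ex(input_ids, settings,
                          banned_strings = ["as an AI", "in conclusion"])

# Stream until EOS or a token budget is reached
max_new_tokens = 200
generated_tokens = 0
while True:
    res = generator.stream_ex()
    print(res["chunk"], end = "")
    generated_tokens += 1
    if res["eos"] or generated_tokens == max_new_tokens: break
```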
Full Changelog: v0.0.20...v0.0.21
0.0.20
- Adds Phi3 support
- Wheels compiled for PyTorch 2.3.0
- ROCm 6.0 wheels
Full Changelog: v0.0.19...v0.0.20
0.0.19
- More accurate Q4 cache using groupwise rotations
- Better prompt ingestion speed when using flash-attn
- Minor fixes related to issues quantizing Llama 3
- New, more robust optimizer
- Fixes a bug in long-sequence inference with GPTQ models
Full Changelog: v0.0.18...v0.0.19
0.0.18
- Support for Command-R-plus
- Fix for pre-AVX2 CPUs
- VRAM optimizations for quantization
- Very preliminary multimodal support
- Various other small fixes and optimizations
Full Changelog: v0.0.17...v0.0.18
0.0.17
Mostly just minor fixes and support for DBRX models.
Full Changelog: v0.0.16...v0.0.17
0.0.16
- Adds support for Cohere models
- N-gram decoding (see the sketch after this list)
- A few bugfixes
- Lots of optimizations
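N-gram decoding drafts upcoming tokens from repeating patterns already present in the context and verifies them in a single forward pass, which can speed up repetitive or extractive outputs without a separate draft model. A minimal sketch, assuming the `speculative_ngram` flag on the streaming generator (the flag name and model path are assumptions based on the repo's examples):

```python
# Sketch: enabling n-gram speculative decoding on the streaming generator.
# The speculative_ngram attribute is assumed; model path is a placeholder.
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2StreamingGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
config.model_dir = "/path/to/exl2-model"  # hypothetical path
config.prepare()

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy = True)
model.load_autosplit(cache)
tokenizer = ExLlamaV2Tokenizer(config)

generator = ExLlamaV2StreamingGenerator(model, cache, tokenizer)
generator.speculative_ngram = True  # draft tokens from n-gram matches in context

settings = ExLlamaV2Sampler.Settings()
generator.begin_stream_ex(tokenizer.encode("List the first ten primes:"), settings)

for _ in range(200):
    res = generator.stream_ex()
    print(res["chunk"], end = "")
    if res["eos"]: break
```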
Full Changelog: v0.0.15...v0.0.16
0.0.15
- Adds Q4 cache mode (see the sketch below)
- Support for StarCoder2
- Minor optimizations and a couple of bugfixes
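The Q4 cache stores keys and values quantized to four bits, cutting K/V cache VRAM to roughly a quarter of FP16. A minimal sketch, assuming `ExLlamaV2Cache_Q4` works as a drop-in replacement for `ExLlamaV2Cache` (the model path is hypothetical):

```python
# Sketch: loading a model with the Q4 cache. ExLlamaV2Cache_Q4 is used
# like ExLlamaV2Cache; only the storage format of the K/V cache changes.
# Model path is a placeholder.
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache_Q4

config = ExLlamaV2Config()
config.model_dir = "/path/to/exl2-model"  # hypothetical path
config.prepare()

model = ExLlamaV2(config)
cache = ExLlamaV2Cache_Q4(model, lazy = True)  # Q4 instead of FP16 cache
model.load_autosplit(cache)
```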
Full Changelog: v0.0.14...v0.0.15
0.0.14
Adds support for Qwen1.5 and Gemma architectures.
Various fixes and optimizations.
Full Changelog since 0.0.13: v0.0.13...v0.0.14
0.0.13.post2
Full Changelog: 0.0.13.post1...0.0.13.post2
0.0.13.post1
Fixes inference on models with vocab sizes that are not multiples of 32