v0.1.0

PanQiWei released this 04 May 16:19

What's Changed

Add option by @qwopqwop200 in #23
Add gpt2 by @qwopqwop200 in #30
Fix bug speedup quant and support gpt2 by @qwopqwop200 in #29
Offloading and Multiple devices quantization/inference by @PanQiWei in #24
Add raise exception and gpt2 xl example by @qwopqwop200 in #31
Allow to load arbitrary models by @z80maniac in #33
Change save name by @qwopqwop200 in #34
Fix typo: 'hole' -> 'whole' by @TheBloke in #40
Fix bug in quantization demo by @qwopqwop200 in #37
Check that model_save_name exists before trying to load it, to avoid confusing checkpoint error by @TheBloke in #39
Faster cuda no actorder by @qwopqwop200 in #38

New Contributors

@z80maniac made their first contribution in #33
@TheBloke made their first contribution in #40

Full Changelog: v0.0.5...v0.1.0

Contributors

TheBloke, z80maniac, and 2 other contributors
