v0.1.0
What's Changed
- add option by @qwopqwop200 in #23
- Add gpt2 by @qwopqwop200 in #30
- Fix bug speedup quant and support gpt2 by @qwopqwop200 in #29
- Offloading and Multiple devices quantization/inference by @PanQiWei in #24
- Add raise exception and gpt2 xl example add by @qwopqwop200 in #31
- Allow to load arbitrary models by @z80maniac in #33
- Change save name by @qwopqwop200 in #34
- Fix typo: 'hole' -> 'whole' by @TheBloke in #40
- bug fix quantization demo by @qwopqwop200 in #37
- Check that model_save_name exists before trying to load it, to avoid confusing checkpoint error by @TheBloke in #39
- Faster cuda no actorder by @qwopqwop200 in #38
New Contributors
- @z80maniac made their first contribution in #33
- @TheBloke made their first contribution in #40
Full Changelog: v0.0.5...v0.1.0