v0.1.11

thomasantony released this 31 Mar 16:28

· 34 commits to master since this release

751bfb3

Breaking change but makes model loading practically instantaneous thanks to memory-mapped I/O
Requires re-generating the weight files using the new convert script (or use the migration script from llama.cpp)

Assets 2