Skip to content

v0.1.11

Choose a tag to compare

@thomasantony thomasantony released this 31 Mar 16:28
· 34 commits to master since this release
  • Breaking change but makes model loading practically instantaneous thanks to memory-mapped I/O
  • Requires re-generating the weight files using the new convert script (or use the migration script from llama.cpp)