v0.1.9
- Updates the bindings to work with the new llama.cpp API from ggml-org/llama.cpp#370
- Adds two separate interfaces: `LlamaInference`, which is similar to the bindings in v0.1.8, and the lower-level `LlamaContext` (currently untested)
- The old bindings are still present in `PyLlama.cpp` but are currently not compiled and will be removed at a later date