It looks like recent updates to llama.cpp (e.g. ggerganov/llama.cpp#1797) have modified the API significantly with regard to how "state" is handled.
The llama_model is now loaded with a single API call (llama_load_model_from_file), which loads all of the static data (weights, vocabulary, etc.), and then you can create one or more states over it (llama_new_context_with_model).
Is anyone else working on this? If not, I'm happy to have a go at it.
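For reference, the new two-step flow described above looks roughly like the sketch below. This is only an illustration of the split between loading and context creation; the exact parameter structs and the model file path (`model.bin` here) will depend on the llama.cpp version in use.

```c
// Sketch of the post-#1797 two-step API: load the model once,
// then create any number of independent contexts ("states") over it.
// Struct and function names follow llama.h; exact signatures may
// differ between llama.cpp versions.
#include "llama.h"

int main(void) {
    struct llama_context_params params = llama_context_default_params();

    // Step 1: load the static data (weights, vocabulary, ...) once.
    struct llama_model *model =
        llama_load_model_from_file("model.bin", params);
    if (model == NULL) {
        return 1;
    }

    // Step 2: create one or more states over the shared model.
    struct llama_context *ctx_a = llama_new_context_with_model(model, params);
    struct llama_context *ctx_b = llama_new_context_with_model(model, params);

    // ... evaluate tokens against each context independently ...

    // Contexts must be freed before the model they were created from.
    llama_free(ctx_a);
    llama_free(ctx_b);
    llama_free_model(model);
    return 0;
}
```

The point of the split is that the expensive, read-only data is loaded once, while each context carries only the mutable per-session state (e.g. the KV cache), so multiple sessions can share one model in memory.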