
Multiple sessions on one model. #41

Closed
BruceKristelijn opened this issue Jun 26, 2023 · 9 comments

Comments

@BruceKristelijn

Hi there, I would love to have multiple sessions on the same model, but the sessions seem to remember new information given by the other chat sessions. In the docs and settings I couldn't find anything. I am curious if this is something I am doing wrong?

@martindevans
Member

At the moment the model weights and the context are all bound together in one object. You need to save and restore "states" to have two contexts in one set of weights. This is due to how llama.cpp itself used to work.
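Roughly, the workaround looks like this (a minimal sketch; I'm assuming a SaveState/LoadState(string) pair on the model object, and exact names vary between versions):

```csharp
// Sketch of the save/restore workaround: weights and context are bound
// together in one LLamaModel, so we swap conversation state in and out.
// SaveState/LoadState(string) and the constructor shape are assumptions
// here; check the API of the version you're on.
using LLama;
using LLama.Common;

using var model = new LLamaModel(new ModelParams("path/to/model.bin"));
var executor = new InteractiveExecutor(model);

// ... run conversation A, then park its state on disk:
model.SaveState("conversation-a.state");

// ... run conversation B (note it starts from whatever state the model is
// in, so save/load a "blank" state first if you need a clean slate), then:
model.SaveState("conversation-b.state");

// Later, resume conversation A by restoring its state:
model.LoadState("conversation-a.state");
```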

They've since made a change which splits model weights and model contexts into two separate things, so you can make multiple contexts from one set of shared weights. My PR (#64) partially addresses this by adding in support for the new loading system. Future PRs will modify the higher level APIs to use this.
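Once that lands, sharing one set of weights across two isolated sessions could look something like this (a sketch; names like LLamaWeights/CreateContext follow later releases and are assumptions here, not necessarily the exact API at the time of writing):

```csharp
using LLama;
using LLama.Common;

var parameters = new ModelParams("path/to/model.bin") { ContextSize = 1024 };

// Load the (large) weights once...
using var weights = LLamaWeights.LoadFromFile(parameters);

// ...then create independent contexts that share them. Each context has
// its own KV cache, so one session cannot see the other's history.
using var contextA = weights.CreateContext(parameters);
using var contextB = weights.CreateContext(parameters);

var sessionA = new ChatSession(new InteractiveExecutor(contextA));
var sessionB = new ChatSession(new InteractiveExecutor(contextB));
```

Each context then pays its own KV-cache memory cost, but the multi-gigabyte weights are only loaded once.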

@BruceKristelijn
Author

Thanks for the response. I am trying it right now, but my responses seem to lose some context. I assumed that when a state is saved/loaded it retains the chat and prompt history, or am I mistaken?

@BruceKristelijn
Author

I tried including the chat history after loading the session again as well, but this seemed to reset the "memory" of the previous conversation too.
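For reference, roughly what I tried (a sketch; the ChatHistory/AuthorRole names and this ChatSession constructor are assumptions from newer builds and may not match the version in this thread):

```csharp
// Restore the low-level state first, then hand the text-level history back
// to the session. ChatHistory/AuthorRole and this ChatSession constructor
// are assumptions based on later releases.
using LLama;
using LLama.Common;

var parameters = new ModelParams("path/to/model.bin");
using var weights = LLamaWeights.LoadFromFile(parameters);
using var context = weights.CreateContext(parameters);
context.LoadState("conversation-a.state");

var history = new ChatHistory();
history.AddMessage(AuthorRole.User, "Hi, my name is Bruce.");
history.AddMessage(AuthorRole.Assistant, "Nice to meet you, Bruce!");

var session = new ChatSession(new InteractiveExecutor(context), history);
```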

@martindevans
Member

Is this testing all being done on top of my PR (#64), with master or with some other version?

@BruceKristelijn
Author

Not yet, this was my next course of action. I was hoping I understood the behaviour correctly first.

@martindevans
Member

I'm not too sure, sorry. I've been contributing PRs on some of the lower level bits of the stack, but not the "higher level" stuff yet. I do know there are a few layers which should all save and reload state together (executor, context, etc.), so maybe try tracing back through some of that to check it all looks reasonable.
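As a concrete example of what I mean by the layers, something like this (a sketch; SaveState/LoadState on both layers are assumptions based on my understanding of the newer API):

```csharp
// Sketch of saving each layer explicitly: the context (KV cache) and the
// executor (its own bookkeeping) each carry their own state, and they
// need to be saved and restored as a pair.
using LLama;
using LLama.Common;

var parameters = new ModelParams("path/to/model.bin");
using var weights = LLamaWeights.LoadFromFile(parameters);
using var context = weights.CreateContext(parameters);
var executor = new InteractiveExecutor(context);

// Save both layers together...
context.SaveState("ctx.state");
await executor.SaveState("executor.json");

// ...and reload both together later. Restoring only one of them is a
// plausible cause of "lost context" symptoms.
context.LoadState("ctx.state");
await executor.LoadState("executor.json");
```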

@BruceKristelijn
Author

BruceKristelijn commented Jul 30, 2023

Thanks, I just built your PR and it seems to work better without changing a lot of code, which is great! This might be the wrong place to ask, but I couldn't find it in the source code: do you know if LLamaSharp adds things like 'Assistant:' and 'User:' to the chat?

@martindevans
Member

martindevans commented Jul 30, 2023

As far as I know it does not, but that'd be in the higher level parts that I'm not too familiar with so I'm not too sure on that!
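If it turns out it doesn't, adding the tags yourself when building the prompt is easy enough. A rough sketch (the "User:"/"Assistant:" format here is just an illustration; the right tags depend on the model):

```csharp
using System.Collections.Generic;
using System.Text;

// Build a prompt from (role, text) turns, prefixing each line with its
// role tag and ending with a cue for the model to answer next.
string BuildPrompt(IEnumerable<(string Role, string Text)> turns)
{
    var sb = new StringBuilder();
    foreach (var (role, text) in turns)
        sb.AppendLine($"{role}: {text}");
    sb.Append("Assistant: ");
    return sb.ToString();
}

var prompt = BuildPrompt(new[]
{
    ("User", "Hi, my name is Bruce."),
    ("Assistant", "Nice to meet you, Bruce!"),
    ("User", "What's my name?"),
});
```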

@martindevans
Member

0.4.2 is out now. Does that resolve this issue?
