
trim chat prompt based on llm context size #1963

Merged: 8 commits into main on Jan 30, 2024

Conversation

@BruceMacD (Contributor) commented on Jan 12, 2024

When trimming the input chat prompt we need to make sure we keep the prompt template in the expected format. Without this, once the maximum context length is reached the prompt is trimmed without accounting for the model template, which can result in unexpected behavior from the model.

  • update the ChatPrompt function to return a list of prompt variables, allowing the calling function to assemble them into the final prompt
  • create the final prompt based on the loaded LLM's context window size, while preserving the prompt template formatting and the system message in the first message of the new context window (see the sketch after this list)
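The idea can be illustrated with a small, self-contained sketch. This is not the PR's actual implementation: the Message type, the trimMessages function, and the word-count tokenizer below are hypothetical stand-ins for the project's real message type, prompt-building code, and tokenizer. The sketch drops whole messages from the oldest end of the history until the remainder fits the model's context window, and always keeps the leading system message so the templated prompt stays well formed.

```go
package main

import (
	"fmt"
	"strings"
)

// Message is a simplified stand-in for a chat message: a role ("system",
// "user", "assistant") and its content.
type Message struct {
	Role    string
	Content string
}

// trimMessages keeps the most recent messages that fit within numCtx tokens,
// always preserving the first system message so the templated prompt keeps
// its expected structure. countTokens is a placeholder for a real tokenizer.
func trimMessages(msgs []Message, numCtx int, countTokens func(string) int) []Message {
	if len(msgs) == 0 {
		return msgs
	}

	var system *Message
	rest := msgs
	if msgs[0].Role == "system" {
		system = &msgs[0]
		rest = msgs[1:]
	}

	// Reserve room for the system message before fitting the rest.
	budget := numCtx
	if system != nil {
		budget -= countTokens(system.Content)
	}

	// Walk backwards so the newest messages are kept first; stop at the
	// first message that no longer fits, dropping it and everything older.
	kept := []Message{}
	for i := len(rest) - 1; i >= 0; i-- {
		n := countTokens(rest[i].Content)
		if n > budget {
			break
		}
		budget -= n
		kept = append([]Message{rest[i]}, kept...)
	}

	if system != nil {
		kept = append([]Message{*system}, kept...)
	}
	return kept
}

func main() {
	// Crude tokenizer approximation: whitespace-separated words.
	countTokens := func(s string) int { return len(strings.Fields(s)) }

	history := []Message{
		{Role: "system", Content: "You are a helpful assistant."},
		{Role: "user", Content: "Tell me a very long story about boats."},
		{Role: "assistant", Content: "Once upon a time there was a boat that sailed far away."},
		{Role: "user", Content: "Summarize that story."},
	}

	// With a tiny context budget, only the system message and the most
	// recent user turn survive.
	for _, m := range trimMessages(history, 12, countTokens) {
		fmt.Printf("%s: %s\n", m.Role, m.Content)
	}
}
```

Trimming whole messages rather than cutting text mid-message is what keeps the template markers paired up, so the final rendered prompt still matches the format the model was trained on.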

Resolved review threads on server/routes.go (outdated) and server/images.go.
Follow-up commits:
- only encode each prompt once
- reduce nested functions
@BruceMacD merged commit 0632dff into main on Jan 30, 2024
13 checks passed
@BruceMacD deleted the brucemacd/template-token-smart branch on January 30, 2024