fix(llama.cpp): disable infinite context shifting #1704

mudler · 2024-02-13T17:54:24Z

Infinite context loop might as well trigger an infinite loop of context shifting if the model hallucinates and does not stop answering. This has the unpleasant effect that the prediction never terminates, which is the case especially on small models which tends to hallucinate. The behavior is observable when the assigned slot exceed the context size, which triggers context shifting, and can be reproduced by specifying a really small context size window.

Workarounds #1333 by removing context-shifting until it is fixed upstream.

See also upstream issue: ggerganov/llama.cpp#3969

netlify · 2024-02-13T17:54:27Z

✅ Deploy Preview for localai canceled.

Name	Link
🔨 Latest commit	`22f8902`
🔍 Latest deploy log	https://app.netlify.com/sites/localai/deploys/65cbadc119fb0100088729d5

Infinite context loop might as well trigger an infinite loop of context shifting if the model hallucinates and does not stop answering. This has the unpleasant effect that the predicion never terminates, which is the case especially on small models which tends to hallucinate. Workarounds #1333 by removing context-shifting. See also upstream issue: ggerganov/llama.cpp#3969

mudler force-pushed the disable_context_shift branch from 89a3363 to 66b4af0 Compare February 13, 2024 17:57

mudler mentioned this pull request Feb 13, 2024

llama.cpp: infinite loop of context switch #1333

Closed

mudler force-pushed the disable_context_shift branch from 66b4af0 to 22f8902 Compare February 13, 2024 17:58

mudler added bug Something isn't working upstream issue labels Feb 13, 2024

mudler merged commit c56b6dd into master Feb 13, 2024
26 checks passed

mudler deleted the disable_context_shift branch February 13, 2024 20:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(llama.cpp): disable infinite context shifting #1704

fix(llama.cpp): disable infinite context shifting #1704

mudler commented Feb 13, 2024 •

edited

Loading

netlify bot commented Feb 13, 2024 •

edited

Loading

fix(llama.cpp): disable infinite context shifting #1704

fix(llama.cpp): disable infinite context shifting #1704

Conversation

mudler commented Feb 13, 2024 • edited Loading

netlify bot commented Feb 13, 2024 • edited Loading

✅ Deploy Preview for localai canceled.

mudler commented Feb 13, 2024 •

edited

Loading

netlify bot commented Feb 13, 2024 •

edited

Loading