You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have searched for all existing open AND closed issues and discussions for similar requests. I have found none that is comparable to my request.
Verify Feature Scope
I have read through and understood the scope definition for feature requests in the Issues section. I believe my feature request meets the definition and belongs in the Issues section instead of the Discussions.
Problem Description
When using a model that is already loaded in Ollama, OpenWebUI sees that it's already loaded and highlights it green. However if the model was loaded with a non-standard context window, OpenWebui will apply the default context, causing the model to reload for a chat session, delaying the response. The model will then have to be reloaded again when used at the custom context length.
Desired Solution you'd like
An option in settings for chats without pre-defined context lengths to use the currently loaded model context length to keep the model from being reloaded. This will allow for immediate processing of requests for quick chats with pre-loaded models. This option won't apply to custom models created in the workspace with customized context lengths.
This discussion was converted from issue #19319 on November 20, 2025 19:47.
Heading
Bold
Italic
Quote
Code
Link
Numbered list
Unordered list
Task list
Attach files
Mention
Reference
Menu
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Check Existing Issues
Verify Feature Scope
Problem Description
When using a model that is already loaded in Ollama, OpenWebUI sees that it's already loaded and highlights it green. However if the model was loaded with a non-standard context window, OpenWebui will apply the default context, causing the model to reload for a chat session, delaying the response. The model will then have to be reloaded again when used at the custom context length.
Desired Solution you'd like
An option in settings for chats without pre-defined context lengths to use the currently loaded model context length to keep the model from being reloaded. This will allow for immediate processing of requests for quick chats with pre-loaded models. This option won't apply to custom models created in the workspace with customized context lengths.
Alternatives Considered
No response
Additional Context
No response
Beta Was this translation helpful? Give feedback.
All reactions