Conversation

ServeurpersoCom (Collaborator)

Introduce OpenAI-compatible model selector in JSON payload

This PR adds a minimal model selector to the WebUI sidebar, allowing users to pick an available model exposed through the /v1/models OpenAI-compatible endpoint.

The selector automatically fetches and lists models from the server, persists the selected model in local storage, and sends it in the JSON body of subsequent /v1/chat/completions requests. The selection logic mirrors OpenAI’s client behavior while remaining fully offline-compatible with local llama.cpp instances.
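
A minimal sketch of that flow, assuming a plain fetch-based client; the helper names and the 'selectedModel' storage key are illustrative, not the actual WebUI code:

```ts
// Illustrative sketch only: fetchModels/saveSelectedModel/sendChat are hypothetical
// helpers, not the actual WebUI implementation.

interface ModelEntry {
  id: string;
}

// List models exposed by the OpenAI-compatible endpoint ({ data: [...] } envelope).
async function fetchModels(baseUrl: string): Promise<ModelEntry[]> {
  const res = await fetch(`${baseUrl}/v1/models`);
  const json = await res.json();
  return json.data ?? [];
}

// Persist the chosen model so it survives page reloads.
function saveSelectedModel(id: string): void {
  localStorage.setItem('selectedModel', id);
}

// Include the persisted model in the JSON body of subsequent chat requests.
async function sendChat(baseUrl: string, messages: { role: string; content: string }[]) {
  const model = localStorage.getItem('selectedModel') ?? undefined;
  return fetch(`${baseUrl}/v1/chat/completions`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ model, messages, stream: true }),
  });
}
```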

This enables direct interoperability with OpenAI-compatible clients and simplifies multi-model setups in the WebUI.

Restore OpenAI-compatible model source of truth and unify metadata capture:

This change re-establishes a single, reliable source of truth for the active model, fully aligned with OpenAI-compatible API behavior.

It introduces a unified metadata flow that captures the model field from both streaming and non-streaming responses, wiring a new onModel callback through ChatService. The model name is now resolved directly from the API payload rather than relying on server /props or UI assumptions.

ChatStore records and persists the resolved model for each assistant message during streaming, ensuring consistency across the UI and database. Type definitions for API and settings were also extended to include model metadata and the onModel callback, completing the alignment with OpenAI-Compat semantics.
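
As a rough illustration of that capture path (the real ChatService and ChatStore interfaces may differ; the types and names below are assumptions, not the merged code):

```ts
// Sketch of the unified metadata capture from OpenAI-compatible responses.

type OnModel = (model: string) => void;

interface StreamChunk {
  model?: string;
  choices?: { delta?: { content?: string } }[];
}

// Non-streaming: the model name is a top-level field of the completion payload.
async function readCompletion(res: Response, onModel: OnModel): Promise<string> {
  const json = await res.json();
  if (typeof json.model === 'string') onModel(json.model);
  return json.choices?.[0]?.message?.content ?? '';
}

// Streaming: every SSE chunk repeats the model field; report it once, from the first chunk seen.
function readChunk(chunk: StreamChunk, state: { modelReported: boolean }, onModel: OnModel): string {
  if (!state.modelReported && chunk.model) {
    onModel(chunk.model);
    state.modelReported = true;
  }
  return chunk.choices?.[0]?.delta?.content ?? '';
}
```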

Remaining '/props' usage audit in the WebUI:

A repository-wide search inside 'tools/server/webui' shows that the remaining '/props' references are intentional, because the WebUI still needs to bootstrap and validate server capabilities outside of chat responses:

  • 'src/routes/+layout.svelte' and 'src/lib/stores/server.svelte.ts' fetch '/props' on application startup to populate the global server store with template, model alias, and capability metadata that never appears in chat completions (see the sketch after this list).
  • 'src/lib/components/app/server/ServerErrorSplash.svelte' and 'src/lib/components/app/chat/ChatScreen/ChatScreenWarning.svelte' surface fallback UI when '/props' is unreachable, ensuring the user understands cached data might be stale.
  • 'src/lib/utils/api-key-validation.ts' validates API keys against '/props' so that the UI can warn about incompatible keys before issuing chat requests.
  • 'src/lib/services/chat.ts' performs a last-resort fetch to '/props' when the streaming handshake fails, preserving compatibility with legacy servers that only expose model names via that endpoint.
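
For context, a sketch of the kind of startup bootstrap the first item refers to; the store shape and the '/props' field names are assumptions, not the actual server.svelte.ts code:

```ts
// Hypothetical bootstrap of a global server store from /props.
// Field names below are assumptions about the llama-server payload.

interface ServerProps {
  model_path?: string;
  chat_template?: string;
  [key: string]: unknown; // other capability metadata
}

let serverProps: ServerProps | null = null;

async function bootstrapServerProps(baseUrl: string): Promise<void> {
  try {
    const res = await fetch(`${baseUrl}/props`);
    if (!res.ok) throw new Error(`HTTP ${res.status}`);
    serverProps = (await res.json()) as ServerProps;
  } catch {
    // /props unreachable: keep any cached value and let the UI surface a warning.
  }
}
```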

ServeurpersoCom (Collaborator, Author)

TL;DR:
Adds a lightweight model selector for the WebUI using the /v1/models OpenAI-compatible endpoint.
Selected models are persisted locally and included in chat request payloads (model field).
Also unifies model metadata capture during streaming and non-streaming responses: the WebUI now uses a single source of truth for the active model across the stack.

ServeurpersoCom (Collaborator, Author) commented Oct 13, 2025

@ngxson :) What do you think about this approach? The aim is to stay compatible with:

  • the current standalone llama-server,
  • llama-swap,
  • and future multi-model evolutions of llama-server.

It introduces a unified, KISS, OpenAI-compatible model selection path while keeping everything backward-compatible with existing setups.

A standalone llama-server on a Raspberry Pi 5:
[screenshot]
I'll have to filter the model path here too (?)

ServeurpersoCom (Collaborator, Author)

@allozaur mind taking a look at those default Svelte arrows and the scrolling manager? I figured your Svelte wizardry might know the cleanest way to get rid of them 😄 I like things to be pixel-perfect, but it looks like this is built into the framework, and I’d rather not bypass Svelte just for that.
[screenshot: SvelteArrow]

allozaur (Collaborator)

> @allozaur mind taking a look at those default Svelte arrows and the scrolling manager? I figured your Svelte wizardry might know the cleanest way to get rid of them 😄 I like things to be pixel-perfect, but it looks like this is built into the framework, and I’d rather not bypass Svelte just for that.
>
> [screenshot: SvelteArrow]

Yep, will take a look at that and come back to u with an answer 😉
