Misc. bug: WebUI incorrectly displays local model names #16771

@jfslee

Name and Version

version: 6821 (a2e0088)

Operating systems

Windows

Which llama.cpp modules do you know to be affected?

llama-server

Command line

llama-server --offline -m <some-model.gguf>

Problem description & steps to reproduce

Starting from b6821, the webUI no longer correctly shows the model used to generate a response when running a model offline from a local GGUF file. The last release in which this worked correctly was b6818.

Steps to reproduce:

  1. Run llama-server using a local GGUF file.
    llama-server --offline -m .\gemma-3-4b-it-Q6_K.gguf
  2. Launch the webUI. In the settings page, ensure that the checkbox for 'Show model information' under the 'General' tab is checked.
  3. Start a conversation.
  4. Close the server and restart it using a different local GGUF file.
    llama-server --offline -m .\Qwen3-4B-Instruct-2507-Q6_K.gguf
  5. Launch the webUI and either continue the previous conversation, regenerate the most recent response or start a new conversation.

Expected behavior:
The webUI should show the first message as generated by gemma-3-4b-it-Q6_K.gguf and the second message as generated by Qwen3-4B-Instruct-2507-Q6_K.gguf.

However, since b6821, all responses are instead shown as being generated by whichever local model is loaded.

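To make the difference between the expected and the reported behavior concrete, here is a minimal sketch in TypeScript. It is not taken from the llama.cpp webUI source: the `StoredMessage` shape and both function names are hypothetical, and the sketch assumes that each stored message can carry the name of the model that generated it. It only illustrates what the two behaviors look like from the outside and makes no claim about how commit 9b9201f actually changed the code.

```typescript
// Hypothetical shape of a stored chat message. Field names are illustrative
// and are NOT taken from the llama.cpp webUI source.
interface StoredMessage {
  role: "user" | "assistant";
  content: string;
  // Model name recorded when the response was generated, if any (assumption).
  model?: string;
}

// Expected behavior: prefer the model name stored with the message, and fall
// back to the currently loaded model only when none was recorded.
function displayModelName(message: StoredMessage, loadedModel: string): string {
  return message.model ?? loadedModel;
}

// Behavior reported since b6821: every message is labeled with the currently
// loaded model, regardless of which model produced it.
function displayModelNameReported(_message: StoredMessage, loadedModel: string): string {
  return loadedModel;
}

// Conversation from the reproduction steps: the first response came from the
// Gemma file, the second from the Qwen file after the server was restarted.
const history: StoredMessage[] = [
  { role: "assistant", content: "first reply", model: "gemma-3-4b-it-Q6_K.gguf" },
  { role: "assistant", content: "second reply", model: "Qwen3-4B-Instruct-2507-Q6_K.gguf" },
];
const loadedModel = "Qwen3-4B-Instruct-2507-Q6_K.gguf";

const expectedLabels = history.map((m) => displayModelName(m, loadedModel));
const reportedLabels = history.map((m) => displayModelNameReported(m, loadedModel));

console.log(expectedLabels);
console.log(reportedLabels);
```

With the expected behavior each message keeps its own label, while the reported behavior labels both messages with the Qwen file because that is the model currently loaded.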

First Bad Commit

This seems to have been introduced by commit 9b9201f.

Relevant log output
