Capture model name only after first token (streaming) or completed request #16405

allozaur · 2025-10-03T08:08:57Z

Improves the logic for saving the used model information to not happen too early.

…ted request (non-streaming)

…-after-first-token

* origin/master: (124 commits) metal : fix loop bound in ggml_mem_ranges (ggml-org#16412) llama : fix shapes for bert/mpt q/k norm (ggml-org#16409) ggml : fix graph reallocation with multiple chunks (ggml-org#16396) Fix missing messages on sibling navigation (ggml-org#16408) vulkan: Replace uses of maxMemoryAllocationSize and VK_WHOLE_SIZE (ggml-org#16354) vulkan: Fix FA coopmat1 invalid array indexing (ggml-org#16365) ci : change macos-13 to macos-15-intel (ggml-org#16401) Capture model name only after first token (streaming) or completed request (ggml-org#16405) vulkan: in flash attention, bounds check against nem1 (don't rely on GGML_KQ_MASK_PAD) (ggml-org#16316) webui : Fix messages payload sent to chat completions (ggml-org#16402) fix: track viewportHeight via window.innerHeight to avoid unwanted scrolling (ggml-org#16356) test-barrier : do not use more threads than physically available (ggml-org#16389) ggml webgpu: add support for soft_max, optimize rms_norm (ggml-org#16357) model : Apertus model implementation (ggml-org#15852) musa: update compile flags (ggml-org#16265) ci : fix ubuntu-latest-cmake-rpc (disable ccache) (ggml-org#16388) ci: update vulkan ci (ggml-org#16294) ci : fix clean-up of old logs (ggml-org#16381) SYCL: Update to oneAPI 2025.2 (ggml-org#16371) HIP: add IMbackK to codeowner (ggml-org#16375) ...

allozaur added 2 commits October 3, 2025 09:19

feat: Capture model name only after first token (streaming) or comple…

ce0cf49

…ted request (non-streaming)

chore: update webui build output

d886359

allozaur requested a review from ggerganov October 3, 2025 08:08

allozaur added 2 commits October 3, 2025 10:10

Merge remote-tracking branch 'origin/master' into add-model-used-info…

0d8ec54

…-after-first-token

chore: update webui build output

9557d86

ggerganov approved these changes Oct 3, 2025

View reviewed changes

github-actions bot added examples server labels Oct 3, 2025

allozaur merged commit 7723327 into ggml-org:master Oct 3, 2025
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Capture model name only after first token (streaming) or completed request #16405

Capture model name only after first token (streaming) or completed request #16405

allozaur commented Oct 3, 2025

Uh oh!

Uh oh!

Uh oh!

Capture model name only after first token (streaming) or completed request #16405

Capture model name only after first token (streaming) or completed request #16405

Conversation

allozaur commented Oct 3, 2025

Uh oh!

Uh oh!

Uh oh!