llama : skip output reordering for single token batches #17466

danbev · 2025-11-24T11:24:34Z

This commit adds a check to skip the output reordering logic when n_outputs == 1. With a single output token, the data is trivially sorted and the reordering code is currently doing unnecessary work (resetting and rebuilding output_ids to the same values).

The motivation for this change is improved code clarity and avoiding confusion when debugging. While the performance impact is probably negligible, this unnecessary work happens on every decode call in llama-server when processing batches with single-token outputs.
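The idea described above can be sketched in isolation. This is a simplified illustration, not the llama.cpp code: `output_map` and `map_outputs` are hypothetical names, the sorted check is an assumption about how the existing logic detects unsorted outputs, and `std::sort` stands in for the real reordering of output rows. The point is that a single output at a non-zero batch position is flagged as unsorted even though there is nothing to reorder, so an extra `n_outputs > 1` guard avoids the redundant rebuild.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for the output mapping step (not the llama.cpp API).
struct output_map {
    std::vector<int32_t> output_ids; // batch position -> output row, -1 if the token has no output
    bool reordered = false;          // true if the reordering path ran
};

// out_ids: batch positions of the tokens that produce output, in the order
// the outputs were computed. n_tokens: total tokens in the batch.
output_map map_outputs(std::vector<int64_t> out_ids, size_t n_tokens) {
    output_map res;
    res.output_ids.assign(n_tokens, -1);

    const size_t n_outputs = out_ids.size();

    // Build the initial mapping and check whether output i sits at batch position i.
    bool sorted_output = true;
    for (size_t i = 0; i < n_outputs; ++i) {
        res.output_ids[out_ids[i]] = (int32_t) i;
        if (out_ids[i] != (int64_t) i) {
            sorted_output = false;
        }
    }

    // A single output is trivially in order, so skip the reordering work.
    if (!sorted_output && n_outputs > 1) {
        std::sort(out_ids.begin(), out_ids.end()); // stand-in for swapping output rows
        std::fill(res.output_ids.begin(), res.output_ids.end(), -1);
        for (size_t i = 0; i < n_outputs; ++i) {
            res.output_ids[out_ids[i]] = (int32_t) i;
        }
        res.reordered = true;
    }

    return res;
}
```

For example, `map_outputs({5}, 6)` models a six-token batch where only the last token produces output: the mapping is already correct after the first loop, and the guard keeps the reordering path from running.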

danbev requested a review from ggerganov as a code owner November 24, 2025 11:24

loci-dev mentioned this pull request Nov 24, 2025

UPSTREAM PR #17466: llama : skip output reordering for single token batches auroralabs-loci/llama.cpp#304 (Open)

ggerganov approved these changes Nov 24, 2025

danbev merged commit 134e694 into ggml-org:master Nov 24, 2025
73 of 74 checks passed