Skip to content

Conversation

@danbev
Copy link
Member

@danbev danbev commented Nov 24, 2025

This commit adds the --kv-unified flag to the usage example in the README.md file for the batched example.

The motivation for this is that without this flag the example will fail with the following error:

Hello my name is
split_equal: sequential split is not supported when there are coupled
sequences in the input batch (you may need to use the -kvu flag)
decode: failed to find a memory slot for batch of size 4
main: llama_decode() failed

This commit adds the --kv-unified flag to the usage example
in the README.md file for the batched example.

The motivation for this is that without this flag the example will fail
with the following error:
```console
Hello my name is
split_equal: sequential split is not supported when there are coupled
sequences in the input batch (you may need to use the -kvu flag)
decode: failed to find a memory slot for batch of size 4
main: llama_decode() failed
```
@danbev danbev requested a review from ggerganov as a code owner November 24, 2025 13:34
@ggerganov ggerganov merged commit 6ab8eac into ggml-org:master Nov 24, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants