Fix the penultimate token sometimes being lost with SSE streaming by pi6am · Pull Request #1031 · LostRuins/koboldcpp

pi6am · 2024-07-29T06:06:18Z

The token immediately before an eot token was lost when SSE streaming was enabled if that token was contained entirely within a stop sequence. As an example of when this could happen, consider this prompt:

Type the phrase 'pleas' once.

In a Llama 3-derived model, 'pleas' tokenizes as 'ple' 'as'. The token 'as' is contained within this instruct mode stop sequence: <|eot_id|><|start_header_id|>assistant<|end_header_id|> due to the word 'assistant'. Since string_contains_sequence_substring returns True for 'as', this token is added to tokenReserve instead of being streamed immediately. If the '<|eot_id|>' token was generated next, the text in tokenReserve would be discarded.

Merge upstream

The token immediately before an eot token was lost when SSE streaming was enabled if that token was contained entirely within a stop sequence. As an example of when this could happen, consider this prompt: Type the phrase 'pleas' once. In a Llama 3-derived model, 'pleas' tokenizes as 'ple' 'as'. The token 'as' is contained within this instruct mode stop sequence: <|eot_id|><|start_header_id|>assistant<|end_header_id|> due to the word 'assistant'. Since `string_contains_sequence_substring` returns True for 'as', this token is added to `tokenReserve` instead of being streamed immediately. If the '<|eot_id|>' token was generated next, the text in `tokenReserve` would be discarded.

LostRuins

thanks

pi6am added 4 commits July 9, 2024 00:10

Merge pull request #1 from LostRuins/concedo

63c4437

Merge upstream

Merge pull request #3 from LostRuins/concedo

5a25d61

Merge upstream

Merge pull request #4 from LostRuins/concedo

694bf6b

Merge upstream

LostRuins added bug Something isn't working needs review needs review labels Jul 29, 2024

LostRuins approved these changes Jul 29, 2024

View reviewed changes

LostRuins changed the base branch from concedo to concedo_experimental July 29, 2024 12:16

LostRuins merged commit 26f1df5 into LostRuins:concedo_experimental Jul 29, 2024

pi6am deleted the fix/penultimate-sse branch July 29, 2024 15:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix the penultimate token sometimes being lost with SSE streaming#1031

Fix the penultimate token sometimes being lost with SSE streaming#1031
LostRuins merged 4 commits intoLostRuins:concedo_experimentalfrom
pi6am:fix/penultimate-sse

pi6am commented Jul 29, 2024

Uh oh!

LostRuins left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

pi6am commented Jul 29, 2024

Uh oh!

LostRuins left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants