gesanqiu
Contributor

If the first generated token is a special token and skip_special_tokens=True is set, we must not skip it when computing read_offset. Otherwise the last input token is returned as new_text, so it appears twice in the output.
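
To illustrate the offset problem described above, here is a minimal, self-contained sketch of the first detokenization step. It is not the code from this PR: the toy token list, the `SPECIAL_TOKENS` set, the function name `first_step_new_text`, and the `apply_fix` flag are all hypothetical, and tokens are treated as plain strings rather than tokenizer IDs. Only the names `read_offset`, `new_text`, and the idea of skipping special tokens come from the description.

```python
# Hypothetical special-token set for this toy example.
SPECIAL_TOKENS = {"</s>"}


def first_step_new_text(prompt_tokens, new_token,
                        skip_special_tokens=True, apply_fix=True):
    """Return the text attributed to the first generated token.

    Toy model of incremental detokenization: text before read_offset is
    treated as already emitted, text after it is the new text.
    """
    all_tokens = prompt_tokens + [new_token]
    if skip_special_tokens:
        # Special tokens are dropped, so a special first token never
        # appears in output_tokens at all.
        output_tokens = [t for t in all_tokens if t not in SPECIAL_TOKENS]
    else:
        output_tokens = all_tokens

    new_token_was_dropped = skip_special_tokens and new_token in SPECIAL_TOKENS
    if apply_fix and new_token_was_dropped:
        # Nothing new was appended, so everything is already "read".
        read_offset = len(output_tokens)
    else:
        # Assume the last entry of output_tokens is the new token.
        read_offset = max(len(output_tokens) - 1, 0)

    prefix_text = "".join(output_tokens[:read_offset])
    full_text = "".join(output_tokens)
    new_text = full_text[len(prefix_text):]
    return new_text


prompt = ["Hello", " world"]
# Without the adjustment, the last prompt token leaks out as new text.
print(repr(first_step_new_text(prompt, "</s>", apply_fix=False)))
# With the adjustment, a skipped special token yields no new text.
print(repr(first_step_new_text(prompt, "</s>", apply_fix=True)))
```

In this sketch the unadjusted path assumes the last visible token is the newly generated one. That assumption fails exactly when the new token was filtered out as a special token, which is why the last input token gets repeated.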

@WoosukKwon
Collaborator

@gesanqiu LGTM. Thanks for the PR!
@Yard1 Thanks for the review!

@WoosukKwon WoosukKwon merged commit beac8dd into vllm-project:main Oct 29, 2023
@gesanqiu gesanqiu deleted the keep_first_special_token branch January 29, 2024 14:36
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
yiliu30 pushed a commit to yiliu30/vllm-fork that referenced this pull request Aug 14, 2025
Follow-up for vllm-project#1178

Signed-off-by: Karol Damaszke <kdamaszke@habana.ai>