-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Open
Description
System Info
tgi:3.2.3
Information
- Docker
- The CLI directly
Tasks
- An officially supported command
- My own modifications
Reproduction
- Input a long text, concatenate the contents of user and assistant
- truncate part of the text. Set the length of the output truncated text.
- The model directly outputs "", and the finish_reason shows the reason as "length".
Expected behavior
It will also occur.: The model suddenly starts giving random answers during the inference process.
Metadata
Metadata
Assignees
Labels
No labels