-
Notifications
You must be signed in to change notification settings - Fork 132
Open
Labels
bugSomething isn't workingSomething isn't working
Description
System Info
tensorrt_llm==0.11.0.dev2024061800
Who can help?
Information
- The official example scripts
- My own modified scripts
Tasks
- An officially supported task in the
examplesfolder (such as GLUE/SQuAD, ...) - My own task or dataset (give details below)
Reproduction
deploy a model with beam_width > 1 and trtllm backend, request the BLS model with geneate_stream endpoint and stream: true
Expected behavior
the accumulate_tokens should be able to True
actual behavior
error thrown: Accumulation of tokens is only implemented for beam width = 1
additional notes
Maybe all we need to do is enhance the BLS script I think?
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working