Update on the development branch #1125
kaiyux
announced in
Announcements
Replies: 3 comments 9 replies
-
What does it mean
3 replies
-
Could you share some use cases of the weightless engine, please?
5 replies
-
Could you share whether there has been any update on issue #800 regarding the Triton backend for enc_dec models?
1 reply
-
Hi,
The TensorRT-LLM team is pleased to announce that we are pushing an update to the development branch (and the Triton backend) on February 21, 2024.
This update includes:
encoder_input_len_range should not be 0, thanks to the contribution from @Eddie-Wang1120 in "Fix enc_dec bug and Make several improvements to whisper" (#992)
gptDecoderBatch to support batched sampling
gptManagerBenchmark
Thanks,
The TensorRT-LLM Engineering Team