Skip to content

Pull requests: deepinfra/text-generation-inference

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Tgi router latency metrics
#10 opened Dec 15, 2023 by pathorn Loading…
Add support for Mistral
#9 opened Dec 7, 2023 by pathorn Loading…
Update flash attention to version v2.3.6
#8 opened Dec 1, 2023 by pathorn Loading…
Print stats about the KV Cache
#6 opened Oct 10, 2023 by NikolaBorisov Loading…
Update deps
#5 opened Oct 7, 2023 by NikolaBorisov Loading…
ProTip! Updated in the last three days: updated:>2024-05-23.