Skip to content

Pull requests: deepinfra/text-generation-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Tgi router latency metrics
#10 opened Dec 15, 2023 by pathorn Loading…
Add support for Mistral
#9 opened Dec 7, 2023 by pathorn Loading…
Update flash attention to version v2.3.6
#8 opened Dec 1, 2023 by pathorn Loading…
Print stats about the KV Cache
#6 opened Oct 10, 2023 by NikolaBorisov Loading…
Update deps
#5 opened Oct 7, 2023 by NikolaBorisov Loading…
ProTip! Filter pull requests by the default branch with base:main.