-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Pull requests: huggingface/text-generation-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Expose the real-time internal state of the batcher through SSE
#3065
opened Feb 27, 2025 by
mfuntowicz
•
Draft
Added model name label to metrics and added an optional argument --served-model-name
wontfix
This will not be worked on
#3064
opened Feb 27, 2025 by
yashaswipiplani
Loading…
display available cached versions in TGI server error message of Neuron backend
#3063
opened Feb 26, 2025 by
jimburtoft
Loading…
4 tasks
Fix CPU and memory affinity under external resource management
#3012
opened Feb 11, 2025 by
askervin
Loading…
Add 'json_schema' alias to GrammarType.Json
#2982
opened Jan 31, 2025 by
aW3st
Loading…
2 of 5 tasks
Kvrouter that will increase the kv-cache hits in case of multiple routing strategy
#2965
opened Jan 29, 2025 by
Narsil
Loading…
5 tasks
misc(gha): expose action cache url and runtime as secrets
#2964
opened Jan 29, 2025 by
mfuntowicz
Loading…
llava next image encoder to allow un-aligned patch / image sizes
#2936
opened Jan 22, 2025 by
jimexist
Loading…
5 tasks
Update Dockerfile to use devel image for compatibility
#2848
opened Dec 16, 2024 by
YaserJaradeh
Loading…
2 of 5 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.