-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Insights: huggingface/text-generation-inference
Overview
-
- 6 Merged pull requests
- 2 Open pull requests
- 0 Closed issues
- 3 New issues
Could not load contribution data
Please try again later
6 Pull requests merged by 3 people
-
[gaudi] Fix the CI test errors
#3286 merged
Jul 7, 2025 -
[gaudi] Deepseek v2 mla and add ep to unquantized moe
#3287 merged
Jul 7, 2025 -
[gaudi] Remove unnecessary reinitialize to HeterogeneousNextTokenChooser to m…
#3284 merged
Jul 3, 2025 -
Optimum neuron 0.2.2
#3281 merged
Jul 3, 2025 -
xpu lora support
#3232 merged
Jul 2, 2025 -
[gaudi] Gemma3 sliding window support
#3280 merged
Jul 1, 2025
2 Pull requests opened by 2 people
-
Update quantization kernels
#3288 opened
Jul 7, 2025 -
fix: enable defs references in tool calls
#3291 opened
Jul 7, 2025
3 Issues opened by 3 people
-
Please add mistral-small-2506
#3290 opened
Jul 7, 2025 -
How to detect watermark?
#3289 opened
Jul 7, 2025 -
Error when launching Magistral-Small-2506
#3285 opened
Jul 4, 2025
6 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Multi modality fix
#3283 commented on
Jul 7, 2025 • 1 new comment -
Quantized Qwen3
#3226 commented on
Jul 1, 2025 • 0 new comments -
RuntimeError: Cannot load 'awq' weight when running Qwen2-VL-72B-Instruct-AWQ model
#2944 commented on
Jul 1, 2025 • 0 new comments -
Qwen 3 support
#3199 commented on
Jul 1, 2025 • 0 new comments -
Models no more supported with neuron backend in optimum-neuron 0.2.0
#3279 commented on
Jul 3, 2025 • 0 new comments -
fix outline import issue
#3282 commented on
Jul 1, 2025 • 0 new comments