Pull requests: vllm-project/vllm
Pull requests list
[CI/Build][Bugfix] Fix deadlock on v1 engine test CI
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#19872 opened Jun 19, 2025 by Isotr0py
1 of 4 tasks
[Docs] Fix syntax highlighting of shell commands
ci/build
documentation
Improvements or additions to documentation
tool-calling
tpu
Related to Google TPUs
#19870 opened Jun 19, 2025 by lgeiger
[Benchmark][Bugfix] Fix Dataset Length Calculation
#19868 opened Jun 19, 2025 by robertgshaw2-redhat
3 of 4 tasks
[Chore]: qwen3-moe-type-hints-mistake
qwen
Related to Qwen models
#19860 opened Jun 19, 2025 by Xerxes-cn
4 tasks
[Misc] refactor example - openai_transcription_client
documentation
Improvements or additions to documentation
#19851 opened Jun 19, 2025 by reidliu41
4 tasks
refactor example - qwen3_reranker
documentation
Improvements or additions to documentation
qwen
Related to Qwen models
#19847 opened Jun 19, 2025 by reidliu41
4 tasks
[BugFix] Fix AssertionError for prompt_logprobs
#19844 opened Jun 19, 2025 by xu-song
1 of 4 tasks
Add Cutlass integration for MoE FP8
needs-rebase
v1
#19843 opened Jun 19, 2025 by JackChuang
3 of 4 tasks
[P/D] Asynchronously do _nixl_handshake
v1
#19836 opened Jun 19, 2025 by lk-chen
2 of 4 tasks
[WIP] Async Scheduler Prototype
needs-rebase
qwen
Related to Qwen models
structured-output
tpu
Related to Google TPUs
v1
#19831 opened Jun 19, 2025 by LucasWilkinson • Draft
4 tasks
[Core] Add Flashinfer TRTLLM Backend for Flashinfer decode path (SM100).
v1
#19825 opened Jun 19, 2025 by pavanimajety • Draft
1 of 4 tasks
WIP [P/D] Use ThreadPoolExecutor to do handshake for each P-D pair
needs-rebase
v1
#19823 opened Jun 19, 2025 by lk-chen
2 of 4 tasks
[Feature] Integrate new deepgemm
deepseek
Related to DeepSeek models
ready
ONLY add when PR is ready to merge/full CI is needed
#19820 opened Jun 18, 2025 by yewentao256
LoRA support on llama4
llama
Related to Llama models
#19819 opened Jun 18, 2025 by frank-wei
1 of 4 tasks
[Kernel] Add Conch backend for mixed-precision linear layer
ci/build
#19818 opened Jun 18, 2025 by jmanning-stackav
Introduce RayCudaCommunicator as Ray Compiled Graph communicator
#19816 opened Jun 18, 2025 by ruisearch42 • Draft
4 tasks
Improve quant config semantic clarity, add Nvidia ModelOpt config adaptation
#19815 opened Jun 18, 2025 by Edwardf0t1
3 of 4 tasks