Pull requests: vllm-project/vllm
Pull requests list
[Fix][ROCm] Remove unused variables to fix build error on GFX11/12
rocm
Related to AMD ROCm
#19891
opened Jun 20, 2025 by
hyoon1
3 of 4 tasks
[New model support]Support Tarsier2
documentation
Improvements or additions to documentation
multi-modality
Related to multi-modality (#4194)
qwen
Related to Qwen models
#19887
opened Jun 20, 2025 by
princepride
[EP+DP] Optimize the little operations in the DeepGEMM + DeepEP low latency case
deepseek
Related to DeepSeek models
performance
Performance-related issues
qwen
Related to Qwen models
#19885
opened Jun 20, 2025 by
tlrmchlsmth
[Core] Add update_load_config RPC method
tpu
Related to Google TPUs
v1
#19884
opened Jun 20, 2025 by
22quinn
4 tasks done
[Misc] Add type alias ReqId and EngineId for better readability
ready
ONLY add when PR is ready to merge/full CI is needed
#19880
opened Jun 19, 2025 by
lk-chen
1 of 4 tasks
[Quantization] Add compressed-tensors emulations support for NVFP4
#19879
opened Jun 19, 2025 by
dsikka
[Fix] import regex instead of re
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
tool-calling
#19875
opened Jun 19, 2025 by
tdoublep
[V1 Scheduler] BatchScheduler to balance token-based microbatches and reduce GPU pipeline bubbles
documentation
Improvements or additions to documentation
v1
#19873
opened Jun 19, 2025 by
juncheoll
[Docs] Fix syntax highlighting of shell commands
ci/build
documentation
Improvements or additions to documentation
tool-calling
tpu
Related to Google TPUs
#19870
opened Jun 19, 2025 by
lgeiger
[Chore]: qwen3-moe-type-hints-mistake
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#19860
opened Jun 19, 2025 by
Xerxes-cn
4 tasks
[Misc] refactor example - openai_transcription_client
documentation
Improvements or additions to documentation
#19851
opened Jun 19, 2025 by
reidliu41
4 tasks
refactor example - qwen3_reranker
documentation
Improvements or additions to documentation
qwen
Related to Qwen models
#19847
opened Jun 19, 2025 by
reidliu41
4 tasks
[BugFix][V0] Fix AssertionError for prompt_logprobs
v0
#19844
opened Jun 19, 2025 by
xu-song
1 of 4 tasks
Add Cutlass integration for MoE FP8
needs-rebase
v1
#19843
opened Jun 19, 2025 by
JackChuang
3 of 4 tasks
[P/D] Asynchronously do _nixl_handshake
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#19836
opened Jun 19, 2025 by
lk-chen
2 of 4 tasks
[WIP] Async Scheduler Prototype
needs-rebase
qwen
Related to Qwen models
structured-output
tpu
Related to Google TPUs
v1
#19831
opened Jun 19, 2025 by
LucasWilkinson
Draft