Pull requests: vllm-project/vllm
Pull requests list
[Misc] fix pre-commit fail about qwen2_5
qwen
Related to Qwen models
#19834
opened Jun 19, 2025 by
andyxning
4 tasks
[WIP] Async Scheduler Prototype
needs-rebase
qwen
Related to Qwen models
structured-output
v1
#19831
opened Jun 19, 2025 by
LucasWilkinson
Draft
4 tasks
[Core] Add Flashinfer TRTLLM Backend for Flashinfer decode path (SM100).
v1
#19825
opened Jun 19, 2025 by
pavanimajety
Draft
1 of 4 tasks
WIP [P/D] Use ThreadPoolExecutor to do handshake for each P-D pair
needs-rebase
v1
#19823
opened Jun 19, 2025 by
lk-chen
2 of 4 tasks
[Feature] Integrate new deepgemm
deepseek
Related to DeepSeek models
ready
ONLY add when PR is ready to merge/full CI is needed
#19820
opened Jun 18, 2025 by
yewentao256
LoRA support on llama4
llama
Related to Llama models
#19819
opened Jun 18, 2025 by
frank-wei
1 of 4 tasks
[Kernel] Add Conch backend for mixed-precision linear layer
ci/build
#19818
opened Jun 18, 2025 by
jmanning-stackav
Introduce RayCudaCommunicator as Ray Compiled Graph communicator
#19816
opened Jun 18, 2025 by
ruisearch42
Draft
4 tasks
Improve quant config semantic clarity, add Nvidia ModelOpt config adaptation
#19815
opened Jun 18, 2025 by
Edwardf0t1
3 of 4 tasks
[MISC] add cpu_kvcache_space_bytes to CacheConfig
#19812
opened Jun 18, 2025 by
andyxning
4 tasks
raise exception for pin_lora
ready
ONLY add when PR is ready to merge/full CI is needed
#19809
opened Jun 18, 2025 by
andyxning
4 tasks
[Misc] DeepSeek Decode Optimizations
#19807
opened Jun 18, 2025 by
varun-sundar-rabindranath
Draft
[Misc] Enable fp8_dispatch for DeepEP
qwen
Related to Qwen models
#19806
opened Jun 18, 2025 by
varun-sundar-rabindranath
[Misc] [ROCm] Prevent surplus tensor reshape
ready
ONLY add when PR is ready to merge/full CI is needed
rocm
Related to AMD ROCm
v1
#19803
opened Jun 18, 2025 by
zsolt-borbely-htec
[Minor] Allow redirecting model path for HfRunner in test
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#19795
opened Jun 18, 2025 by
Isotr0py
3 of 4 tasks
Add SM120 to the Dockerfile
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#19794
opened Jun 18, 2025 by
mgoin
Mark invariant normalizer in Gemma as non-persistent
force-merge
ready
ONLY add when PR is ready to merge/full CI is needed
#19788
opened Jun 18, 2025 by
yhtang
[Ray] v1 Change device str for platform compatibility
v1
#19785
opened Jun 18, 2025 by
1StepForever
3 of 4 tasks
[WIP] Splitting attention _fwd_grouped_kernel_stage1 to improve occupancy
#19774
opened Jun 17, 2025 by
ekuznetsov139