-
-
Notifications
You must be signed in to change notification settings - Fork 6.1k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Attention] Default to FlashMLA backend for MLA
#14451
opened Mar 7, 2025 by
LucasWilkinson
Loading…
[Feature]: PD separation supports prefix caching #12257
#14440
opened Mar 7, 2025 by
skyCreateXian
Loading…
Add training doc signposting to TRL
documentation
Improvements or additions to documentation
#14439
opened Mar 7, 2025 by
hmellor
Loading…
[Usage] Refactor speculative decoding configuration and tests
documentation
Improvements or additions to documentation
speculative-decoding
#14434
opened Mar 7, 2025 by
ShangmingCai
Loading…
[Kernel] [V1] Further optimizations to ROCm (Triton) Backend to better handle GQA.
#14431
opened Mar 7, 2025 by
tdoublep
Loading…
[Bugfix][Kernel]: Fix AllSpark kernel compilation errors and enable for CUDA < 12.0
ci/build
#14430
opened Mar 7, 2025 by
wyajieha
Loading…
[Refactor][Reasoning] Keep all logic about reasoning into one class
documentation
Improvements or additions to documentation
frontend
structured-output
#14428
opened Mar 7, 2025 by
gaocegege
Loading…
[Bugfix] Fix When choice the specified tool call, it returns a ToolCa…
ci/build
frontend
#14427
opened Mar 7, 2025 by
liuwwang
Loading…
[Bugfix][V1] Exclude HBM used by other processes when calculating peak memory during profile runs
v1
#14419
opened Mar 7, 2025 by
yeqcharlotte
Loading…
[Misc] Add get_stream_cls() method for Platform class
speculative-decoding
#14411
opened Mar 7, 2025 by
shen-shanshan
Loading…
[rlhf] support named placement group
documentation
Improvements or additions to documentation
#14410
opened Mar 7, 2025 by
youkaichao
•
Draft
Remove all references to PA in Engine v1
frontend
v1
#14408
opened Mar 7, 2025 by
vincent-4
Loading…
[Bugfix] Make the fused_moe code compatible with non-triton supported hardware
#14400
opened Mar 7, 2025 by
shen-shanshan
Loading…
[Kernel] Update cutlass FP8 blockwise to use upstream CUTLASS
ci/build
#14395
opened Mar 7, 2025 by
LucasWilkinson
•
Draft
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.