Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[ROCm][Kernel] MoE weights padding
#14454 opened Mar 7, 2025 by gshtras Loading…
[VLM] Add TP support for Phi-4-MM
#14453 opened Mar 7, 2025 by Isotr0py Draft
[ROCm] Fix kernel cache miss in Triton FA
#14448 opened Mar 7, 2025 by hyoon1 Loading…
Add training doc signposting to TRL documentation Improvements or additions to documentation
#14439 opened Mar 7, 2025 by hmellor Loading…
[BUGFIX] fix the need_recv method of model_runner
#14436 opened Mar 7, 2025 by maobaolong Loading…
[Usage] Refactor speculative decoding configuration and tests documentation Improvements or additions to documentation speculative-decoding
#14434 opened Mar 7, 2025 by ShangmingCai Loading…
[rlhf] support named placement group documentation Improvements or additions to documentation
#14410 opened Mar 7, 2025 by youkaichao Draft
Clean up Engine Args & Documentation documentation Improvements or additions to documentation
#14409 opened Mar 7, 2025 by vincent-4 Draft
[Misc] add disable_progress_bar to reduce logs
#14407 opened Mar 7, 2025 by aarnphm Loading…
A different take
#14393 opened Mar 7, 2025 by drisspg Draft
[neuron] add reshape_and_cache
#14391 opened Mar 7, 2025 by liangfu Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.