Pull requests: vllm-project/vllm

Pull requests list

[Fix][ROCm] Remove unused variables to fix build error on GFX11/12
Labels: rocm
#19891 opened Jun 20, 2025 by hyoon1
3 of 4 tasks

[Misc] Clean up useless code
#19889 opened Jun 20, 2025 by wangxiyuan
4 tasks done

[New model support]Support Tarsier2
Labels: documentation, multi-modality, qwen
#19887 opened Jun 20, 2025 by princepride

[EP+DP] Optimize the little operations in the DeepGEMM + DeepEP low latency case
Labels: deepseek, performance, qwen
#19885 opened Jun 20, 2025 by tlrmchlsmth

[Core] Add update_load_config RPC method
Labels: tpu, v1
#19884 opened Jun 20, 2025 by 22quinn
4 tasks done

[Misc] Add type alias ReqId and EngineId for better readability
Labels: ready
#19880 opened Jun 19, 2025 by lk-chen
1 of 4 tasks

Add page-aligned prefill scheduling.
Labels: v1
#19878 opened Jun 19, 2025 by py4

[Fix] import regex instead of re
Labels: frontend, ready, tool-calling
#19875 opened Jun 19, 2025 by tdoublep

[BugFix][P/D] Fix for cases where _recving_transfers can be cleaned up when *all* transfer done
Labels: bug, ready, v1
#19874 opened Jun 19, 2025 by lk-chen
4 tasks

[Docs] Fix syntax highlighting of shell commands
Labels: ci/build, documentation, tool-calling, tpu
#19870 opened Jun 19, 2025 by lgeiger

[Misc] add vllm_config in __init__
#19866 opened Jun 19, 2025 by andyxning
4 tasks

[Chore]: qwen3-moe-type-hints-mistake
Labels: qwen, ready
#19860 opened Jun 19, 2025 by Xerxes-cn
4 tasks

optimze attn
Labels: qwen
#19858 opened Jun 19, 2025 by momo609
4 tasks

[Chore] logging metrics rename
Labels: v1
#19852 opened Jun 19, 2025 by aarnphm

[Misc] refactor example - openai_transcription_client
Labels: documentation
#19851 opened Jun 19, 2025 by reidliu41
4 tasks

v1: Introduce an offloading component
Labels: ci/build, v1
#19848 opened Jun 19, 2025 by orozery

refactor example - qwen3_reranker
Labels: documentation, qwen
#19847 opened Jun 19, 2025 by reidliu41
4 tasks

[BugFix][V0] Fix AssertionError for prompt_logprobs
Labels: v0
#19844 opened Jun 19, 2025 by xu-song
1 of 4 tasks

Add Cutlass integration for MoE FP8
Labels: needs-rebase, v1
#19843 opened Jun 19, 2025 by JackChuang
3 of 4 tasks

[P/D] Asynchronously do _nixl_handshake
Labels: ready, v1
#19836 opened Jun 19, 2025 by lk-chen
2 of 4 tasks

[WIP] Async Scheduler Prototype
Labels: needs-rebase, qwen, structured-output, tpu, v1
#19831 opened Jun 19, 2025 by LucasWilkinson (Draft)

FP8 custom ops
Labels: v1
#19830 opened Jun 19, 2025 by ProExpertProg (Draft)
4 tasks