Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Misc] Bump ray to 2.48.0 ci/build ready ONLY add when PR is ready to merge/full CI is needed
#22123 opened Aug 2, 2025 by ruisearch42 Loading…
4 tasks
[WIP] vLLM Benchmark suite improvement ci/build performance Performance-related issues
#22119 opened Aug 2, 2025 by louie-tsai Loading…
1 of 4 tasks
[Bugfix] Fix broken Minimax-01-VL model documentation Improvements or additions to documentation
#22116 opened Aug 2, 2025 by Isotr0py Draft
1 of 4 tasks
[Fix] Fix python path resolving in cpu cmake ci/build
#22115 opened Aug 2, 2025 by xiszishu Loading…
3 of 4 tasks
feat: update flashinfer ar oneshot params
#22108 opened Aug 1, 2025 by yyihuang Loading…
4 tasks
[Fix] Fix llama4 modelopt weight loading error bug Something isn't working llama Related to Llama models ready ONLY add when PR is ready to merge/full CI is needed
#22107 opened Aug 1, 2025 by jiahanc Loading…
2 of 4 tasks
enable Docker-aware precompiled wheel setup ci/build ready ONLY add when PR is ready to merge/full CI is needed
#22106 opened Aug 1, 2025 by dougbtv Loading…
3 tasks done
Enable EPLB on ernie4.5-moe
#22100 opened Aug 1, 2025 by HsChen-sys Loading…
3 of 4 tasks
[Frontend] Update OpenAI error response to upstream format frontend
#22099 opened Aug 1, 2025 by msanft Loading…
3 of 4 tasks
[NVIDIA] Support Flashinfer TRT-LLM Prefill Attention Kernel needs-rebase performance Performance-related issues v1
#22095 opened Aug 1, 2025 by elvischenv Loading…
3 of 4 tasks
Fix for Issue #22092 frontend
#22094 opened Aug 1, 2025 by RobertFischer Loading…
[Misc] rename torch backend literal string with const var documentation Improvements or additions to documentation rocm Related to AMD ROCm tpu Related to Google TPUs v1
#22087 opened Aug 1, 2025 by andyxning Loading…
4 tasks
[Model] ROCm Flash-Attention Rotary Embedding and Sin/Cos Caching for Qwen VL Models qwen Related to Qwen models rocm Related to AMD ROCm
#22081 opened Aug 1, 2025 by vllmellm Draft
3 of 4 tasks
Adds MoE configuration for Jetson AGX Orin
#22078 opened Aug 1, 2025 by massif-01 Loading…
4 tasks
ProTip! Follow long discussions with comments:>50.