Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add the instruction to run e2e validation manually before release documentation Improvements or additions to documentation
#21023 opened Jul 16, 2025 by huydhn Loading…
4 tasks done
[Misc] Minor comment reorganization in capture_model() v1
#21015 opened Jul 15, 2025 by ruisearch42 Loading…
3 of 4 tasks
[Docker] Allow FlashInfer to be built in the ARM CUDA Dockerfile ci/build ready ONLY add when PR is ready to merge/full CI is needed
#21013 opened Jul 15, 2025 by mgoin Loading…
4 tasks
Update PyTorch to torch==2.7.1 for CUDA ci/build ready ONLY add when PR is ready to merge/full CI is needed
#21011 opened Jul 15, 2025 by mgoin Loading… v0.10.0
[protocol] Add request_id to the Request object so they can be controlled better via external load balancers frontend ready ONLY add when PR is ready to merge/full CI is needed
#21009 opened Jul 15, 2025 by kouroshHakha Loading…
4 tasks
[Not for merge] Unshift eagle prefill documentation Improvements or additions to documentation llama Related to Llama models needs-rebase new-model Requests to new models speculative-decoding v1
#21008 opened Jul 15, 2025 by morgendave Draft
4 tasks
Start using py3.12 for TPU. ci/build documentation Improvements or additions to documentation tpu Related to Google TPUs
#21000 opened Jul 15, 2025 by vanbasten23 Loading…
3 of 4 tasks
[Misc] unify variable for LLM instance documentation Improvements or additions to documentation llama Related to Llama models qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed v1
#20996 opened Jul 15, 2025 by andyxning Loading…
4 tasks
[Performance] EPLB Execution Optimization v1
#20990 opened Jul 15, 2025 by david6666666 Draft
4 of 9 tasks
[CI] [Doc]: Add GH Action for auto labeling issues with rocm tag ci/build rocm Related to AMD ROCm
#20988 opened Jul 15, 2025 by vllmellm Loading…
3 of 4 tasks
fix(completion): always include usage frontend
#20983 opened Jul 15, 2025 by max-wittig Loading…
3 of 4 tasks
Resolved the extremely large block_size problem v1
#20977 opened Jul 15, 2025 by nadathurv Loading…
add support for qwen3 moe model EPLB qwen Related to Qwen models
#20967 opened Jul 15, 2025 by hsliuustc Loading…
2 of 4 tasks
Fix tool_calls to fit with openai client frontend
#20966 opened Jul 15, 2025 by relic-yuexi Loading…
3 of 4 tasks
[DP/EP] PPLX<>Triton Debug v1
#20957 opened Jul 15, 2025 by robertgshaw2-redhat Draft
4 tasks
Enable v1 metrics tests ci/build v1
#20953 opened Jul 14, 2025 by eicherseiji Loading…
3 of 4 tasks
Add add_logger API to AsyncLLM v1
#20952 opened Jul 14, 2025 by eicherseiji Draft
3 of 4 tasks
ProTip! Mix and match filters to narrow down what you’re looking for.