-
-
Notifications
You must be signed in to change notification settings - Fork 9k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Bugfix] Fix shape checking for Fuyu
ready
ONLY add when PR is ready to merge/full CI is needed
#21709
opened Jul 28, 2025 by
DarkLight1337
Loading…
1 of 4 tasks
[BugFix] Fix ChunkedLocalAttention when the hybrid kv-cache is disabled
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#21707
opened Jul 28, 2025 by
LucasWilkinson
Loading…
3 of 4 tasks
[Bugfix][Frontend] Fix create_error_response return 200 status code
frontend
#21705
opened Jul 28, 2025 by
kebe7jun
Loading…
3 of 4 tasks
update flashinfer to v0.2.9rc2
ci/build
#21701
opened Jul 28, 2025 by
weireweire
Loading…
2 of 4 tasks
[Benchmark] Support ready check timeout in Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
vllm bench serve
performance
#21696
opened Jul 28, 2025 by
yeqcharlotte
Loading…
3 of 4 tasks
[Misc] Add unit tests for chunked local attention
v1
#21692
opened Jul 27, 2025 by
sarckk
Loading…
3 of 4 tasks
[BugFix] Potential fix for FlashMLA full cuda-graph + DP
v1
#21691
opened Jul 27, 2025 by
LucasWilkinson
•
Draft
4 tasks
[Kernel][Triton] add bfloat16 support for awq
#21688
opened Jul 27, 2025 by
mandeeplearning
Loading…
4 tasks
Migrate KeyeImageInputs and KeyeVideoInputs to TensorSchema
ready
ONLY add when PR is ready to merge/full CI is needed
#21686
opened Jul 27, 2025 by
bbeckca
Loading…
Migrate InternVLImageInputs and InternVLVideoInputs to TensorSchema
ready
ONLY add when PR is ready to merge/full CI is needed
#21684
opened Jul 27, 2025 by
bbeckca
Loading…
Migrate Idefics3ImagePixelInputs and Idefics3ImageEmbeddingInputs to …
ready
ONLY add when PR is ready to merge/full CI is needed
#21683
opened Jul 27, 2025 by
bbeckca
Loading…
Migrate GraniteSpeechAudioInputs to TensorSchema
ready
ONLY add when PR is ready to merge/full CI is needed
#21682
opened Jul 27, 2025 by
bbeckca
Loading…
[feature] add log non default args in LLM
frontend
#21680
opened Jul 27, 2025 by
lengrongfu
Loading…
4 tasks
Migrate GLMVImagePixelInputs to TensorSchema
ready
ONLY add when PR is ready to merge/full CI is needed
#21679
opened Jul 27, 2025 by
bbeckca
Loading…
Migrate Glm4vImageInputs, Glm4vVideoInputs to TensorSchema
ready
ONLY add when PR is ready to merge/full CI is needed
#21678
opened Jul 27, 2025 by
bbeckca
Loading…
Migrate Gemma3ImagePixelInputs to TensorSchema
ready
ONLY add when PR is ready to merge/full CI is needed
#21676
opened Jul 27, 2025 by
bbeckca
Loading…
[Bugfix] fix max-file-size type from str to int
documentation
Improvements or additions to documentation
#21675
opened Jul 27, 2025 by
andyxning
Loading…
4 tasks
[Misc] Remove duplicate code and fix comment errors to improve code readability
v1
#21673
opened Jul 27, 2025 by
tanruixiang
Loading…
3 of 4 tasks
[Model] [Draft PR] Add support for SmallThinker model series
documentation
Improvements or additions to documentation
new-model
Requests to new models
#21670
opened Jul 27, 2025 by
SorryMaker2022
•
Draft
4 tasks
Introduce RayPPCommunicator for ray-based PP
#21660
opened Jul 26, 2025 by
ruisearch42
Loading…
3 of 4 tasks
Keep reasoning content before applying chat template
frontend
#21655
opened Jul 26, 2025 by
lhdeng-gh
Loading…
3 of 4 tasks
Limit concurrent long partial prefills via max_long_partial_prefills
v1
#21651
opened Jul 26, 2025 by
pansicheng
Loading…
3 of 4 tasks
Fix(benchmarks): Correct tqdm import to resolve TypeError in benchmark_w8a8_block_fp8.py
performance
Performance-related issues
#21650
opened Jul 26, 2025 by
Aymendje
Loading…
4 tasks done
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.