Pull requests: vllm-project/vllm

Pull requests list

[Bugfix] Fix shape checking for Fuyu (labels: ready)
#21709 opened Jul 28, 2025 by DarkLight1337
1 of 4 tasks
[BugFix] Fix ChunkedLocalAttention when the hybrid kv-cache is disabled (labels: ready, v1)
#21707 opened Jul 28, 2025 by LucasWilkinson
3 of 4 tasks
update flashinfer to v0.2.9rc2 (labels: ci/build)
#21701 opened Jul 28, 2025 by weireweire
2 of 4 tasks
[Benchmark] Support ready check timeout in vllm bench serve (labels: performance, ready)
#21696 opened Jul 28, 2025 by yeqcharlotte
3 of 4 tasks
[Misc] Add unit tests for chunked local attention (labels: v1)
#21692 opened Jul 27, 2025 by sarckk
3 of 4 tasks
Deprecate V0 (labels: ci/build, v1)
#21690 opened Jul 27, 2025 by WoosukKwon
[Kernel][Triton] add bfloat16 support for awq
#21688 opened Jul 27, 2025 by mandeeplearning
4 tasks
Migrate KeyeImageInputs and KeyeVideoInputs to TensorSchema (labels: ready)
#21686 opened Jul 27, 2025 by bbeckca
Migrate InternVLImageInputs and InternVLVideoInputs to TensorSchema (labels: ready)
#21684 opened Jul 27, 2025 by bbeckca
Migrate Idefics3ImagePixelInputs and Idefics3ImageEmbeddingInputs to … (labels: ready)
#21683 opened Jul 27, 2025 by bbeckca
Migrate GraniteSpeechAudioInputs to TensorSchema (labels: ready)
#21682 opened Jul 27, 2025 by bbeckca
[feature] add log non default args in LLM frontend
#21680 opened Jul 27, 2025 by lengrongfu
4 tasks
Migrate GLMVImagePixelInputs to TensorSchema (labels: ready)
#21679 opened Jul 27, 2025 by bbeckca
Migrate Glm4vImageInputs, Glm4vVideoInputs to TensorSchema (labels: ready)
#21678 opened Jul 27, 2025 by bbeckca
Migrate Gemma3ImagePixelInputs to TensorSchema (labels: ready)
#21676 opened Jul 27, 2025 by bbeckca
[Bugfix] fix max-file-size type from str to int (labels: documentation)
#21675 opened Jul 27, 2025 by andyxning
4 tasks
[Model] [Draft PR] Add support for SmallThinker model series (labels: documentation, new-model)
#21670 opened Jul 27, 2025 by SorryMaker2022 Draft
4 tasks
Introduce RayPPCommunicator for ray-based PP
#21660 opened Jul 26, 2025 by ruisearch42
3 of 4 tasks
Keep reasoning content before applying chat template (labels: frontend)
#21655 opened Jul 26, 2025 by lhdeng-gh
3 of 4 tasks
Fix(benchmarks): Correct tqdm import to resolve TypeError in benchmark_w8a8_block_fp8.py (labels: performance)
#21650 opened Jul 26, 2025 by Aymendje
4 tasks done