vllm-project / vllm Public

Pull requests: vllm-project/vllm

866 Open 10,421 Closed

Pull requests list

[Bugfix] Fix shape checking for Fuyu
Labels: ready (ONLY add when PR is ready to merge/full CI is needed)
#21709 opened Jul 28, 2025 by DarkLight1337
1 of 4 tasks

[BugFix] Fix ChunkedLocalAttention when the hybrid kv-cache is disabled
Labels: ready
#21707 opened Jul 28, 2025 by LucasWilkinson
3 of 4 tasks

[Bugfix][Frontend] Fix create_error_response return 200 status code
Labels: frontend
#21705 opened Jul 28, 2025 by kebe7jun
3 of 4 tasks

[XPU] xpu punica wrapper with ipex kernel
#21703 opened Jul 28, 2025 by chaojun-zhang • Draft

update flashinfer to v0.2.9rc2
Labels: ci/build
#21701 opened Jul 28, 2025 by weireweire
2 of 4 tasks

[Benchmark] Support ready check timeout in vllm bench serve
Labels: performance (Performance-related issues), ready
#21696 opened Jul 28, 2025 by yeqcharlotte
3 of 4 tasks

[Misc] Add unit tests for chunked local attention
Labels: v1
#21692 opened Jul 27, 2025 by sarckk
3 of 4 tasks

[BugFix] Potential fix for FlashMLA full cuda-graph + DP
Labels: v1
#21691 opened Jul 27, 2025 by LucasWilkinson • Draft
4 tasks

Deprecate V0
Labels: ci/build, v1
#21690 opened Jul 27, 2025 by WoosukKwon

[Kernel][Triton] add bfloat16 support for awq
#21688 opened Jul 27, 2025 by mandeeplearning
4 tasks

Migrate KeyeImageInputs and KeyeVideoInputs to TensorSchema
Labels: ready
#21686 opened Jul 27, 2025 by bbeckca

Migrate InternVLImageInputs and InternVLVideoInputs to TensorSchema
Labels: ready
#21684 opened Jul 27, 2025 by bbeckca

Migrate Idefics3ImagePixelInputs and Idefics3ImageEmbeddingInputs to …
Labels: ready
#21683 opened Jul 27, 2025 by bbeckca

Migrate GraniteSpeechAudioInputs to TensorSchema
Labels: ready
#21682 opened Jul 27, 2025 by bbeckca

[feature] add log non default args in LLM frontend
#21680 opened Jul 27, 2025 by lengrongfu
4 tasks

Migrate GLMVImagePixelInputs to TensorSchema
Labels: ready
#21679 opened Jul 27, 2025 by bbeckca

Migrate Glm4vImageInputs, Glm4vVideoInputs to TensorSchema
Labels: ready
#21678 opened Jul 27, 2025 by bbeckca

Migrate Gemma3ImagePixelInputs to TensorSchema
Labels: ready
#21676 opened Jul 27, 2025 by bbeckca

[Bugfix] fix max-file-size type from str to int
Labels: documentation (Improvements or additions to documentation)
#21675 opened Jul 27, 2025 by andyxning
4 tasks

[Misc] Remove duplicate code and fix comment errors to improve code readability
Labels: v1
#21673 opened Jul 27, 2025 by tanruixiang
3 of 4 tasks

[Model] [Draft PR] Add support for SmallThinker model series
Labels: documentation, new-model (Requests to new models)
#21670 opened Jul 27, 2025 by SorryMaker2022 • Draft
4 tasks

Introduce RayPPCommunicator for ray-based PP
#21660 opened Jul 26, 2025 by ruisearch42
3 of 4 tasks

Keep reasoning content before applying chat template
Labels: frontend
#21655 opened Jul 26, 2025 by lhdeng-gh
3 of 4 tasks

Limit concurrent long partial prefills via max_long_partial_prefills
Labels: v1
#21651 opened Jul 26, 2025 by pansicheng
3 of 4 tasks

Fix(benchmarks): Correct tqdm import to resolve TypeError in benchmark_w8a8_block_fp8.py
Labels: performance
#21650 opened Jul 26, 2025 by Aymendje
4 tasks done
