Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Fix] Update mamba_ssm to 2.2.5 ci/build documentation Improvements or additions to documentation
#21421 opened Jul 23, 2025 by elvischenv Loading…
1 of 4 tasks
[Bugfix][CUDA] fixes CUDA FP8 kv cache dtype supported ready ONLY add when PR is ready to merge/full CI is needed
#21420 opened Jul 23, 2025 by elvischenv Loading…
1 of 4 tasks
[V1] Fix local chunked attention always disabled
#21419 opened Jul 23, 2025 by sarckk Loading…
3 of 4 tasks
[TPU][TEST] Fix the downloading issue in TPU v1 test 11. ci/build ready ONLY add when PR is ready to merge/full CI is needed
#21418 opened Jul 22, 2025 by QiliangCui Loading…
Support Pathways in vLLM v1
#21417 opened Jul 22, 2025 by wenxindongwork Loading…
4 tasks done
[BUGFIX] deepseek-v2-lite failed due to fused_qkv_a_proj name update bug Something isn't working deepseek Related to DeepSeek models ready ONLY add when PR is ready to merge/full CI is needed
#21414 opened Jul 22, 2025 by xuechendi Loading…
1 of 4 tasks
Intentionally fail parallel sampling test v1
#21413 opened Jul 22, 2025 by sethkimmel3 Loading…
1 of 4 tasks
[v1][attention] Support Hybrid Allocator + FlashInfer rocm Related to AMD ROCm v1
#21412 opened Jul 22, 2025 by heheda12345 Loading…
3 of 4 tasks
Changing "amdproduction" allocation. ci/build rocm Related to AMD ROCm
#21409 opened Jul 22, 2025 by Alexei-V-Ivanov-AMD Loading…
Update flashinfer CUTLASS MoE Kernel
#21408 opened Jul 22, 2025 by wenscarl Draft
4 tasks
Clean up usages of SpecializedManager v1
#21407 opened Jul 22, 2025 by zhouwfang Loading…
3 of 4 tasks
Fix amd build fail caused by #21803 rocm Related to AMD ROCm
#21405 opened Jul 22, 2025 by charlifu Loading…
Refactor dense FP8 tensor/channel/block utils and add CT FP8 block ready ONLY add when PR is ready to merge/full CI is needed
#21404 opened Jul 22, 2025 by mgoin Loading…
[Core] Add basic unit test for maybe_evict_cached_block v1
#21400 opened Jul 22, 2025 by Jialin Loading…
3 of 4 tasks
[Sampler] Introduce logprobs mode for logging ready ONLY add when PR is ready to merge/full CI is needed tpu Related to Google TPUs v1
#21398 opened Jul 22, 2025 by houseroad Loading…
4 tasks done
[wip] v1
#21395 opened Jul 22, 2025 by hj-mistral Draft
[Do not merge] Refactor JambaForCausalLM
#21394 opened Jul 22, 2025 by jeejeelee Loading…
4 tasks
[Bugfix][ROCm][Build] Fix build regression on ROCm ci/build ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#21393 opened Jul 22, 2025 by gshtras Loading…
[BugFix] Update python to python3 calls for image; fix prefix & input calculations. performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed
#21391 opened Jul 22, 2025 by ericehanley Loading…
1 task
ProTip! Exclude everything labeled bug with -label:bug.