-
-
Notifications
You must be signed in to change notification settings - Fork 8.9k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Fix] Update mamba_ssm to 2.2.5
ci/build
documentation
Improvements or additions to documentation
#21421
opened Jul 23, 2025 by
elvischenv
Loading…
1 of 4 tasks
[Bugfix][CUDA] fixes CUDA FP8 kv cache dtype supported
ready
ONLY add when PR is ready to merge/full CI is needed
#21420
opened Jul 23, 2025 by
elvischenv
Loading…
1 of 4 tasks
[V1] Fix local chunked attention always disabled
#21419
opened Jul 23, 2025 by
sarckk
Loading…
3 of 4 tasks
[TPU][TEST] Fix the downloading issue in TPU v1 test 11.
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#21418
opened Jul 22, 2025 by
QiliangCui
Loading…
[WIP] Prepare for DI integration. Currently DI is not used; but this is to make sure both paths run fine.
tpu
Related to Google TPUs
v1
#21415
opened Jul 22, 2025 by
yarongmu-google
Loading…
4 tasks
Intentionally fail parallel sampling test
v1
#21413
opened Jul 22, 2025 by
sethkimmel3
Loading…
1 of 4 tasks
[v1][attention] Support Hybrid Allocator + FlashInfer
rocm
Related to AMD ROCm
v1
#21412
opened Jul 22, 2025 by
heheda12345
Loading…
3 of 4 tasks
[NVIDIA] Explicitly disable shuffled weights for flashinfer blockscale moe fp8 kernels
#21411
opened Jul 22, 2025 by
kaixih
Loading…
Changing "amdproduction" allocation.
ci/build
rocm
Related to AMD ROCm
#21409
opened Jul 22, 2025 by
Alexei-V-Ivanov-AMD
Loading…
Clean up usages of
SpecializedManager
v1
#21407
opened Jul 22, 2025 by
zhouwfang
Loading…
3 of 4 tasks
Fix amd build fail caused by #21803
rocm
Related to AMD ROCm
#21405
opened Jul 22, 2025 by
charlifu
Loading…
Refactor dense FP8 tensor/channel/block utils and add CT FP8 block
ready
ONLY add when PR is ready to merge/full CI is needed
#21404
opened Jul 22, 2025 by
mgoin
Loading…
[Misc] Update Dockerfile FlashInfer to v0.2.8
ci/build
#21402
opened Jul 22, 2025 by
stmcginnis
Loading…
[Core] Add basic unit test for maybe_evict_cached_block
v1
#21400
opened Jul 22, 2025 by
Jialin
Loading…
3 of 4 tasks
[Bug] Warning for Deprecated Device Param
frontend
#21397
opened Jul 22, 2025 by
yewentao256
Loading…
[BugFix] Update python to python3 calls for image; fix prefix & input calculations.
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
#21391
opened Jul 22, 2025 by
ericehanley
Loading…
1 task
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.