-
-
Notifications
You must be signed in to change notification settings - Fork 8.3k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[doc] use MkDocs collapsible blocks - supplement
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#19973
opened Jun 23, 2025 by
reidliu41
Loading…
4 tasks
[Bugfix] Fix CI bitsandbytes failure
ready
ONLY add when PR is ready to merge/full CI is needed
#19969
opened Jun 23, 2025 by
jeejeelee
Loading…
1 of 4 tasks
feat: add reward model + min_p speculative decode
frontend
qwen
Related to Qwen models
#19968
opened Jun 23, 2025 by
jatery55555
Loading…
4 tasks
feat: offload weights to cpu before fp8 online quant
documentation
Improvements or additions to documentation
#19967
opened Jun 23, 2025 by
yma11
Loading…
[Chore] Clarifying log messages for KV Connector
ready
ONLY add when PR is ready to merge/full CI is needed
#19965
opened Jun 23, 2025 by
aarnphm
Loading…
feat(audio): add flag for Whisper chunking (#19772)
frontend
#19961
opened Jun 23, 2025 by
hardikkgupta
Loading…
1 of 4 tasks
[CI/Build] Add basic multimodal lm eval for CI testing
ci/build
#19959
opened Jun 23, 2025 by
yeqcharlotte
Loading…
3 of 4 tasks
[Doc] cmd+k
documentation
Improvements or additions to documentation
#19957
opened Jun 22, 2025 by
aarnphm
Loading…
[Bugfix][v1] Fix step pooler implementation and step pooling usage in v1
multi-modality
Related to multi-modality (#4194)
qwen
Related to Qwen models
v1
#19956
opened Jun 22, 2025 by
Isotr0py
Loading…
3 of 4 tasks
[Doc] Update V1 status for decoder-only embedding models
documentation
Improvements or additions to documentation
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#19952
opened Jun 22, 2025 by
Isotr0py
Loading…
1 of 4 tasks
[Perf][Frontend]: eliminate api_key and x_request_id headers middleware overhead
documentation
Improvements or additions to documentation
frontend
needs-rebase
#19946
opened Jun 22, 2025 by
Yazan-Sharaya
Loading…
4 tasks done
[Bugfix] fix sampling seeding being off when sequences are prempted
#19940
opened Jun 21, 2025 by
Jackmin801
•
Draft
[PERF] Speedup of MRoPE prepare inputs
qwen
Related to Qwen models
v1
#19939
opened Jun 21, 2025 by
vadiklyutiy
Loading…
3 tasks done
[Bugfix][Benchmark] Fix Marlin benchmark
perf-benchmarks
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
#19929
opened Jun 21, 2025 by
22quinn
Loading…
4 tasks done
[TPU] add kv cache update kernel
ci/build
tpu
Related to Google TPUs
v1
#19928
opened Jun 21, 2025 by
yaochengji
Loading…
[V1] Solve potential deadlock issue in v1 engine core client internally
v1
#19927
opened Jun 21, 2025 by
Isotr0py
Loading…
3 of 4 tasks
[doc] improve readability for long commands
documentation
Improvements or additions to documentation
#19920
opened Jun 20, 2025 by
reidliu41
Loading…
4 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.