Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

support --no-enable-chunked-prefill for V1
#19975 opened Jun 23, 2025 by liuyumoye Loading…
[doc] use MkDocs collapsible blocks - supplement documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#19973 opened Jun 23, 2025 by reidliu41 Loading…
4 tasks
Enabling Safe KVConnector
#19972 opened Jun 23, 2025 by prashant182 Loading…
[Core][V1] Support sharded state loading
#19971 opened Jun 23, 2025 by aarnphm Loading…
Implement Async Scheduling v1
#19970 opened Jun 23, 2025 by WoosukKwon Draft
4 tasks
[Bugfix] Fix CI bitsandbytes failure ready ONLY add when PR is ready to merge/full CI is needed
#19969 opened Jun 23, 2025 by jeejeelee Loading…
1 of 4 tasks
feat: add reward model + min_p speculative decode frontend qwen Related to Qwen models
#19968 opened Jun 23, 2025 by jatery55555 Loading…
4 tasks
feat: offload weights to cpu before fp8 online quant documentation Improvements or additions to documentation
#19967 opened Jun 23, 2025 by yma11 Loading…
[Chore] Clarifying log messages for KV Connector ready ONLY add when PR is ready to merge/full CI is needed
#19965 opened Jun 23, 2025 by aarnphm Loading…
[CI/Build] Upgrade lm-eval to 0.4.9 ci/build
#19962 opened Jun 23, 2025 by yeqcharlotte Loading…
feat(audio): add flag for Whisper chunking (#19772) frontend
#19961 opened Jun 23, 2025 by hardikkgupta Loading…
1 of 4 tasks
[CI/Build] Add basic multimodal lm eval for CI testing ci/build
#19959 opened Jun 23, 2025 by yeqcharlotte Loading…
3 of 4 tasks
[Doc] cmd+k documentation Improvements or additions to documentation
#19957 opened Jun 22, 2025 by aarnphm Loading…
[Bugfix][v1] Fix step pooler implementation and step pooling usage in v1 multi-modality Related to multi-modality (#4194) qwen Related to Qwen models v1
#19956 opened Jun 22, 2025 by Isotr0py Loading…
3 of 4 tasks
[Doc] Update V1 status for decoder-only embedding models documentation Improvements or additions to documentation qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#19952 opened Jun 22, 2025 by Isotr0py Loading…
1 of 4 tasks
[Perf][Frontend]: eliminate api_key and x_request_id headers middleware overhead documentation Improvements or additions to documentation frontend needs-rebase
#19946 opened Jun 22, 2025 by Yazan-Sharaya Loading…
4 tasks done
[PERF] Speedup of MRoPE prepare inputs qwen Related to Qwen models v1
#19939 opened Jun 21, 2025 by vadiklyutiy Loading…
3 tasks done
[BugFix] Fix multi-node offline data parallel bug Something isn't working ci/build frontend v1
#19937 opened Jun 21, 2025 by njhill Loading…
[Bugfix][Benchmark] Fix Marlin benchmark perf-benchmarks performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed
#19929 opened Jun 21, 2025 by 22quinn Loading…
4 tasks done
[TPU] add kv cache update kernel ci/build tpu Related to Google TPUs v1
#19928 opened Jun 21, 2025 by yaochengji Loading…
enable multiple ssm groups duplication
#19924 opened Jun 20, 2025 by ilyasch2 Loading…
2 of 4 tasks
Use FusedMoEQuantConfig everywhere rocm Related to AMD ROCm
#19921 opened Jun 20, 2025 by bnellnm Draft
4 tasks
[doc] improve readability for long commands documentation Improvements or additions to documentation
#19920 opened Jun 20, 2025 by reidliu41 Loading…
4 tasks
ProTip! Adding no:label will show everything without a label.