Pull requests: vllm-project/vllm

Pull requests list

[Feature] The Qwen3 reasoning parser supports guided decoding
Labels: ready
#17466 opened Apr 30, 2025 by chaunceyjiang
[Model] Add GraniteMoeHybrid 4.0 model
Labels: ci/build, documentation, frontend, multi-modality, needs-rebase, tool-calling, v1
#17461 opened Apr 30, 2025 by s3woz (Draft)
[Misc] refactor example - cpu_offload_lmcache
Labels: documentation
#17460 opened Apr 30, 2025 by reidliu41
[CI/Build] Reorganize models tests
Labels: ci/build, multi-modality
#17459 opened Apr 30, 2025 by DarkLight1337
Improve configs - ObservabilityConfig
#17453 opened Apr 30, 2025 by hmellor
Fix more broken speculative decode tests
Labels: ready, speculative-decoding
#17450 opened Apr 30, 2025 by huydhn
fix tmp_out and exp_sums dimensions
#17438 opened Apr 30, 2025 by hliuca
[Feat] Add deprecated=True to CLI args
#17426 opened Apr 30, 2025 by aarnphm
[Fix] Support passing args to logger
Labels: multi-modality, structured-output
#17425 opened Apr 29, 2025 by aarnphm
[Misc][AMD] Add query_platform method to interface.py
#17424 opened Apr 29, 2025 by rasmith
[Chore] import as annotations on config
Labels: needs-rebase, ready
#17423 opened Apr 29, 2025 by aarnphm
Avoid overwriting vllm_compile_cache.py
Labels: ready
#17418 opened Apr 29, 2025 by youngkent
[DO NOT MERGE] Manual Fusion PR for Comparison
#17417 opened Apr 29, 2025 by rasmith
Fix noisy warning for uncalibrated q_scale/p_scale
#17414 opened Apr 29, 2025 by mgoin
[Bugfix] Temporarily disable gptq_bitblas on ROCm
Labels: documentation
#17411 opened Apr 29, 2025 by nlzy