Pull requests: vllm-project/vllm

Pull requests list

[Core][Distributed] add same-node detection
#5369 opened Jun 10, 2024 by youkaichao
[Misc] Various simplifications and typing fixes
#5368 opened Jun 9, 2024 by njhill
[Core][Bugfix]: fix prefix caching for blockv2
#5364 opened Jun 9, 2024 by leiwen83
[Model][Bugfix] Add GLM-4v support
#5358 opened Jun 8, 2024 by songxxzp
remove sort_keys=True in guided_decoding
#5332 opened Jun 7, 2024 by DeyangKong
[ci] Use small_cpu_queue for doc build
#5331 opened Jun 7, 2024 by khluu
[MISC] Upgrade dependency to PyTorch 2.3.1
#5327 opened Jun 7, 2024 by comaniac
[WIP][Hardware] Initial TPU integration (label: tpu)
#5292 opened Jun 5, 2024 by WoosukKwon Draft
2 tasks