Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Chore] added stubs for vllm_flash_attn during development mode ci/build ready ONLY add when PR is ready to merge/full CI is needed
#17228 opened Apr 26, 2025 by aarnphm Loading…
Use CUDA 12.6 as default for release and nightly wheels ci/build documentation Improvements or additions to documentation
#17224 opened Apr 26, 2025 by huydhn Draft
[Bugfix] Get a specific type of layer from forward context tpu Related to Google TPUs v1
#17222 opened Apr 26, 2025 by heheda12345 Loading…
[Doc] Clarify note for H2O-VL documentation Improvements or additions to documentation
#17219 opened Apr 26, 2025 by DarkLight1337 Loading…
[V1][Spec Decode] Apply torch.compile & cudagraph to EAGLE documentation Improvements or additions to documentation v1
#17211 opened Apr 26, 2025 by luyuzhe111 Loading…
[Misc]add configurable cuda graph size
#17201 opened Apr 25, 2025 by CXIAAAAA Loading…
[Hardware][Apple] Allows VLLM_TARGET_DEVICE=empty on MacOs ci/build ready ONLY add when PR is ready to merge/full CI is needed
#17200 opened Apr 25, 2025 by wallashss Loading…
[Security] Don't bind tcp zmq socket to all interfaces documentation Improvements or additions to documentation security Security related issues and PRs
#17197 opened Apr 25, 2025 by russellb Loading… v0.8.5
[Bugfix] Fix Lora Name Parsing
#17196 opened Apr 25, 2025 by alex-jw-brooks Loading…
[WIP][Bugfix] Fix 'MistralTokenizer' object has no attribute 'init_kwargs' bug Something isn't working ready ONLY add when PR is ready to merge/full CI is needed
#17195 opened Apr 25, 2025 by chaunceyjiang Loading… v0.8.5
[V1] Remove num_input_tokens from attn_metadata tpu Related to Google TPUs v1
#17193 opened Apr 25, 2025 by heheda12345 Loading…
[Bugfix] support local dataset path in benchmark_serving
#17179 opened Apr 25, 2025 by wubai Loading…
[Misc] Add gemma3 chat template with pythonic-style function calling documentation Improvements or additions to documentation tool-calling
#17149 opened Apr 25, 2025 by philipchung Loading…
Add xLAM tool parser support documentation Improvements or additions to documentation frontend tool-calling
#17148 opened Apr 25, 2025 by zuxin666 Loading…
ProTip! Exclude everything labeled bug with -label:bug.