Pull requests: vllm-project/vllm
feat(compilation): add VLLM_COMPILE_DEPYF env var to control depyf de…
#22125
opened Aug 2, 2025 by vincentzed
Draft
4 tasks
[Misc] Bump ray to 2.48.0
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#22123
opened Aug 2, 2025 by ruisearch42
4 tasks
[Bugfix] Add num_special_tokens_to_add to MistralTokenizer, fixes #22013
#22121
opened Aug 2, 2025 by ShUl0w
3 of 4 tasks
[WIP] vLLM Benchmark suite improvement
ci/build
performance
Performance-related issues
#22119
opened Aug 2, 2025 by louie-tsai
1 of 4 tasks
[Bugfix] Add Dense module support for sentence-transformers models
#22117
opened Aug 2, 2025 by FFFfff1FFFfff
[Fix] Fix python path resolving in cpu cmake
ci/build
#22115
opened Aug 2, 2025 by xiszishu
3 of 4 tasks
[Hardware][RISC-V] Add riscv64 support for vLLM with scalar
ci/build
#22112
opened Aug 2, 2025 by langc23
enable Docker-aware precompiled wheel setup
ci/build
ready
#22106
opened Aug 1, 2025 by dougbtv
3 tasks done
[Frontend] Update OpenAI error response to upstream format
frontend
#22099
opened Aug 1, 2025 by msanft
3 of 4 tasks
[ROCm][Misc] Rename the context_len to seq_len in ROCm custom paged attention kernel
rocm
Related to AMD ROCm
#22097
opened Aug 1, 2025 by charlifu
[Misc] Add comprehensive error message for non-integer CUDA_VISIBLE_DEVICES variables
#22096
opened Aug 1, 2025 by odashi
4 tasks
[NVIDIA] Support Flashinfer TRT-LLM Prefill Attention Kernel
needs-rebase
performance
v1
#22095
opened Aug 1, 2025 by elvischenv
3 of 4 tasks
[Misc] rename torch backend literal string with const var
documentation
Improvements or additions to documentation
rocm
tpu
Related to Google TPUs
v1
#22087
opened Aug 1, 2025 by andyxning
4 tasks
[Speculators][Speculative Decoding] Add Eagle3 Support For HunYuan Model
new-model
Requests to new models
speculative-decoding
v1
#22080
opened Aug 1, 2025 by kzjeef
3 of 4 tasks
Add Tool Call Parser for tngtech/DeepSeek-TNG-R1T2-Chimera
deepseek
Related to DeepSeek models
frontend
tool-calling
#22074
opened Aug 1, 2025 by sfbemerk