Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Model] use AutoWeightsLoader for bloom
#18300 opened May 17, 2025 by calvin0327 Loading…
[Model] use AutoWeightsLoader for bart
#18299 opened May 17, 2025 by calvin0327 Loading…
[P/D] Support TPU in NixlConnector needs-rebase tpu Related to Google TPUs v1
#18293 opened May 17, 2025 by juncgu Loading…
[TPU] Calculate block size only when not set. tpu Related to Google TPUs
#18292 opened May 17, 2025 by QiliangCui Draft
[Doc] Add doc to explain the usage of Qwen3 thinking documentation Improvements or additions to documentation
#18291 opened May 17, 2025 by WangErXiao Loading…
fix warning in eagle.py documentation Improvements or additions to documentation
#18288 opened May 17, 2025 by RuixiangMa Loading…
fix CUDA_check redefinition in #17918 ready ONLY add when PR is ready to merge/full CI is needed
#18287 opened May 17, 2025 by luccafong Loading…
Add multi-LoRA support for Neuron.
#18284 opened May 16, 2025 by aws-satyajith Loading…
Support quantization on neuron
#18283 opened May 16, 2025 by aws-satyajith Loading…
[CI/Build] [TPU] Fix TPU CI exit code ci/build ready ONLY add when PR is ready to merge/full CI is needed
#18282 opened May 16, 2025 by CAROLZXYZXY Loading…
[Attention][V1] Toggle for v1 attention backend v1
#18275 opened May 16, 2025 by gshtras Loading…
Update default neuron config for speculation
#18274 opened May 16, 2025 by elaineyz Loading…
[Doc] update Contributing page's testing section documentation Improvements or additions to documentation
#18272 opened May 16, 2025 by davidxia Loading…
ProTip! Follow long discussions with comments:>50.