-
Notifications
You must be signed in to change notification settings - Fork 253
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CI] Switching to infra cache server to reduce network pressure
#1792
opened Jul 14, 2025 by
pkking
Loading…
Add graph mode for Qwen2.5 and Qwen3
module:core
module:ops
#1787
opened Jul 14, 2025 by
NicholasTao
Loading…
optmize rope in qwen2
merge-conflicts
module:core
module:ops
module:tests
#1782
opened Jul 14, 2025 by
David9857
Loading…
[Doc] Update accuracy reports for main
documentation
Improvements or additions to documentation
#1777
opened Jul 14, 2025 by
vllm-ascend-ci
Loading…
[PD Disagg][CI] Upgrade vllm version to fix ci
pd-test
enable pd test for PR
ready-for-test
start test by label for PR
#1765
opened Jul 14, 2025 by
MengqingCao
Loading…
[Misc] Remove VLLM_USE_V1 usage in code
module:core
module:tests
#1764
opened Jul 14, 2025 by
wangxiyuan
Loading…
【main】 Support SP for qwen2.5 and qwen3 moe
module:core
module:ops
#1761
opened Jul 12, 2025 by
lbk-sys
Loading…
[V0.9.1] torchair_graph bugfix when chunked_prefill is true
#1748
opened Jul 11, 2025 by
fems14
Loading…
[V0.9.1] Add support for flashcomm_v1 in Qwen2.5
merge-conflicts
module:core
#1745
opened Jul 11, 2025 by
rjg-lyh
Loading…
flashcomm3 multi stream of moe layer
merge-conflicts
module:core
module:ops
module:quantization
#1744
opened Jul 11, 2025 by
wyhhyw123
Loading…
Optimization of TP4 Parallelism in DeepSeek MLP Dense Layers
#1738
opened Jul 11, 2025 by
zhanghw0354
Loading…
[Doc] Add model costomization doc
documentation
Improvements or additions to documentation
#1737
opened Jul 11, 2025 by
shen-shanshan
Loading…
[2/N] Enable shellcheck and pymarkdown for lint system
documentation
Improvements or additions to documentation
module:tests
module:tools
#1735
opened Jul 11, 2025 by
Potabk
Loading…
[Test] Remove VLLM_USE_V1 in example and tests
merge-conflicts
module:tests
#1733
opened Jul 11, 2025 by
wangxiyuan
Loading…
[Perf] Reduce memory usage by splitting tokens in fused_experts
documentation
Improvements or additions to documentation
module:core
module:ops
module:quantization
module:tests
ready
read for review
#1729
opened Jul 10, 2025 by
ApsarasX
Loading…
[V0.9.1] add support for flashcomm2 in qwen3
module:core
module:tests
#1726
opened Jul 10, 2025 by
David9857
Loading…
[BUGFIX] [v0.9.1-dev] Obtain the NPU ID of non-consecutive NPU cards
#1724
opened Jul 10, 2025 by
yangqinghao-cmss
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.