-
Notifications
You must be signed in to change notification settings - Fork 210
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[v0.9.1-dev][CI/UT][bugfix]fix v0 spec decode
module:tests
#1323
opened Jun 20, 2025 by
mengwei805
Loading…
[0.9.1]support deepseek w4a8 quantization
module:quantization
module:tests
#1320
opened Jun 20, 2025 by
pichangping
Loading…
[Bugfix] fix disaggregated prefill bug (merge into v0.9.1)
module:quantization
module:tests
#1317
opened Jun 20, 2025 by
underfituu
Loading…
[CI/UT] Fix disaggregated prefill ci
module:tests
pd-test
enable pd test for PR
ready-for-test
start test by label for PR
#1313
opened Jun 20, 2025 by
MengqingCao
Loading…
[CI] Update guided decoding ut
long-term-test
enable long term test for PR
module:tests
ready-for-test
start test by label for PR
#1312
opened Jun 20, 2025 by
shen-shanshan
Loading…
[0.9.1]fix oom issue in mla and enable mla_pa for deepseek mla decode
documentation
Improvements or additions to documentation
module:core
module:quantization
#1311
opened Jun 20, 2025 by
ganyi1996ppo
Loading…
[WIP] support fa3 quant
module:quantization
module:tests
#1310
opened Jun 20, 2025 by
22dimensions
Loading…
use fused ops npu_top_k_top_p
documentation
Improvements or additions to documentation
#1308
opened Jun 20, 2025 by
Pr0Wh1teGivee
Loading…
[Doc] Add reinstall instructions doc
documentation
Improvements or additions to documentation
#1303
opened Jun 19, 2025 by
weiguihua2
Loading…
[Doc] Add sleep mode doc
documentation
Improvements or additions to documentation
#1295
opened Jun 19, 2025 by
Potabk
Loading…
[CI]Update accuracy report test
dense-accuracy-test
enable dense accuracy test for PR
ready-for-test
start test by label for PR
#1288
opened Jun 18, 2025 by
zhangxinyuehfad
Loading…
dLLM, short for distributed LLM, an easy-to-use tool for multi-node vllm deployment
#1280
opened Jun 18, 2025 by
zhouyeju
Loading…
[v0.9.1][DP] Tiny fix of dp and update example
module:tests
#1277
opened Jun 18, 2025 by
MengqingCao
Loading…
[0.9.1][Feature] Support Qwen3 W4A8 quantization
module:quantization
module:tests
#1275
opened Jun 18, 2025 by
zhoux77899
Loading…
[DP] Tiny fix of dp and update example
documentation
Improvements or additions to documentation
module:tests
#1273
opened Jun 18, 2025 by
MengqingCao
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-05-20.