Pull requests: InternLM/lmdeploy
Pull requests list
add benchmark scripts about pipeline api and inference engines according to the config file
#3622
opened Jun 8, 2025 by
lvhan028
Support Mooncake migration backend for PD disaggregation
enhancement
New feature or request
#3620
opened Jun 7, 2025 by
Risc-lt
feature: enable tool_call and reasoning_content parsing for qwen3
enhancement
New feature or request
#3615
opened Jun 5, 2025 by
ywx217
Add FP8 MoE for turbomind
enhancement
New feature or request
#3601
opened May 30, 2025 by
lzhangzz
[Feature] metrics support
enhancement
New feature or request
#3534
opened May 9, 2025 by
CUHKSZzxy
7 of 8 tasks
Add Gloo communication to turbomind
enhancement
New feature or request
#3362
opened Mar 28, 2025 by
irexyc
Improve turbomind's prefix cache
BC-breaking
improvement
#3332
opened Mar 25, 2025 by
lvhan028
6 of 8 tasks
add deepseekv3 doc
documentation
Improvements or additions to documentation
WIP
#3265
opened Mar 17, 2025 by
CUHKSZzxy
support loading model with user input params (turbomind)
enhancement
New feature or request
#3204
opened Mar 3, 2025 by
irexyc
support setting devices for turbomind backend
improvement
#3203
opened Mar 3, 2025 by
irexyc