-
Notifications
You must be signed in to change notification settings - Fork 306
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add slora serving document
documentation
Improvements or additions to documentation
#2084
opened Jul 19, 2024 by
AllentDan
Loading…
Reorganize the user guide
documentation
Improvements or additions to documentation
WIP
#2038
opened Jul 16, 2024 by
lvhan028
Loading…
1 of 3 tasks
torch engine optimize prefill for long context
improvement
#1962
opened Jul 9, 2024 by
grimoire
Loading…
Remove deprecated arguments from API and clarify model_name and chat_template_name
BC-breaking
improvement
WIP
#1931
opened Jul 5, 2024 by
lvhan028
Loading…
PyTorch Engine AWQ support
enhancement
New feature or request
#1913
opened Jul 3, 2024 by
grimoire
Loading…
Fix index error when profiling token generation with
-ct 1
Bug:P1
#1898
opened Jul 2, 2024 by
lvhan028
Loading…
Support guided decoding for pytorch backend
enhancement
New feature or request
#1856
opened Jun 26, 2024 by
AllentDan
Loading…
feat: skip invokeFlattenKV_v2_ when fp16 and bf16 with CacheType::kBlock
#1683
opened May 29, 2024 by
zhyncs
Loading…
[benchmark] optimize benchmark: counting tokenlizer tokens and error requests
#1607
opened May 17, 2024 by
NiuBlibing
Loading…
fix: update api_server_backend.py to adapt latest gradio
improvement
#1541
opened May 3, 2024 by
kv-chiu
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.