Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Remove kv cache offline quantization
#2097 opened Jul 22, 2024 by AllentDan Loading…
New GEMM kernels for weight-only quantization WIP
#2090 opened Jul 19, 2024 by lzhangzz Loading…
Add nest_asyncio to requirements
#2085 opened Jul 19, 2024 by AllentDan Loading…
Add slora serving document documentation Improvements or additions to documentation
#2084 opened Jul 19, 2024 by AllentDan Loading…
raise thread exception Bug:P1
#2071 opened Jul 18, 2024 by irexyc Loading…
InternLM Summer Camp3
#2059 opened Jul 17, 2024 by boshallen Loading…
Support custom attention backend
#2046 opened Jul 16, 2024 by grimoire Draft
Reorganize the user guide documentation Improvements or additions to documentation WIP
#2038 opened Jul 16, 2024 by lvhan028 Loading…
1 of 3 tasks
Add log info for prefix cache
#2018 opened Jul 13, 2024 by ispobock Loading…
Phi3 awq
#1984 opened Jul 10, 2024 by grimoire Loading…
1 task
support min_p sampling & do_sample setting WIP
#1966 opened Jul 9, 2024 by irexyc Loading…
PyTorch Engine AWQ support enhancement New feature or request
#1913 opened Jul 3, 2024 by grimoire Loading…
Support guided decoding for pytorch backend enhancement New feature or request
#1856 opened Jun 26, 2024 by AllentDan Loading…
feat: decouple input_ids and output_ids
#1855 opened Jun 25, 2024 by zhyncs Loading…
Add Jetson platform support (by docker)
#1820 opened Jun 21, 2024 by BestAnHongjun Loading…
support vl benchmark
#1662 opened May 27, 2024 by AllentDan Loading…
Check base64 image validation Bug:P2
#1615 opened May 20, 2024 by AllentDan Loading…
support AI4Chem/ChemLLM-7B-Chat-1_5-SFT WIP
#1552 opened May 7, 2024 by lvhan028 Loading…
ProTip! Adding no:label will show everything without a label.