Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815
opened Dec 11, 2023 by
lvhan028
Issues list
[Bug] InternVL2-40B generates nonsense outputs
#1965
opened Jul 9, 2024 by
pseudotensor
[Feature] Support for CogVLM2-Video-LLama3-Chat in TorchEngine
#1964
opened Jul 9, 2024 by
ericzhou571
[Feature] Please provide INT4 quantization for InternVL2-26B and InternVL2-40B
#1955
opened Jul 8, 2024 by
tairen99
[Bug] Turbomind Docker deployment fails under high load
#1954
opened Jul 8, 2024 by
Tushar-ml
AttributeError: 'AsyncEngine' object has no attribute 'get_ppl'
#1950
opened Jul 8, 2024 by
poisonwine
[Bug] Why is the value of logprobs None?
#1948
opened Jul 8, 2024 by
airaria
[Bug] Llama3 chat template is not consistent with the Hugging Face implementation
#1945
opened Jul 8, 2024 by
efsotr
[Bug] v0.5.0 crashes with CUDA OOM error while v0.4.2 does not (in exactly the same scenario - 30 concurrent requests to LLama2 70B)
#1943
opened Jul 7, 2024 by
josephrocca
[Feature] Prefix cache hit/miss/eviction statistics to detect cache thrashing
#1942
opened Jul 7, 2024 by
josephrocca
[Bug] The same code works on A800 but gets stuck on A10 with MiniCPM-Llama3-V-2_5
#1938
opened Jul 6, 2024 by
llmrainer
[Bug] unified_attention splitting KV for prefill with a larger workspace causes a core dump
#1935
opened Jul 6, 2024 by
snippetzero
[Bug] TCP error (port already in use) when deploying with PytorchEngine
awaiting response
#1925
opened Jul 5, 2024 by
Desein-Yang
[Feature] Is there any plan to support InternLM-XComposer2.5 inference?
#1920
opened Jul 4, 2024 by
Charles-Xie