support Internvl chat v1.1, v1.2 and v1.2-plus #1425

irexyc · 2024-04-11T12:45:11Z

Motivation

support

Modification

add chat template (internvl_zh)
add source model (InternVLModel)
add vision model (InternVLVisionModel)
add InternVLChatTemplateWrapper

Usage:

The vision model is big, so we need to reduce kv-cache in order to load the vision model on single gpu.

With 4 x A100, the memory usage is about 47G x 1 and 35G x 3 when using the following setting. (InternVL-Chat-Chinese-V1-2-Plus)

from lmdeploy import pipeline, TurbomindEngineConfig
from lmdeploy.vl import load_image

pipe = pipeline('/home/chenxin/InternVL-Chat-Chinese-V1-2-Plus/', log_level='INFO',
    backend_config=TurbomindEngineConfig(cache_max_entry_count=0.3, tp=4))

im = load_image('tiger.jpeg')
out = pipe(('描述这个图片', im))

TODO

The vision model is big compraed with qwen / llava which use about 12G gpu memory. I will make another PR to improve it.

lmdeploy/model.py

lvhan028 · 2024-04-15T07:43:19Z

supported_models.md，以及 readme中的模型支持列表，请更新

lmdeploy/model.py

lvhan028 · 2024-04-15T12:06:51Z

test with OpenGVLab/InternVL-Chat-Chinese-V1-2-Plus

pipeline (tp4)
gradio (tp4)
api_server (tp4)

AllentDan

test with OpenGVLab/InternVL-Chat-Chinese-V1-1

pipeline (tp4 && tp8)
api_server (tp4)

irexyc added 2 commits April 11, 2024 12:34

support internvl-chat

38f01a9

fix vocab_size_padded_ tp

8443c0d

irexyc mentioned this pull request Apr 11, 2024

[Feature] support InternVL-Chat-Chinese-V1-2-Plus #1424

Closed

lvhan028 requested review from AllentDan and lvhan028 April 12, 2024 03:44

lvhan028 added the enhancement New feature or request label Apr 12, 2024

lvhan028 reviewed Apr 15, 2024

View reviewed changes

lmdeploy/model.py Outdated Show resolved Hide resolved

lvhan028 reviewed Apr 15, 2024

View reviewed changes

lmdeploy/model.py Outdated Show resolved Hide resolved

lvhan028 reviewed Apr 15, 2024

View reviewed changes

lmdeploy/model.py Outdated Show resolved Hide resolved

lvhan028 reviewed Apr 15, 2024

View reviewed changes

lmdeploy/model.py Outdated Show resolved Hide resolved

irexyc added 2 commits April 15, 2024 12:31

resolve comments

fa9da7f

update README

76b24b7

lvhan028 approved these changes Apr 15, 2024

View reviewed changes

AllentDan approved these changes Apr 16, 2024

View reviewed changes

lvhan028 changed the title ~~support Internvl chat~~ support Internvl chat v1.1, v1.2 and v1.2-plus Apr 16, 2024

lvhan028 merged commit 3b5795a into InternLM:main Apr 16, 2024
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support Internvl chat v1.1, v1.2 and v1.2-plus #1425

support Internvl chat v1.1, v1.2 and v1.2-plus #1425

irexyc commented Apr 11, 2024

lvhan028 commented Apr 15, 2024

lvhan028 commented Apr 15, 2024 •

edited

Loading

AllentDan left a comment

support Internvl chat v1.1, v1.2 and v1.2-plus #1425

support Internvl chat v1.1, v1.2 and v1.2-plus #1425

Conversation

irexyc commented Apr 11, 2024

Motivation

Modification

Usage:

TODO

lvhan028 commented Apr 15, 2024

lvhan028 commented Apr 15, 2024 • edited Loading

AllentDan left a comment

Choose a reason for hiding this comment

lvhan028 commented Apr 15, 2024 •

edited

Loading