
support OpenGVLab/InternVL-Chat-V1-5 #1490

Merged: 6 commits into InternLM:main on Apr 29, 2024
Conversation

irexyc (Collaborator) commented Apr 24, 2024

lvhan028 added the enhancement (New feature or request) label on Apr 25, 2024
lvhan028 requested a review from AllentDan on Apr 26, 2024
Review comment on lmdeploy/vl/model/internvl.py, lines +101 to +102 (outdated):
MEAN = (123.675, 116.28, 103.53)
STD = (58.395, 57.12, 57.375)
A collaborator commented:

Can these two constants be inferred from the InternVL code?

A collaborator replied:

Let's try not to infer anything from the upstream repo's code. We'd better keep them as independent as possible.

irexyc (Collaborator, Author) replied on Apr 28, 2024:

The mean and std are not in the repo itself but in the example code.
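
For reference, these constants are the standard ImageNet normalization statistics scaled to the 0-255 pixel range: (0.485, 0.456, 0.406) and (0.229, 0.224, 0.225) multiplied by 255. A minimal sketch of how such constants are typically applied with torchvision, not the PR's exact code:

import torchvision.transforms as T

MEAN = (123.675, 116.28, 103.53)  # (0.485, 0.456, 0.406) * 255
STD = (58.395, 57.12, 57.375)     # (0.229, 0.224, 0.225) * 255

transform = T.Compose([
    T.ToTensor(),  # HWC uint8 -> CHW float in [0, 1]
    # ToTensor already rescales to [0, 1], so divide the constants by 255
    T.Normalize(mean=[m / 255 for m in MEAN], std=[s / 255 for s in STD]),
])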

LRHstudy commented Apr 26, 2024

Running this code raises an error when the input is a 4-channel image:

File "/opt/py38/lib/python3.8/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
  return func(*args, **kwargs)
File "/opt/lmdeploy/lmdeploy/vl/model/internvl.py", line 151, in forward
  return self._forward_func(images)
File "/opt/lmdeploy/lmdeploy/vl/model/internvl.py", line 132, in _forward_v1_5
  outputs = self.transform(outputs)
File "/opt/py38/lib/python3.8/site-packages/torchvision/transforms/transforms.py", line 95, in __call__
  img = t(img)
File "/opt/py38/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1518, in _wrapped_call_impl
  return self._call_impl(*args, **kwargs)
File "/opt/py38/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1527, in _call_impl
  return forward_call(*args, **kwargs)
File "/opt/py38/lib/python3.8/site-packages/torchvision/transforms/transforms.py", line 277, in forward
  return F.normalize(tensor, self.mean, self.std, self.inplace)
File "/opt/py38/lib/python3.8/site-packages/torchvision/transforms/functional.py", line 363, in normalize
  return F_t.normalize(tensor, mean=mean, std=std, inplace=inplace)
File "/opt/py38/lib/python3.8/site-packages/torchvision/transforms/functional_tensor.py", line 928, in normalize
  return tensor.sub_(mean).div_(std)
File "/opt/py38/lib/python3.8/site-packages/torch/utils/_device.py", line 77, in __torch_function__
  return func(*args, **kwargs)
RuntimeError: The size of tensor a (4) must match the size of tensor b (3) at non-singleton dimension 1

The missing code:
image = image.convert('RGB') if image.mode != 'RGB' else image
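
A minimal sketch of where such a conversion would sit in an image-loading path (the helper name is hypothetical, not the actual patch):

from PIL import Image

def load_rgb(path: str) -> Image.Image:
    # Hypothetical helper: Normalize uses length-3 mean/std, so the tensor
    # must have exactly 3 channels; force RGBA/L/P images down to RGB.
    image = Image.open(path)
    return image.convert('RGB') if image.mode != 'RGB' else image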

irexyc (Collaborator, Author) commented Apr 26, 2024

@LRHstudy Fixed.

lvhan028 (Collaborator) commented Apr 28, 2024

  • vl pipeline (tp 1, 2)
  • api server
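
For context, exercising the vl pipeline with tensor parallelism looks roughly like this (a minimal sketch; the image path is a placeholder, and defaults may differ across lmdeploy versions):

from lmdeploy import pipeline, TurbomindEngineConfig
from lmdeploy.vl import load_image

pipe = pipeline('OpenGVLab/InternVL-Chat-V1-5',
                backend_config=TurbomindEngineConfig(tp=2))
image = load_image('path/to/image.jpg')  # placeholder path
response = pipe(('describe this image', image))
print(response.text)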

AllentDan (Collaborator) left a comment:

LGTM

lvhan028 merged commit b22366b into InternLM:main on Apr 29, 2024
5 checks passed
lijing1996 commented:

Why is the memory on each H800 card still fully occupied when I use tp? And why can the batch size only be the same as with a single card?

irexyc (Collaborator, Author) commented Apr 30, 2024

@lijing1996

With multi-GPU tp, it is Tensor Parallel: every card computes a part of the model, regardless of your batch size.

To control memory usage, some ways to reduce it are mentioned here:
#1173 (comment)

With tp > 1, the LLM model uses the same amount of memory on every card. The vision model currently lives only on card 0, which makes memory utilization on the other cards lower. We are working on this and will later distribute the vision model evenly across all the cards.
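
The main knob discussed in threads like #1173 is the kv-cache ratio. A minimal sketch of lowering it (the 0.5 value is only illustrative, and the default has varied across lmdeploy versions):

from lmdeploy import pipeline, TurbomindEngineConfig

# cache_max_entry_count caps the fraction of free GPU memory
# reserved for the kv cache after the weights are loaded
engine_config = TurbomindEngineConfig(tp=8, cache_max_entry_count=0.5)
pipe = pipeline('OpenGVLab/InternVL-Chat-V1-5', backend_config=engine_config)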

lijing1996 replied, quoting the comment above:

Here is what I see: with tp=8 and tp=1, for the same batch size, card 0's memory usage is identical, and cards 1-7 also occupy a lot of memory, though less than card 0. Is something not configured correctly?

irexyc (Collaborator, Author) commented Apr 30, 2024

What you are seeing is fine; it matches the logic.

The current memory-allocation logic is: first build the vision model on card 0, then load the LLM weights onto the cards, compute the smallest remaining free memory across all cards, and allocate kv-cache memory as a percentage of that.

Whether tp is 8 or 1 does not affect how much free memory card 0 has left, so the memory usage there is the same in both cases. But with tp=8, since the vision model currently sits only on card 0, cards 1-7 use somewhat less memory.
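
That order can be sketched as pseudocode (all helper names here are hypothetical, not lmdeploy internals):

import torch

def allocate_memory(build_vision_model, load_llm_shards, cache_ratio: float):
    n = torch.cuda.device_count()
    # 1. Build the vision model on card 0 only.
    vision = build_vision_model(device='cuda:0')
    # 2. Load the tensor-parallel LLM weight shards on every card.
    llm = load_llm_shards(devices=[f'cuda:{i}' for i in range(n)])
    # 3. Find the smallest remaining free memory across all cards.
    free_min = min(torch.cuda.mem_get_info(i)[0] for i in range(n))
    # 4. Reserve the same fraction of that minimum on each card, so every
    #    card ends up with an identically sized kv cache.
    kv_cache_bytes = int(free_min * cache_ratio)
    return vision, llm, kv_cache_bytes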

Labels: enhancement (New feature or request)
5 participants