Add deepseek vl #1335
Conversation
Conflicts: lmdeploy/serve/vl_async_engine.py
Please change the two places below, then check whether it still crashes or hangs. `cache_max_entry_count` can be set smaller: `backend_config=TurbomindEngineConfig(tp=2, session_len=8192, cache_max_entry_count=0.5)`

lmdeploy/lmdeploy/vl/model/deepseek.py Line 24 in c9b61e3

Change this to `cuda:0`:
with torch.device('cuda:0'):
    time_start = time.perf_counter()
    outputs = self.model.forward(inputs)
    time_end = time.perf_counter()
    logger.info(f'ImageEncoder forward {len(inputs)} images, '
                f'cost {time_end - time_start:.3f}s')
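The timing pattern in the snippet above can be reproduced standalone. A minimal sketch, where the forward callable and logger name are placeholders rather than lmdeploy's actual objects:

```python
import logging
import time

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger('ImageEncoder')


def timed_forward(forward, inputs):
    """Time a forward call with perf_counter and log the cost,
    mirroring the logging style used in the snippet above."""
    time_start = time.perf_counter()
    outputs = forward(inputs)
    time_end = time.perf_counter()
    logger.info('ImageEncoder forward %d images, cost %.3fs',
                len(inputs), time_end - time_start)
    return outputs


# Toy stand-in for the image encoder's forward pass.
outs = timed_forward(lambda xs: [x * 2 for x in xs], [1, 2, 3])
```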
lmdeploy/vl/model/deepseek.py
Outdated
with torch.device('cpu'):
    model = AutoModelForCausalLM.from_pretrained(
        self.model_path, trust_remote_code=True)
May use `init_empty_weights` to accelerate loading.
Tried that, but the model's outputs seemed to be wrong.
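A plausible explanation: `init_empty_weights` (from Hugging Face accelerate) constructs parameters on the meta device with no real storage, so unless every weight is later materialized from the checkpoint, the model computes on uninitialized data. A stdlib-only sketch of that failure mode, with hypothetical names standing in for the real API:

```python
class LazyLinear:
    """Stand-in for a layer whose weights start out unallocated,
    mimicking a module built under an empty-weights context."""

    def __init__(self, n: int):
        self.n = n
        self.weight = None  # "meta" weight: no real data yet

    def materialize(self, values):
        # Analogous to loading the real checkpoint values afterwards.
        self.weight = list(values)

    def forward(self, xs):
        # Guard instead of silently computing on garbage, which is
        # what happens when materialization is skipped.
        if self.weight is None:
            raise RuntimeError('weights not materialized')
        return sum(w * x for w, x in zip(self.weight, xs))


layer = LazyLinear(3)
layer.materialize([1.0, 2.0, 3.0])
result = layer.forward([1.0, 1.0, 1.0])  # 6.0
```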
Accelerating model loading is very important. Please investigate.

ValueError: Could not find the operator torchvision::nms. Please make sure you have already registered the operator and (if registered from C++) loaded it via torch.ops.load_library.

torch 2.1.2+cu118
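This `torchvision::nms` failure typically indicates that torch and torchvision builds do not match (for example, a CPU-only torchvision installed next to a CUDA torch). A small helper to collect the installed versions for a bug report; `versions` is a hypothetical name, not part of lmdeploy:

```python
import importlib


def versions():
    """Report installed torch/torchvision versions (None if missing),
    to help spot a build mismatch like the one in the error above."""
    found = {}
    for name in ('torch', 'torchvision'):
        try:
            found[name] = importlib.import_module(name).__version__
        except ImportError:
            found[name] = None
    return found


print(versions())
```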
#1321