ENH: make deepseek_vl support streaming output #1444

Merged: 8 commits into xorbitsai:main on May 10, 2024

Conversation

@Minamiyama (Contributor)

Attached video: QQ202458-102123.mp4
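For readers who want to try the new behavior, here is a minimal, hypothetical sketch of consuming the streamed output through Xinference's OpenAI-compatible endpoint; the base URL and model name below are assumptions for illustration, not something specified in this PR.

    # Hypothetical client-side sketch: stream a chat completion from a local
    # Xinference server through its OpenAI-compatible API (base_url and model
    # name are assumptions for illustration).
    from openai import OpenAI

    client = OpenAI(base_url="http://127.0.0.1:9997/v1", api_key="not-needed")

    stream = client.chat.completions.create(
        model="deepseek-vl-chat",  # hypothetical model name/uid
        messages=[{"role": "user", "content": "Describe this image."}],
        stream=True,
    )
    for chunk in stream:
        delta = chunk.choices[0].delta.content
        if delta:
            print(delta, end="", flush=True)

Each chunk carries only the newly generated tokens, which is what makes the incremental display shown in the video possible.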

@XprobeBot added the enhancement (New feature or request) label on May 8, 2024
@XprobeBot added this to the v0.11.0 milestone on May 8, 2024
@sunhaha123

Looks great! Could this model run on a 24 GB GPU?

@sunhaha123 commented May 9, 2024

> Looks great! Could this model run on a 24 GB GPU?

It seems fp16 works, using:

    vl_gpt: MultiModalityCausalLM = AutoModelForCausalLM.from_pretrained(  # type: ignore
        self.model_path,
        trust_remote_code=True,
        device_map=self._device,
        low_cpu_mem_usage=True,
        torch_dtype=torch.float16,
    )
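For anyone trying this outside the Xinference model class, a self-contained sketch of the same fp16 load; the model path/id and the target device are assumptions for illustration.

    # Minimal sketch of the fp16 load above; requires transformers and accelerate.
    import torch
    from transformers import AutoModelForCausalLM

    model_path = "deepseek-ai/deepseek-vl-7b-chat"  # hypothetical repo id or local path
    vl_gpt = AutoModelForCausalLM.from_pretrained(
        model_path,
        trust_remote_code=True,     # loads DeepSeek-VL's custom MultiModalityCausalLM class
        device_map="cuda:0",
        low_cpu_mem_usage=True,
        torch_dtype=torch.float16,  # fp16 weights keep the 7B model within ~24 GB of VRAM
    )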

@Minamiyama (Contributor, Author)

> Looks great! Could this model run on a 24 GB GPU?

Yes, I'm running it on an RTX 4090.

@qinxuye merged commit 0cb0f0e into xorbitsai:main on May 10, 2024
12 checks passed
@sunhaha123 commented May 10, 2024

  File "/home/echo/miniconda3/envs/xinfer/lib/python3.10/site-packages/xinference/model/llm/pytorch/deepseek_vl.py", line 190, in chat
    from ....thirdparty.deepseek_vl.serve.inference import generate
ModuleNotFoundError: [address=0.0.0.0:42331, pid=2580143] No module named 'xinference.thirdparty.deepseek_vl.serve'

I ran pip install after a git pull from the main branch, and this error occurred.

@Minamiyama (Contributor, Author)

>     File "/home/echo/miniconda3/envs/xinfer/lib/python3.10/site-packages/xinference/model/llm/pytorch/deepseek_vl.py", line 190, in chat
>       from ....thirdparty.deepseek_vl.serve.inference import generate
>   ModuleNotFoundError: [address=0.0.0.0:42331, pid=2580143] No module named 'xinference.thirdparty.deepseek_vl.serve'
>
> I ran pip install after a git pull from the main branch, and this error occurred.

The module code is in the source tree:
(screenshot showing the xinference/thirdparty/deepseek_vl directory in the repository)

Maybe you should build from source, or copy the code in xinference/thirdparty/deepseek_vl into your installed package path (that is how I solved it).
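If it helps others hitting this, here is a quick, hypothetical sanity check that the copied files ended up where the import expects them:

    # Hypothetical check: confirm the thirdparty deepseek_vl.serve submodule
    # is present inside the installed xinference package.
    import importlib.util
    import pathlib

    import xinference

    pkg_root = pathlib.Path(xinference.__file__).parent
    print("xinference installed at:", pkg_root)
    print("serve directory exists:",
          (pkg_root / "thirdparty" / "deepseek_vl" / "serve").is_dir())
    print("serve module importable:",
          importlib.util.find_spec("xinference.thirdparty.deepseek_vl.serve") is not None)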

@sunhaha123

> Maybe you should build from source, or copy the code in xinference/thirdparty/deepseek_vl into your installed package path (that is how I solved it).

It works. Still feels strange, though, since I did pip install from the source code.

@qinxuye (Contributor) commented May 10, 2024

Did you update the main code?

@sunhaha123

> Did you update the main code?

`git pull origin main`, then `pip install .[transformers]`.

@dimitribellini

Dear Team,
I'm new to "xinference" and I don't fully understand how to use your inference server with the DeepSeek VL model.
Using the Docker option I was able to run it and load the model with the "transformers" engine, but as soon as I connect using OpenWebUI through the OpenAI API I receive the error message reported by @Minamiyama.
How can I solve it? Your help is very much appreciated :-)
Thanks so much
