Skip to content

[Bug] vllm deploy InternVL3_5-241B-A28B error #1175

@liuxuexun

Description

@liuxuexun

Checklist

  • 1. I have searched related issues but cannot get the expected help.
  • 2. The bug has not been fixed in the latest version.
  • 3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.

Describe the bug

(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654] Traceback (most recent call last):
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/vllm/v1/executor/multiproc_executor.py", line 649, in worker_busy_loop
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     output = func(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 120, in decorate_context
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     return func(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/vllm/v1/worker/gpu_worker.py", line 263, in determine_available_memory
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     self.model_runner.profile_run()
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/vllm/v1/worker/gpu_model_runner.py", line 3017, in profile_run
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     self.model.get_multimodal_embeddings(
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/vllm/model_executor/models/internvl.py", line 1331, in get_multimodal_embeddings
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     video_embeddings = self._process_image_input(video_input)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/vllm/model_executor/models/internvl.py", line 1264, in _process_image_input
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     image_embeds = self.extract_feature(image_input["pixel_values_flat"])
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/vllm/model_executor/models/internvl.py", line 1154, in extract_feature
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     vit_embeds = self.vision_model(pixel_values=pixel_values)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1773, in _wrapped_call_impl
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     return self._call_impl(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1784, in _call_impl
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     return forward_call(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/vllm/model_executor/models/intern_vit.py", line 467, in forward
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     encoder_outputs = self.encoder(inputs_embeds=hidden_states)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1773, in _wrapped_call_impl
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     return self._call_impl(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1784, in _call_impl
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     return forward_call(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/vllm/model_executor/models/intern_vit.py", line 413, in forward
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     hidden_states = encoder_layer(hidden_states)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1773, in _wrapped_call_impl
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     return self._call_impl(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1784, in _call_impl
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     return forward_call(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/vllm/model_executor/models/intern_vit.py", line 372, in forward
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     hidden_states = hidden_states + self.attn(
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1773, in _wrapped_call_impl
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     return self._call_impl(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1784, in _call_impl
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     return forward_call(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/vllm/model_executor/models/intern_vit.py", line 277, in forward
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     x = self.attn(q, k, v)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1773, in _wrapped_call_impl
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     return self._call_impl(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1784, in _call_impl
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     return forward_call(*args, **kwargs)
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]   File "/root/miniconda3/envs/vllm/lib/python3.10/site-packages/vllm/attention/layer.py", line 383, in forward
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654]     bsz, q_len, _ = query.size()
(Worker_TP0 pid=173356) ERROR 09-15 15:24:40 [multiproc_executor.py:654] ValueError: too many values to unpack (expected 3)

I use vllm==0.10.2, think you !

Reproduction

vllm serve /root/models/InternVL3_5-241B-A28B/ --port 8084 --host 0.0.0.0 --dtype bfloat16 --max-model-len 16384 --tensor-parallel-size 8 --enforce-eager --allowed-local-media-path / --trust_remote_code

Environment

python=3.10 torch=2.8.0 vllm=0.10.2

Error traceback

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions