
vllm backend failed #2028

Closed
chunniunai220ml opened this issue Jun 27, 2024 · 8 comments

Comments

@chunniunai220ml commented Jun 27, 2024

Hi, I tried to run an eval with accelerate:

    export CUDA_VISIBLE_DEVICES="2,3"
    accelerate launch -m lm_eval --model vllm \
        --model_args pretrained="THUDM/glm-4-9b",dtype=bfloat16 \
        --tasks mmlu \
        --device cuda \
        --batch_size 2 \
        --trust_remote_code \
        --cache_requests true \
        --num_fewshot 5

but it failed on an A100 (vllm=0.5.0.post1, vllm-flash-attn=2.5.9, torch=2.3.0+cu121):

[screenshot of the error]

@haileyschoelkopf (Contributor)

Hi, in order to help solve your issue we'd need more information. Namely:

  • can you provide the full traceback, not just this snippet from it?
  • can you provide the library versions that you are running with?

@chunniunai220ml (Author)

The full traceback is too long, so here are the key lines:

    vllm/vllm/engine/llm_engine.py", line 230
    vllm/vllm/executor/executor_base.py", line 41
    /vllm/vllm/executor/gpu_executor.py"
    vllm/vllm/distributed/parallel_state.py", line 771

[screenshot of the traceback]

Library versions:

vllm=0.5.0.post1, vllm-flash-attn=2.5.9, torch=2.3.0+cu121, lm-eval=0.4.2

@haileyschoelkopf (Contributor)

I see the problem: accelerate launch should not be used with vLLM. Instead, just use lm_eval --model vllm --model_args data_parallel_size=NUMGPUS.
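
For example, a minimal sketch of the invocation without accelerate, reusing the model and task arguments from the command above (data_parallel_size=2 assumes the two visible GPUs):

    export CUDA_VISIBLE_DEVICES="2,3"
    lm_eval --model vllm \
        --model_args pretrained="THUDM/glm-4-9b",dtype=bfloat16,data_parallel_size=2 \
        --tasks mmlu \
        --batch_size 2 \
        --trust_remote_code \
        --num_fewshot 5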

@chunniunai220ml (Author) commented Jun 28, 2024

Thanks for your comment. I hit another error with this test (accelerate launch removed):

    export CUDA_VISIBLE_DEVICES="2,7"
    lm_eval --model vllm \
        --model_args pretrained="THUDM/glm-4-9b",dtype=bfloat16,data_parallel_size=2 \
        --tasks mmlu \
        --device cuda \
        --batch_size 2 \
        --trust_remote_code \
        --cache_requests true \
        --num_fewshot 5

    /*/python3.10/site-packages/lm_eval/api/model.py", line 300, in _encode_pair
        if self.AUTO_MODEL_CLASS == transformers.AutoModelForCausalLM:
    AttributeError: 'VLLM' object has no attribute 'AUTO_MODEL_CLASS'

After running pip install lm_eval[vllm] and applying the fix referenced in #1953, I always get OOM, no matter whether I use 4 A100 cards or 2.

@haileyschoelkopf (Contributor)

Hi, could you try the following (see the example after this list):

  • setting enforce_eager=True as described in OOM Issue #1923
  • setting gpu_memory_utilization=0.8 or lower
  • using lm_eval==0.4.3 from PyPI, or the most recent commit from main
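
A sketch of how those settings could be combined, assuming the vLLM engine arguments are passed through --model_args (values are illustrative, not prescriptive):

    pip install lm_eval==0.4.3    # or: pip install git+https://github.com/EleutherAI/lm-evaluation-harness.git
    lm_eval --model vllm \
        --model_args pretrained="THUDM/glm-4-9b",dtype=bfloat16,data_parallel_size=2,gpu_memory_utilization=0.8,enforce_eager=True \
        --tasks mmlu \
        --batch_size 2 \
        --num_fewshot 5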

@chunniunai220ml (Author)

With pip install lm_eval[vllm], I get this error:

    lm-evaluation-harness/lm_eval/evaluator.py", line 15, in
        from lm_eval.caching.cache import delete_cache
    ModuleNotFoundError: No module named 'lm_eval.caching.cache'

but I can import it in a Python shell:

[screenshots showing the import succeeding interactively]

@haileyschoelkopf (Contributor)

@chunniunai220ml are you by chance running your PyPI-installed lm_eval in the same folder as an older git clone of lm-evaluation-harness?
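
A quick way to check which installation Python is actually picking up (a generic diagnostic, not specific to this setup):

    python -c "import lm_eval; print(lm_eval.__file__)"   # path of the module being imported
    pip show lm_eval                                       # version and location of the pip-installed package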

@chunniunai220ml (Author) commented Jul 2, 2024

At first I ran pip install lm_eval[vllm] and got the error as reported. Then I pulled the latest code and ran pip install -e ., with the same error. I also tried creating an __init__.py in lm_eval/cache, which hasn't worked so far.
@haileyschoelkopf
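
If the stale-clone overlap suggested above is the cause, one thing to try is a clean reinstall run from outside the old checkout; a rough sketch, paths illustrative:

    cd ~                                  # leave the old lm-evaluation-harness checkout
    pip uninstall -y lm_eval              # remove the previous editable or PyPI install
    pip install "lm_eval[vllm]==0.4.3"    # fresh install from PyPI
    python -c "import lm_eval; print(lm_eval.__file__)"   # should now point into site-packages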
