
fix local variable 'response' referenced before assignment in async_engine.generate #1513

Merged
1 commit merged into InternLM:main on Apr 28, 2024

Conversation

@irexyc (Collaborator) commented on Apr 28, 2024

Motivation

When the input is long or the chat template doesn't match, the first output token id produced by the turbomind backend may be eos_id or one of the stop_words. In this case, async_engine.generate raises the following error:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/chenxin/ws3/vl/lmdeploy/serve/async_engine.py", line 325, in __call__
    return self.batch_infer(prompts,
  File "/home/chenxin/ws3/vl/lmdeploy/serve/async_engine.py", line 441, in batch_infer
    _get_event_loop().run_until_complete(gather())
  File "/home/chenxin/miniconda3/envs/38/lib/python3.8/asyncio/base_events.py", line 616, in run_until_complete
    return future.result()
  File "/home/chenxin/ws3/vl/lmdeploy/serve/async_engine.py", line 436, in gather
    await asyncio.gather(*[
  File "/home/chenxin/ws3/vl/lmdeploy/serve/async_engine.py", line 423, in _inner_call
    async for out in generator:
  File "/home/chenxin/ws3/vl/lmdeploy/serve/async_engine.py", line 648, in generate
    if not response.endswith('�'):
UnboundLocalError: local variable 'response' referenced before assignment
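
The root cause is a classic Python pitfall: response is only bound inside the async for body, so when the backend finishes before yielding any text, the check after the loop references an unbound name. A minimal self-contained sketch of the failure shape (token_stream and generate are hypothetical stand-ins, not LMDeploy's actual code):

import asyncio

async def token_stream(n):
    # hypothetical backend: yields n decoded chunks, possibly zero when
    # the first token is already eos_id or one of the stop_words
    for i in range(n):
        yield f'token {i}'

async def generate(n):
    async for out in token_stream(n):
        response = out                  # only bound if the loop body runs
    if not response.endswith('�'):      # UnboundLocalError when n == 0
        pass

asyncio.run(generate(1))  # fine
asyncio.run(generate(0))  # UnboundLocalError: local variable 'response' referenced before assignment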

Reproduce

from lmdeploy import pipeline
pipe = pipeline('/mnt/140/Qwen/Qwen1.5-7B-Chat')
pipe('hello ' * 7500)  # very long prompt: the first generated token is already a stop token
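
The one-commit fix presumably just gives response a safe initial binding before the streaming loop. Continuing the hypothetical sketch above (again, not the actual diff):

async def generate(n):
    response = ''                       # bound up front, so the name always exists
    async for out in token_stream(n):
        response = out
    if not response.endswith('�'):      # well-defined even when nothing was yielded
        pass

With that change the empty-output case falls through cleanly and the engine can return a Response with empty text, as in the verification below.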

@zhulinJulia24 (Collaborator) commented:

Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
[WARNING] gemm_config.in is not found; using default GEMM algo
Response(text='', generate_token_len=1, input_token_len=7520, session_id=0, finish_reason='stop', token_ids=[], logprobs=None)

fixed

@zhulinJulia24 (Collaborator) left a comment:


lgtm

@lvhan028 merged commit d72432e into InternLM:main on Apr 28, 2024
5 checks passed