Support qwen for pytorch engine #1265

RunningLeon · 2024-03-08T11:57:47Z

Motivation

Support Qwen for pytorch engine

opencompass results:

Modification

Please briefly describe what modification is made in this PR.

BC-breaking (Optional)

Does the modification introduce changes that break the backward-compatibility of the downstream repositories?
If so, please describe how it breaks the compatibility and how the downstream projects should modify their code to keep compatibility with this PR.

Use cases (Optional)

If this PR introduces a new feature, it is better to list some use cases here, and update the documentation.

Checklist

Pre-commit or other linting tools are used to fix the potential lint issues.
The modification is covered by complete unit tests. If not, please add more unit tests to ensure the correctness.
If the modification has a dependency on downstream projects of a newer version, this PR should be tested with all supported versions of downstream projects.
The documentation has been modified accordingly, like docstring or example tutorials.

jjjjohnson · 2024-03-12T04:21:37Z

Looks like not support logn_attn yet

RunningLeon · 2024-03-12T07:58:47Z

@jjjjohnson hi, thanks for the reminder. Will update later.

lmdeploy/pytorch/supported_models.py

lmdeploy/pytorch/models/qwen.py

jjjjohnson · 2024-03-25T12:53:34Z

@RunningLeon Look like there is en error in Qwen pytorch backend...

from lmdeploy.messages import PytorchEngineConfig, EngineGenerationConfig
from lmdeploy.pytorch.engine import Engine
from lmdeploy.serve.async_engine import AsyncEngine
import logging
logger = logging.getLogger(__name__)
logger.setLevel(logging.DEBUG)



model_name='qwen-14b'
engine_config = PytorchEngineConfig(model_name=model_name,
                                    tp=1,max_batch_size=8,
                                    adapters=None,num_gpu_blocks=200, num_cpu_blocks=100)


engin= AsyncEngine(model_path=model_path, backend='pytorch', model_name=model_name, backend_config=engine_config)

prompts = 'Hello'
engin(prompts)

RunningLeon · 2024-03-26T02:54:54Z

@jjjjohnson Hi, thanks for your feedback. I'll try to reproduce the issue. Coming back to you soon.

RunningLeon · 2024-03-27T03:01:13Z

@jjjjohnson hi, lmdeploy had supported qwen with old version. It seems you are running new version of Qwen and some rewriting fails in this case. Could try PR #1351? thanks for your understanding.

jjjjohnson · 2024-03-27T06:47:14Z

Cool! THX!

RunningLeon added 6 commits March 7, 2024 11:55

add qwen

ff2979e

update

1fa01f8

support qwen

971347b

remove unused args

236f8a4

fix

81b1549

update docs

aecbe06

RunningLeon force-pushed the support-qwen branch from 220e835 to aecbe06 Compare March 8, 2024 11:58

RunningLeon added 5 commits March 8, 2024 20:00

fix

7d56c26

set torch_dtype for qwen

e75398f

Merge remote-tracking branch 'upstream/main' into support-qwen

c985630

refact

9f56ba3

fix

d61995f

RunningLeon marked this pull request as ready for review March 12, 2024 03:14

RunningLeon added 2 commits March 12, 2024 11:25

fix batch infer

4995a22

remove unused arg

5c7e080

RunningLeon changed the title ~~[WIP]: Support qwen for pytorch engine~~ Support qwen for pytorch engine Mar 12, 2024

add stop words

3cf92b1

RunningLeon added 3 commits March 13, 2024 16:19

support use_logn_attn

a4ad656

Merge remote-tracking branch 'upstream/main' into support-qwen

2902aa5

Merge remote-tracking branch 'upstream/main' into support-qwen

fb4ecd9

lvhan028 added the enhancement New feature or request label Mar 14, 2024

lvhan028 requested review from grimoire and AllentDan March 14, 2024 03:14

grimoire reviewed Mar 14, 2024

View reviewed changes

lmdeploy/pytorch/supported_models.py Show resolved Hide resolved

grimoire reviewed Mar 14, 2024

View reviewed changes

lmdeploy/pytorch/models/qwen.py Outdated Show resolved Hide resolved

remove unnecessary rewriting

4e27fe7

grimoire approved these changes Mar 21, 2024

View reviewed changes

AllentDan approved these changes Mar 21, 2024

View reviewed changes

lvhan028 merged commit 2dfceef into InternLM:main Mar 21, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support qwen for pytorch engine #1265

Support qwen for pytorch engine #1265

RunningLeon commented Mar 8, 2024 •

edited

Loading

jjjjohnson commented Mar 12, 2024

RunningLeon commented Mar 12, 2024

jjjjohnson commented Mar 25, 2024

RunningLeon commented Mar 26, 2024

RunningLeon commented Mar 27, 2024 •

edited

Loading

jjjjohnson commented Mar 27, 2024

Support qwen for pytorch engine #1265

Support qwen for pytorch engine #1265

Conversation

RunningLeon commented Mar 8, 2024 • edited Loading

Motivation

Modification

BC-breaking (Optional)

Use cases (Optional)

Checklist

jjjjohnson commented Mar 12, 2024

RunningLeon commented Mar 12, 2024

jjjjohnson commented Mar 25, 2024

RunningLeon commented Mar 26, 2024

RunningLeon commented Mar 27, 2024 • edited Loading

jjjjohnson commented Mar 27, 2024

RunningLeon commented Mar 8, 2024 •

edited

Loading

RunningLeon commented Mar 27, 2024 •

edited

Loading