Update rewritings for qwen #1351

Merged: 2 commits into InternLM:main on Mar 28, 2024
Conversation

RunningLeon (Collaborator):

Motivation

Update the rewritings to match the latest modeling_qwen.py on Hugging Face.

Modification

Please briefly describe what modification is made in this PR.

BC-breaking (Optional)

None

Use cases (Optional)

If this PR introduces a new feature, it is better to list some use cases here, and update the documentation.

Checklist

  1. Pre-commit or other linting tools are used to fix the potential lint issues.
  2. The modification is covered by complete unit tests. If not, please add more unit tests to ensure the correctness.
  3. If the modification has a dependency on downstream projects of a newer version, this PR should be tested with all supported versions of downstream projects.
  4. The documentation has been modified accordingly, like docstring or example tutorials.

Review thread on the rewritten model code (diff context):

    layer_outputs = decoder_layer(
    context = self.context.context
    max_kv_seq_length = context.max_kv_seq_length
    # do not support use_dynamic_ntk
Collaborator:
why?

RunningLeon (Collaborator, Author):
For these reasons, we could postpone support for this feature for now and see if users really need it:

  1. dynamic_ntk only works in the prefilling phase, where input_len is greater than seq_length in config.json (8192 for Qwen-7B).
  2. dynamic_ntk would have to assign a different ntk_alpha to each sequence in a batch, which is complicated to implement in the PyTorch engine (see the sketch after this list).
  3. Qwen2 has removed dynamic_ntk, which suggests it is not that important.
  4. Evaluation results with OpenCompass suggest that it does not change the results much:
[screenshot: OpenCompass evaluation results]
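
For context on points 1 and 2, here is a minimal sketch of how Qwen-style dynamic NTK scales the rotary base with sequence length. The alpha formula mirrors the one in Hugging Face's modeling_qwen.py; the function names, the train_seq_length default, and the batch loop are illustrative assumptions, not lmdeploy code.

    import math

    def qwen_ntk_alpha(seq_len: int, train_seq_length: int = 8192) -> float:
        # NTK scaling factor as in Qwen's modeling_qwen.py: stays at 1.0
        # until seq_len exceeds train_seq_length, then grows in powers of two.
        context_value = math.log(seq_len / train_seq_length, 2) + 1
        ntk_alpha = 2 ** math.ceil(context_value) - 1
        return max(ntk_alpha, 1.0)

    def scaled_rope_base(base: float, ntk_alpha: float, rotary_dim: int) -> float:
        # Stretch the RoPE base by the NTK factor (same exponent as Qwen uses).
        return base * ntk_alpha ** (rotary_dim / (rotary_dim - 2))

    # Point 2 in a nutshell: sequences of different lengths in one batch
    # need different alphas, so the rotary cos/sin tables cannot be shared.
    # (Hypothetical batch, for illustration only.)
    for seq_len in (4096, 8192, 16384, 32768):
        alpha = qwen_ntk_alpha(seq_len)
        print(seq_len, alpha, round(scaled_rope_base(10000.0, alpha, rotary_dim=128), 1))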

Contributor:
But seq_length for Qwen-14B is 2048: https://huggingface.co/Qwen/Qwen-14B/blob/main/config.json

RunningLeon (Collaborator, Author):
Yes, you are right. How do you use lmdeploy with Qwen-14B on the PyTorch engine? Is dynamic NTK useful to you? Why not use Qwen2?
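
To make the Qwen-14B point concrete: with a training seq_length of 2048, dynamic NTK would already activate for fairly modest prompts. Reusing the sketch above (the input length of 6000 is a made-up example):

    # Qwen-14B trains with seq_length = 2048, so NTK kicks in much earlier.
    print(qwen_ntk_alpha(6000, train_seq_length=2048))  # -> 7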

@lvhan028 merged commit 69207f0 into InternLM:main on Mar 28, 2024
5 checks passed