
Refactor chat template and support accurate name matching. #1216

Merged · 33 commits · Mar 12, 2024

Conversation

@AllentDan (Collaborator) commented Feb 29, 2024

  • Refactor chat template. Removed from_config, decorate_prompt, _translate_messages, and sampling_param from the chat template.
  • Support accurate name matching when resolving chat templates by model name.

@AllentDan removed the WIP label on Mar 1, 2024
@AllentDan (Collaborator, Author): Needs updating after #1168 is merged.

@lvhan028 (Collaborator) commented Mar 1, 2024: #1168 has been merged.

Review threads on lmdeploy/model.py (outdated, resolved)

Quoted diff context from lmdeploy/model.py:
eoh='\n',
assistant='ASSISTANT: ',
eoa='</s>',
separator='\n',
Collaborator: Does the vicuna template really contain \n?

Collaborator: The vicuna prompt after decoration differs from what it was before; please take a look.

Collaborator (Author): It is a little odd: the template in the FastChat documentation differs slightly from what the code actually produces. The documentation shows this:

A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions.

USER: Hello!
ASSISTANT: Hello!</s>
USER: How are you?
ASSISTANT: I am good.</s>

whereas the code produces this:

A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions. USER: Hello! ASSISTANT: Hi!</s>USER: How are you? ASSISTANT:
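
For reference, the code-side behavior can be reproduced with FastChat's conversation API; a minimal sketch, assuming FastChat is installed and that 'vicuna_v1.1' is the relevant template name:

# Sketch, not the PR's code: reproduce the FastChat rendering quoted above.
from fastchat.conversation import get_conv_template

conv = get_conv_template('vicuna_v1.1')
conv.append_message(conv.roles[0], 'Hello!')        # USER
conv.append_message(conv.roles[1], 'Hi!')           # ASSISTANT
conv.append_message(conv.roles[0], 'How are you?')
conv.append_message(conv.roles[1], None)            # leave the last turn open
print(conv.get_prompt())
# Turns come out joined with spaces and '</s>', matching the code output
# above rather than the newline-separated form in the documentation.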

Review threads on lmdeploy/model.py and examples/vl/qwen_model.py (outdated, resolved)
Comment on lines 169 to 180
# Opening tag emitted before each role's content ...
box_map = dict(user=self.user,
               assistant=self.assistant,
               system=self.system)
# ... and the closing tag emitted after it.
eox_map = dict(user=self.eoh,
               assistant=self.eoa + self.separator,
               system=self.eosys)
ret = ''
for message in messages:
    role = message['role']
    content = message['content']
    ret += f'{box_map[role]}{content}{eox_map[role]}'
# End with the assistant tag so generation continues as the assistant.
ret += f'{self.assistant}'
Collaborator: So messages2prompt means that if messages carries no system entry, no system prompt is added? Previously a default one seemed to be included.

Collaborator (Author): Right, this behavior is BC-breaking (except for the internlm models) and needs discussion. Previously, if the user did not pass meta_instruction, the default one was used; now only what the user passes is used, and if nothing is passed, no system prompt is applied.

Collaborator: It is indeed a bit of a dilemma. The starting point of this change is that it simplifies the logic, so translate_messages is no longer needed, right?

Collaborator (Author), quoting the above: Right. We actually do not know what ChatGPT does internally, but FastChat, for instance, seems to fall back to a default internally when the user does not pass one.
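
To make the behavior change concrete, a small illustration using the MODELS registry and messages2prompt discussed in this thread (the 'vicuna' template and the messages are just examples):

from lmdeploy.model import MODELS

model = MODELS.get('vicuna')()

# After this PR: no system entry in `messages`, so no system section is
# rendered in the prompt.
print(model.messages2prompt([{'role': 'user', 'content': 'Hello!'}]))

# An explicit system message is rendered. Previously a default
# meta_instruction was prepended even when the caller passed none.
print(model.messages2prompt([
    {'role': 'system', 'content': 'You are a helpful assistant.'},
    {'role': 'user', 'content': 'Hello!'},
]))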

@grimoire (Collaborator) commented Mar 6, 2024:

tokenizer_config.json contains a lot of useful information that might help ease chat-template handling. For example, gemma-7b-it ships bos_token, eos_token, and chat_template.
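
A sketch of how that information can be consumed, assuming the Hugging Face transformers tokenizer API (apply_chat_template reads the chat_template field shipped in tokenizer_config.json):

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained('google/gemma-7b-it')
messages = [{'role': 'user', 'content': 'hi, how are you'}]
# Render the conversation with the model's own chat_template, using its
# bos_token/eos_token conventions, and leave the assistant turn open.
prompt = tokenizer.apply_chat_template(messages,
                                       tokenize=False,
                                       add_generation_prompt=True)
print(prompt)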

Review thread on lmdeploy/cli/utils.py (outdated, resolved)
Merge conflicts resolved in lmdeploy/cli/cli.py
Review thread on lmdeploy/model.py (outdated, resolved)
@RunningLeon (Collaborator) left a comment: LGTM

@lvhan028 (Collaborator) commented Mar 7, 2024:

I am running a BC-breaking test, comparing the results between v0.2.5 and this PR; they are different.

from lmdeploy.model import MODELS

prompt = 'hi, how are you'

chat_templates = [
    'internlm', 'llama', 'base',
    'wizardlm', 'vicuna',
    'internlm-chat', 'internlm-chat-7b', 'internlm-chat-20b', 'internlm-chat-7b-8k',
    'internlm2-chat', 'internlm2-chat-1_8b', 'internlm2-chat-7b', 'internlm2-chat-20b',
    'baichuan-7b',
    'baichuan2-7b',
    'llama2', 'llama-2', 'llama-2-chat',
    'qwen-7b', 'qwen-14b',
    'codellama',
    'falcon',
    'chatglm2-6b',
    'solar', 'solar-70b',
    'ultracm',
    'ultralm',
    'yi', 'yi-chat', 'yi-200k', 'yi-34b',
    'Mistral-7B-Instruct', 'Mixtral-8x7B-Instruct',
    'gemma',
    'deepseek-chat'
]

# Render the prompt with every registered chat template so the output can be
# diffed against v0.2.5.
for _template in chat_templates:
    print(f'---------{_template}---------')
    model = MODELS.get(_template)()
    print(model.get_prompt(prompt))

@AllentDan (Collaborator, Author): One more issue: with base models, name matching now almost always fails to find a matching template name.
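
A hypothetical sketch of that concern (KNOWN_TEMPLATES and match_template are illustrative names, not the PR's actual code): with exact matching, base-model names rarely coincide with a registered template name, so they fall through to a default.

# Illustrative only: exact ("accurate") matching against a template registry.
KNOWN_TEMPLATES = {'internlm2-chat-7b', 'vicuna', 'llama-2', 'base'}

def match_template(model_path: str) -> str:
    # Reduce an 'org/Model-Name' style path to a lower-cased key.
    name = model_path.split('/')[-1].lower()
    # Exact match only: a base model such as 'internlm2-7b' will not hit
    # 'internlm2-chat-7b', hence the fallback below.
    return name if name in KNOWN_TEMPLATES else 'base'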

Review thread on lmdeploy/model.py (outdated, resolved)
@lvhan028 merged commit 24bd4b9 into InternLM:main on Mar 12, 2024; 9 checks passed.