
(Probably a bug): chatglm3 does not prepend <|assistant|> during agent-tuning data preprocessing #2421

Closed

1 task done

13269279918 opened this issue Feb 4, 2024 · 7 comments

Labels

solved

13269279918 commented Feb 4, 2024

Reminder

I have read the README and searched the existing issues.

Reproduction

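In the data, for entries such as {"from": "gpt", "content": "xxx"}, gpt should correspond to assistant.
However, the preprocessing code has
format_assistant=StringFormatter(slots=["\n", "{{content}}"]),
which does not prepend <|assistant|> before {{content}}.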

Expected behavior

No response

System Info

No response

Others

No response

Owner

hiyouga commented Feb 4, 2024

It is already concatenated.

LLaMA-Factory/src/llmtuner/data/template.py

Line 336 in db0ab4d

    
           format_user=StringFormatter(slots=[{"token": "<|user|>"}, "\n", "{{content}}", {"token": "<|assistant|>"}]),

hiyouga added the invalid label

hiyouga closed this as completed

Author

13269279918 commented Feb 4, 2024

It is already concatenated.

LLaMA-Factory/src/llmtuner/data/template.py

Line 336 in db0ab4d

format_user=StringFormatter(slots=[{"token": "<|user|>"}, "\n", "{{content}}", {"token": "<|assistant|>"}]),

It is appended after format_user, but not after format_observation. Run it with glaive and look at the printed data sample; the problem is visible there.
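For anyone who wants to check a printed sample mechanically rather than by eye, here is a minimal sketch. The helper name `observations_missing_assistant` and the sample strings are made up for illustration and are not part of LLaMA-Factory; it only assumes that the role markers appear as literal text in the decoded sample.

```python
import re

# Role markers as they appear in a decoded chatglm3 sample.
ROLE_TOKEN = re.compile(r"<\|(?:user|assistant|observation)\|>")


def observations_missing_assistant(rendered: str) -> int:
    """Count <|observation|> markers not followed by <|assistant|> as the next role marker."""
    tokens = ROLE_TOKEN.findall(rendered)
    missing = 0
    for i, tok in enumerate(tokens):
        if tok != "<|observation|>":
            continue
        nxt = tokens[i + 1] if i + 1 < len(tokens) else None
        if nxt != "<|assistant|>":
            missing += 1
    return missing


# Invented samples: the first lacks the marker after the observation, the second has it.
bad_sample = "<|user|>\nquestion<|assistant|>\ntool call<|observation|>\ntool result\nfinal answer"
good_sample = "<|user|>\nquestion<|assistant|>\ntool call<|observation|>\ntool result<|assistant|>\nfinal answer"

print(observations_missing_assistant(bad_sample))   # 1
print(observations_missing_assistant(good_sample))  # 0
```

A non-zero count on a glaive sample would point to the behaviour described here.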

Author

13269279918 commented Feb 4, 2024

The <|assistant|> after format_user is reserved for the function_call; a <|assistant|> should likewise be reserved for gpt after the observation, otherwise the final concatenated data is wrong.
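The effect can be illustrated with a small stand-alone sketch. This is not the llmtuner `StringFormatter`: `render` is a hypothetical helper that just joins slots. The user and assistant slot lists are copied from the lines quoted in this thread, while both observation slot lists and the sample turns are assumptions made for the example.

```python
def render(slots, content):
    """Hypothetical slot renderer: dict slots emit their token, string slots fill {{content}}."""
    parts = []
    for slot in slots:
        if isinstance(slot, dict):
            parts.append(slot["token"])
        else:
            parts.append(slot.replace("{{content}}", content))
    return "".join(parts)


# Slot lists quoted in this thread.
USER = [{"token": "<|user|>"}, "\n", "{{content}}", {"token": "<|assistant|>"}]
ASSISTANT = ["\n", "{{content}}"]

# Assumed for illustration: an observation format without and with the trailing marker.
OBSERVATION_WITHOUT = [{"token": "<|observation|>"}, "\n", "{{content}}"]
OBSERVATION_WITH = OBSERVATION_WITHOUT + [{"token": "<|assistant|>"}]


def build(observation_slots):
    """Concatenate user -> function_call -> observation -> gpt for one invented dialogue."""
    return "".join([
        render(USER, "What is the weather in Paris?"),
        render(ASSISTANT, "get_weather(city='Paris')"),
        render(observation_slots, '{"temp_c": 18}'),
        render(ASSISTANT, "It is 18 degrees in Paris."),
    ])


without_marker = build(OBSERVATION_WITHOUT)
with_marker = build(OBSERVATION_WITH)

print(without_marker.count("<|assistant|>"))  # 1: only the function_call turn is marked
print(with_marker.count("<|assistant|>"))     # 2: the final gpt turn is marked as well
```

In the first string the final gpt reply follows the tool result with no role marker in between, which is the malformed concatenation being reported.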

hiyouga added solved and removed invalid labels

hiyouga reopened this

Owner

hiyouga commented Feb 4, 2024

Thanks for the reminder, fixed.

hiyouga closed this as completed in

3dc86c4

Author

13269279918 commented Feb 5, 2024 •

edited

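Thanks for the reminder, fixed.

I suggest carefully reviewing the agent-tool training code; there are probably many more hidden problems in this part, because the training results are very poor.
For example, I ran into this problem: in the output, a space is added after every character. (I have not found the cause yet.)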

Owner

hiyouga commented Feb 5, 2024

Could you describe it in more detail?

cgq0816 commented Mar 20, 2024

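The <|assistant|> after format_user is reserved for the function_call; a <|assistant|> should likewise be reserved for gpt after the observation, otherwise the final concatenated data is wrong.

Hello, may I ask whether the model you trained can use function calling? The one I trained myself cannot. If yours can, which training parameters did you use?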

sangttruong pushed a commit to painkillernhat/LLaMA-Factory that referenced this issue


          fix hiyouga#2421

d63a3a7
