(Probably a bug): chatglm3 does not concatenate <|assistant|> during agent tuning data preprocessing #2421
Comments
It is already concatenated: LLaMA-Factory/src/llmtuner/data/template.py, line 336 in db0ab4d
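It is appended after format_user, but not after format_observation. Run it with glaive and look at the printed data samples; the problem is visible there.
The <|assistant|> after format_user is reserved for function_call. An <|assistant|> should also be reserved after observation for the gpt turn; otherwise the final concatenated data is wrong.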
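The concatenation described above can be sketched as follows. This is a minimal illustration, not LLaMA-Factory's actual template code: the `render` helper, the `human` role name, and the sample conversation are assumptions made for the example, and the formatting is reduced to plain string joins.

```python
# Hypothetical stand-ins for the template slots discussed in this thread.
USER = "<|user|>"
ASSISTANT = "<|assistant|>"
OBSERVATION = "<|observation|>"


def render(turns, assistant_after_observation):
    """Concatenate (role, content) turns into one training string."""
    parts = []
    for role, content in turns:
        if role == "human":
            # The user turn ends with <|assistant|>, reserved for the next reply.
            parts.append(f"{USER}\n{content}{ASSISTANT}")
        elif role == "observation":
            parts.append(f"{OBSERVATION}\n{content}")
            if assistant_after_observation:
                # The proposed fix: reserve <|assistant|> for the gpt turn too.
                parts.append(ASSISTANT)
        else:
            # "function_call" or "gpt": mirrors slots=["\n", "{{content}}"]
            parts.append(f"\n{content}")
    return "".join(parts)


# Made-up sample conversation in the shape of a tool-calling dialogue.
turns = [
    ("human", "What is the weather in Paris?"),
    ("function_call", '{"name": "get_weather", "arguments": {"city": "Paris"}}'),
    ("observation", '{"temp_c": 18}'),
    ("gpt", "It is 18 degrees Celsius in Paris."),
]

broken = render(turns, assistant_after_observation=False)
fixed = render(turns, assistant_after_observation=True)
```

Printing `broken` and `fixed` side by side shows the difference: in `broken` the gpt reply follows the observation content directly, while in `fixed` it is introduced by its own <|assistant|> token.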
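Thanks for the reminder, this has been fixed.
I suggest carefully reviewing the agent-tool training code. There are probably many more hidden problems in this part, because the training results are very poor.
Could you describe this in more detail?
Hello, can the model you trained actually use function calling? The one I trained cannot. If yours can, which training parameters did you use?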
Reminder
Reproduction
In the data, {"from": "gpt", "content": "xxx"}, gpt should correspond to assistant.
But in the preprocessing code
format_assistant=StringFormatter(slots=["\n", "{{content}}"]),
<|assistant|> is not concatenated before {{content}}.
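One rough way to spot this in a printed sample is to check whether each assistant reply is directly preceded by the role token. The helper below is a hypothetical sketch (its name, and the assumption that a reply is introduced by `<|assistant|>` followed by a newline, are not taken from the project's code):

```python
ASSISTANT_PREFIX = "<|assistant|>\n"  # assumed layout: role token, newline, content


def find_unmarked_replies(rendered, replies):
    """Return the replies that are not directly preceded by the assistant prefix."""
    return [r for r in replies if (ASSISTANT_PREFIX + r) not in rendered]


# Made-up rendered sample where the final reply lacks its role token.
sample = (
    "<|user|>\nhi<|assistant|>\n"
    '{"name": "lookup"}'
    "<|observation|>\n"
    '{"ok": true}'
    "\nDone."
)
missing = find_unmarked_replies(sample, ['{"name": "lookup"}', "Done."])
```

Here `missing` lists the replies that the rendered string leaves without an <|assistant|> marker, which is the symptom reported in this issue.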
Expected behavior
No response
System Info
No response
Others
No response