Llama3-Chinese-instruct-DPO-beta0.5 inference output contains a large amount of repeated generation #55
Comments
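We have not yet tried vLLM with the hyperparameters you set. For API deployment, you can try the approach in the following document, which does not produce repetition: https://github.com/CrazyBoyM/llama3-Chinese-chat/tree/main/deploy/API Code: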
You could try setting the repetition_penalty coefficient.
OK, thanks!
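To make the repetition_penalty suggestion above concrete, here is a minimal sketch of how this kind of penalty is commonly applied to next-token logits: the score of every token that has already appeared is pushed down before sampling. The function name `apply_repetition_penalty` and the toy values are hypothetical and for illustration only; this is not the code used by this repository or by vLLM. In vLLM the corresponding knob is the `repetition_penalty` field of `SamplingParams`, where values above 1.0 discourage repetition.

```python
from typing import List, Sequence


def apply_repetition_penalty(
    logits: Sequence[float],
    seen_token_ids: Sequence[int],
    penalty: float,
) -> List[float]:
    """Return a copy of `logits` with already-seen tokens made less likely.

    Hypothetical helper illustrating the common multiplicative penalty:
    positive logits are divided by `penalty`, negative logits are multiplied
    by it, so a penalty > 1.0 always lowers the score of a seen token.
    """
    if penalty <= 0:
        raise ValueError("penalty must be positive")
    out = list(logits)
    for token_id in set(seen_token_ids):  # penalize each seen token once
        if out[token_id] > 0:
            out[token_id] /= penalty
        else:
            out[token_id] *= penalty
    return out


# Toy vocabulary of 4 tokens; tokens 0 and 1 were already generated.
logits = [2.0, -1.0, 0.5, 3.0]
penalized = apply_repetition_penalty(logits, seen_token_ids=[0, 1, 0], penalty=2.0)
print(penalized)  # [1.0, -2.0, 0.5, 3.0]
```

A penalty of 1.0 leaves the logits unchanged, so it is usually worth starting slightly above 1.0 and raising it only if the repetition persists, since large values can degrade fluency.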
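Hello! Thank you for developing and open-sourcing such an influential series of Chinese Llama3 models. I have a few questions I would like to ask the developers.
I downloaded the open-source Llama3-Chinese-instruct-DPO-beta0.5 model weights from the ModelScope repository and ran inference with vLLM on the Chinese dataset in X-AlpacaEval.
The inference parameters were set as follows:
Many of the generated responses contain repeated text, for example:
I ran inference directly with the downloaded Llama3-Chinese-instruct-DPO-beta0.5 weights, without any modification or fine-tuning. What might cause this behavior? Did you run into anything similar during your own testing, and do you have any good solutions we could discuss?
Thank you very much!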