Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]关于输入重复短语导致无限输出的问题 #36

Open
tangsipeng opened this issue Jan 1, 2024 · 9 comments
Open

[Bug]关于输入重复短语导致无限输出的问题 #36

tangsipeng opened this issue Jan 1, 2024 · 9 comments
Labels
duplicate This issue or pull request already exists

Comments

@tangsipeng
Copy link

Describe the bug
Sakura-13B-LNovel-v0_8-4bit 模型
输入为:
<reserved_106>将下面的日文文本翻译成中文:はっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっはっ<reserved_107>
输出为:
image

To Reproduce
我是按照这个python教程进行部署的
https://github.com/SakuraLLM/Sakura-13B-Galgame/wiki/Python%E9%83%A8%E7%BD%B2%E6%95%99%E7%A8%8B

image

Expected behavior
A clear and concise description of what you expected to happen.

Screenshots
If applicable, add screenshots to help explain your problem.

Desktop (please complete the following information):

  • OS: [Linux]
  • Python version [3.11]
  • Whether use GPU, whether is NVIDIA GPU. [3090、A100均出现此问题]
  • The version of the model you use [Sakura-13B-LNovel-v0_8-4bit]

Additional context
Add any other context about the problem here.

@sakura-umi sakura-umi added the duplicate This issue or pull request already exists label Jan 1, 2024
@sakura-umi
Copy link
Collaborator

#35

@Agateshefi
Copy link

@pipixia244 能请问下如何增加frequency_penalty么,云端部署着实找不到

@sakura-umi
Copy link
Collaborator

@pipixia244 能请问下如何增加frequency_penalty么,云端部署着实找不到

跟部署方式无关,发送请求的时候请求体带上frequency_penalty就行。如使用openai库,直接在参数中指定。

仓库中已经给出了例子:https://github.com/SakuraLLM/Sakura-13B-Galgame/blob/2c5fb94c88cb8952c0d4ed78de4fe8c33954e22b/tests/example_openai.py#L44

@Agateshefi
Copy link

@pipixia244 抱歉再请问用的是kaggle平台并用轻小说机翻机器人该把这行加到哪里呢?

@sakura-umi
Copy link
Collaborator

sakura-umi commented Jan 4, 2024

@pipixia244 抱歉再请问用的是kaggle平台并用轻小说机翻机器人该把这行加到哪里呢?

网站会自动检测退化并应用参数,不需要你自己做什么。

其他支持的项目如GalTransl和LunaTranslator也是如此。只有自己调用api时需要考虑这个问题。

@Agateshefi
Copy link

@pipixia244 那在网站也遇到这个问题是因为显卡不行吗

@sakura-umi
Copy link
Collaborator

@pipixia244 那在网站也遇到这个问题是因为显卡不行吗

遇到这个问题是正常的,是模型固有的问题,能够通过设置frequency_penalty的参数解决,而不是这个问题就不出现了

@Agateshefi
Copy link

@pipixia244 好的谢谢解惑

@FFreQD
Copy link

FFreQD commented Jun 22, 2024

@pipixia244 抱歉再请问用的是kaggle平台并用轻小说机翻机器人该把这行加到哪里呢?

网站会自动检测退化并应用参数,不需要你自己做什么。

其他支持的项目如GalTransl和LunaTranslator也是如此。只有自己调用api时需要考虑这个问题。

大佬我用Kaggle+GalTransl遇到同样的问题,我看Kaggle设置的是0.1,不知道怎么修改成0.2

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
duplicate This issue or pull request already exists
Projects
None yet
Development

No branches or pull requests

4 participants