New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
公布的LongLM模型在OutGen任务的生成结果不对 #4
Comments
感谢关注!填空任务和续写任务是LongLM的预训练任务,可以直接不经过finetune就进行生成。OutGen任务是我们设计LOT benchmark中的的一个下游任务,需要对LongLM在相应的训练集上进行训练才能正确生成。 |
请问之前在OutGen微调后的模型还有么,谢谢。可以分享啊。 |
您好,我在测试LongLM模型的在填空任务时,用的您上面的输入示例,但是得到的输出不尽人意,您可以提供一下你当时的测试代码吗?非常感谢~
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
你好,测试LongLM模型,在填空任务的输出正确:
可以得到正确输出:
但是,在OutGen任务的生成结果不对,似乎没有特殊处理#字符。
我用 https://github.com/thu-coai/LOT-LongLM/blob/7cf5377f20656732542221a876ae7fb86e85fcdc/baselines/generation/test.source 的输入:
得到的输出:
它认为#是普通字符,所以产生了大量#。请问OutGen任务是否需要不同的输入格式,谢谢。
The text was updated successfully, but these errors were encountered: