Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于standard benchmark #24

Open
cmnfriend opened this issue Jun 15, 2024 · 0 comments
Open

关于standard benchmark #24

cmnfriend opened this issue Jun 15, 2024 · 0 comments

Comments

@cmnfriend
Copy link
Owner

那个由4个任务组成的benchmark,最近double check了一下发现学习率设得足够小的话,全量微调baseline效果也会非常好(几乎不遗忘),说明这个benchmark对于t5已经没有挑战了,对于后续的大模型同理,所以不建议follow...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant