
[Open Source Internship] bigbird_pegasus model fine-tuning #1972

Open
wants to merge 2 commits into
base: master


outbreak-sen commented Mar 4, 2025

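This PR adds a fine-tuning experiment for the bigbird_pegasus model on the databricks/databricks-dolly-15k dataset.
Task link: https://gitee.com/mindspore/community/issues/IAUPBF
I wrote the transformers + pytorch benchmark (run on a 4060) myself; the repository is at https://github.com/outbreak-sen/bigbird_pegasus_finetune
The code changes are in llm/finetune/bigbird_prgasus and contain only the mindnlp + mindspore implementation.
The experimental results are as follows.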

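bigbird_pegasus model fine-tuning comparison

train loss

Training loss per epoch during fine-tuning:

| epoch | mindnlp+mindspore | transformer+torch(4060) |
| --- | --- | --- |
| 1 | 2.0958 | 8.7301 |
| 2 | 1.969 | 8.1557 |
| 3 | 1.8755 | 7.7516 |
| 4 | 1.8264 | 7.5017 |
| 5 | 1.7349 | 7.2614 |
| 6 | 1.678 | 7.0559 |
| 7 | 1.6937 | 6.8405 |
| 8 | 1.654 | 6.7297 |
| 9 | 1.6365 | 6.7136 |
| 10 | 1.7003 | 6.6279 |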
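
To make the trend in the training-loss table easier to read, here is a minimal sketch that summarizes each column. The loss values are copied from the table; the helper name `summarize` is hypothetical and not part of this PR.

```python
# Per-epoch training loss, copied from the table (epochs 1..10).
mindnlp_loss = [2.0958, 1.969, 1.8755, 1.8264, 1.7349,
                1.678, 1.6937, 1.654, 1.6365, 1.7003]
torch_loss = [8.7301, 8.1557, 7.7516, 7.5017, 7.2614,
              7.0559, 6.8405, 6.7297, 6.7136, 6.6279]


def summarize(losses):
    """Hypothetical helper: summarize one loss curve.

    Returns a tuple of:
      - relative drop from the first to the last epoch,
      - 1-based epoch with the lowest loss,
      - 1-based epochs where the loss rose compared to the previous epoch.
    """
    rel_drop = (losses[0] - losses[-1]) / losses[0]
    best_epoch = min(range(len(losses)), key=losses.__getitem__) + 1
    rises = [i + 1 for i in range(1, len(losses)) if losses[i] > losses[i - 1]]
    return rel_drop, best_epoch, rises


for name, losses in [("mindnlp+mindspore", mindnlp_loss),
                     ("transformer+torch(4060)", torch_loss)]:
    rel_drop, best_epoch, rises = summarize(losses)
    print(f"{name}: relative drop {rel_drop:.1%}, "
          f"best epoch {best_epoch}, loss rose at epochs {rises}")
```

On these numbers the mindnlp+mindspore loss reaches its minimum at epoch 9 and rises at epochs 7 and 10, while the transformer+torch loss decreases at every epoch. The two columns come from different stacks and runs, so the absolute values may not be directly comparable.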

eval loss

Evaluation loss comparison:

| epoch | mindnlp+mindspore | transformer+torch(4060) |
| --- | --- | --- |
| 1 | 2.1257965564727783 | 6.3235931396484375 |
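
One way to read the eval loss is to convert it to perplexity. The sketch below assumes the reported values are mean per-token cross-entropy in nats, which the PR does not state; if the loss is averaged differently, the conversion does not apply.

```python
import math

# Eval loss values copied from the table above.
eval_loss = {
    "mindnlp+mindspore": 2.1257965564727783,
    "transformer+torch(4060)": 6.3235931396484375,
}


def perplexity(loss):
    """Perplexity under the ASSUMPTION that `loss` is mean token-level
    cross-entropy measured in nats: ppl = exp(loss)."""
    return math.exp(loss)


for name, loss in eval_loss.items():
    print(f"{name}: eval loss {loss:.4f}, perplexity {perplexity(loss):.2f}")
```

Because perplexity is exponential in the loss, the gap of about 4.2 between the two eval losses corresponds to a much larger ratio in perplexity.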
