Reproducing the magicoder-S-DS-6.7B results on 8 A40 GPUs #35

Closed
tusiqi1 opened this issue Mar 8, 2024 · 0 comments

tusiqi1 commented Mar 8, 2024

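Running the script from README-DEV.md directly with accelerate failed with an out-of-memory error during training, so I switched to launching with DeepSpeed stage 1. All other parameters were left at their defaults. Because I am training on 8 GPUs, the iteration step count was reduced to 1/4.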
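One thing worth checking when the GPU count changes is whether the global batch size stayed the same. The sketch below shows that arithmetic. It is a minimal illustration only: the helper names and every number in it are hypothetical placeholders, not the values from README-DEV.md, and the step count is a rough estimate rather than what any particular trainer reports.

```python
import math


def global_batch_size(num_gpus: int, per_device_batch: int, grad_accum_steps: int) -> int:
    """Samples consumed per optimizer step under plain data parallelism."""
    return num_gpus * per_device_batch * grad_accum_steps


def optimizer_steps(num_samples: int, num_gpus: int, per_device_batch: int,
                    grad_accum_steps: int, epochs: int = 1) -> int:
    """Approximate number of optimizer steps for the whole run."""
    gbs = global_batch_size(num_gpus, per_device_batch, grad_accum_steps)
    return epochs * math.ceil(num_samples / gbs)


# Hypothetical reference setup vs. a hypothetical 8-GPU setup.
# With 4x the GPUs, gradient accumulation must drop to 1/4 to keep
# the global batch size (and therefore the step count) unchanged.
reference = dict(num_gpus=2, per_device_batch=2, grad_accum_steps=128)
mine = dict(num_gpus=8, per_device_batch=2, grad_accum_steps=32)

print(global_batch_size(**reference), global_batch_size(**mine))
```

If the two printed values differ, the learning-rate schedule and the number of updates no longer match the reference run, which could by itself shift the final score.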

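After running the experiments, I found that:

  1. Training speed dropped significantly.
  2. Neither stage 1 nor stage 2 training reaches 60%.

My question: can different hardware, combined with adding DeepSpeed, really make the results this much worse?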

tusiqi1 closed this as completed Mar 11, 2024