We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
您好,感谢您的工作。我想请问一下8张A100 80GB上微调flan-t5-11B原论文是如何设置各项参数的。例如deepspeed选择什么模式,batch_size等等参数
The text was updated successfully, but these errors were encountered:
deepspeed使用zero3, 设置每张卡batch_size为2,梯度累计到为16,训练5个epoch
Sorry, something went wrong.
nitwtog
No branches or pull requests
您好,感谢您的工作。我想请问一下8张A100 80GB上微调flan-t5-11B原论文是如何设置各项参数的。例如deepspeed选择什么模式,batch_size等等参数
The text was updated successfully, but these errors were encountered: