为长文本数据和短文本数据指定不同的batchsize和梯度累积 #10503
Unanswered
xuzhang0112
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
如题,直接将长文本和短文本混合训练效率很低,希望根据长度做分桶采样,是否有推荐的最佳实践?(例如修改llamafactory库代码)
Beta Was this translation helpful? Give feedback.
All reactions