We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
在目前的情形下,pipeline parallel会在第一份数据进入时固定logits等的形状,但事实上,下游任务中很多时候最好能支持变长以节省算力,而不是进行过多的padding。
The text was updated successfully, but these errors were encountered:
Merge pull request InternLM#9 from yingtongxiong/feat/refactor-fstp-h…
d87d9f9
…andler feat(*) refactor fstp handler
srcIndex < srcSelectDimSize
sunpengsdu
No branches or pull requests
Describe the feature
在目前的情形下,pipeline parallel会在第一份数据进入时固定logits等的形状,但事实上,下游任务中很多时候最好能支持变长以节省算力,而不是进行过多的padding。
Will you implement it?
The text was updated successfully, but these errors were encountered: