Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

训练问题 #33

Open
Dominic-ZZ opened this issue Jun 24, 2021 · 0 comments
Open

训练问题 #33

Dominic-ZZ opened this issue Jun 24, 2021 · 0 comments

Comments

@Dominic-ZZ
Copy link

1.为什么batchsize只能设置为1?我尝试过设置为6,在3个epoch之后loss都变成了Nan,请问调大batchsize需要改代码的哪些地方?
2,GPU利用率不稳定的问题,当batchsize为1时,GPU的利用率在20%-90%之间波动,我试着调大了batchsize该问题仍然存在,请问该如何解决?修改dataloader的num_workers以及pin_memory也都无效。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant