How do I train on a single GPU? #7
Comments
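@xinhaojin Hello, can this not be trained on a single GPU? I only have one GPU and running it fails with an error that seems to be related to the graphics card.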
Hello, set --nproc_per_node to 1 to train on a single GPU.
@jyqi Hi there, I set it up for single-GPU training as you suggested, but got the following error:
2022-11-30 10:10:14.802 | ERROR | main::68 - An error has been caught in function '', process 'MainProcess' (25404), thread 'M
File "tools\train.py", line 53, in main
File "D:\Anaconda3\envs\DAMO-YOLO\lib\site-packages\torch\distributed\distributed_c10d.py", line 421, in init_process_group
File "D:\Anaconda3\envs\DAMO-YOLO\lib\site-packages\torch\distributed\rendezvous.py", line 82, in rendezvous
RuntimeError: No rendezvous handler for env://
What is causing this?
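Hello, I see you changed the backend to gloo. Is that because you are training on Windows? We tried single-GPU training on Linux and never ran into this error. It may be worth checking whether the platform is the cause.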
@jyqi OK, thanks for the suggestion.
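To follow up on the platform question, here is a minimal sketch of choosing the process-group settings per platform for a single-process run. It is not the project's code and has not been tested against the error reported in this thread. The helper names and the rendezvous file name are hypothetical. It assumes that the NCCL backend is unavailable on Windows, so gloo is used there, and that a file:// rendezvous is used in place of env://.

```python
import sys
import tempfile
from pathlib import Path

# Hypothetical rendezvous file, used only for this sketch.
SYNC_FILE = Path(tempfile.gettempdir()) / "single_gpu_rendezvous"


def single_process_init_kwargs(platform: str = sys.platform) -> dict:
    """Build keyword arguments for torch.distributed.init_process_group."""
    if platform.startswith("win"):
        # Assumption: on Windows, use gloo with a file:// rendezvous and
        # state world_size and rank explicitly for a one-process group.
        return {
            "backend": "gloo",
            "init_method": SYNC_FILE.as_uri(),
            "world_size": 1,
            "rank": 0,
        }
    # Elsewhere, keep env:// and let the launcher provide rank, world size,
    # and the master address through environment variables.
    return {"backend": "nccl", "init_method": "env://"}


def init_single_process() -> None:
    """Initialise a one-process group; requires PyTorch to be installed."""
    import torch.distributed as dist  # imported lazily on purpose

    kwargs = single_process_init_kwargs()
    if kwargs["init_method"].startswith("file://"):
        # A leftover file from an earlier run can break a file:// rendezvous.
        SYNC_FILE.unlink(missing_ok=True)
    dist.init_process_group(**kwargs)


if __name__ == "__main__":
    print(single_process_init_kwargs())
```

Only `init_single_process()` touches PyTorch, so the settings can be inspected on any machine before they are wired into a training script.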
As the title says.