多GPU训练Yolov3 #16

saigequn · 2019-01-18T01:54:49Z

两个GPU训练Yolov3

yolov3.yml修改

gpus: "0,1"
mini_batch_size:8

GPU使用情况如下

Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla P100-PCIE...  Off  | 00000000:05:00.0 Off |                    0 |
| N/A   58C    P0   237W / 250W |   8835MiB / 16160MiB |     100%      Default|
+-------------------------------+----------------------+----------------------+
|   1  Tesla P100-PCIE...  Off  | 00000000:89:00.0 Off |                    0 |
| N/A   36C    P0    25W / 250W |     11MiB / 16160MiB |      0%      Default

其中一个GPU在闲置

The text was updated successfully, but these errors were encountered:

mileistone · 2019-01-18T02:15:16Z

We don't support multi gpus now. It's easy, you can implement it in 10 minutes.

tonysy · 2019-03-17T15:03:27Z

Could you share how to revise the code to achieve multi-gpu train? I found it difficult to implement.

yutaizhou · 2019-05-31T18:43:32Z

Can you share a guide on multi gpu implementation? Or give some good resources?

gongh4 · 2020-07-23T11:13:17Z

First, update the cfg set gpus:"0,1"
Then, in _voc_train.py, insert the code after
if self.cuda:
net.cuda()
as
net.net = torch.nn.DataParallel(net.net)

Then, you can train using multi-gpu

mileistone closed this as completed Jan 18, 2019

mileistone mentioned this issue Jan 21, 2019

how to train using multi-gpu? #22

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

多GPU训练Yolov3 #16

多GPU训练Yolov3 #16

saigequn commented Jan 18, 2019

mileistone commented Jan 18, 2019

tonysy commented Mar 17, 2019

yutaizhou commented May 31, 2019

gongh4 commented Jul 23, 2020

多GPU训练Yolov3 #16

多GPU训练Yolov3 #16

Comments

saigequn commented Jan 18, 2019

mileistone commented Jan 18, 2019

tonysy commented Mar 17, 2019

yutaizhou commented May 31, 2019

gongh4 commented Jul 23, 2020