
training with multiple GPU #6

Open
gokceay opened this issue May 8, 2021 · 5 comments
gokceay commented May 8, 2021

Hello Mrs. Liyues,

I would like to train with more than one GPU; how can I achieve this? In the trainer net, at line 25: self.model = nn.DataParallel(self.model).cuda(), I should pass the GPU ids to DataParallel besides self.model, right? Should I also increase the batch size so it is larger than the number of GPUs used? Also, did you try a batch size greater than 1? If so, what was the result?
Thanks in advance.

@gokceay gokceay changed the title training with a new dataset training with more gpu May 22, 2021
@gokceay gokceay changed the title training with more gpu training with multiple GPU May 22, 2021
@yinyin-llll

Hi, I also want to retrain the model, but it needs the CSV file with annotations. Do you know what this file is, or how to create it? Thank you very much.

@quocbao2772004

@yinyin-llll Hi, have you successfully retrained the model?


yinyin-llll commented Sep 3, 2024 via email

Owner

liyues commented Sep 3, 2024

Hi all, thanks for your interest in this work and code. This codebase is from quite a few years ago, and many changes have happened since; I will try to recall things and answer these questions as best I can, but apologies if anything is unclear.
For the CSV file, that is just a file storing the paths to the data files where your data lives. You will want to change this file so it points to your own data paths, so the images can be loaded. Hope this helps, and let me know if there are any questions.
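
For illustration, a minimal version of such a CSV might look like the following. The single image_path column here is an assumption for this sketch, not necessarily the repo's actual schema, so adjust it to whatever the data loader expects:

```
image_path
data/train/img_0001.png
data/train/img_0002.png
data/val/img_0103.png
```

And a minimal Python sketch that reads it (the file name data_list.csv is also just an example):

```python
import csv

# Collect the image paths listed in the CSV so a dataset class can load them.
# "image_path" is the hypothetical column name from the example above.
with open("data_list.csv", newline="") as f:
    paths = [row["image_path"] for row in csv.DictReader(f)]

print(f"{len(paths)} images listed")
```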

Owner

liyues commented Sep 3, 2024

For the multi-GPU training, I think you are right: you can assign a different batch of data to each GPU, which lets you increase your total training batch size. That is to say, this is data-parallel training. The PyTorch version used here may be quite old by now, so you may want to check how data-parallel training works in the newer PyTorch versions. Hope this is helpful.
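
As a rough sketch of that idea (illustrative only, not the repo's trainer code: the stand-in model, dummy dataset, device ids, and per-GPU batch size are all assumptions; note that recent PyTorch documentation recommends DistributedDataParallel over DataParallel for multi-GPU training):

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# Minimal data-parallel sketch. nn.DataParallel splits each input batch
# across the listed GPUs, so the batch size should be at least the number
# of GPUs (ideally a multiple of it).
device_ids = [0, 1]                           # illustrative: use GPUs 0 and 1
model = nn.Linear(32, 1)                      # stand-in for self.model
model = nn.DataParallel(model, device_ids=device_ids).cuda()

dataset = TensorDataset(torch.randn(64, 32))  # dummy data for the sketch
loader = DataLoader(dataset, batch_size=8 * len(device_ids), shuffle=True)

for (x,) in loader:
    out = model(x.cuda())                     # DataParallel scatters x along
                                              # dim 0 and gathers the outputs
```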
