Loss is #21

HuaYuexia · 2024-02-03T05:14:45Z

No description provided.

HuaYuexia · 2024-02-03T05:15:35Z

Loss is nan

AmingWu · 2024-02-03T05:36:01Z

The learning rate is set to 0.001 and the batch size is set to 4. Feel free to contact me if you have further questions. Thanks.

HuaYuexia · 2024-02-05T02:01:10Z

> The learning rate is set to 0.001. And the batchsize is set to 4. Welcome to communicate with me. Thanks.

My learning rate is also set to 0.001. The difference is that I used two GPUs with a batch size of 8. The only way to get training to run properly is to reduce the learning rate to 1e-5, but then it does not converge. Why is that? I sincerely look forward to your guidance, although this may be a naive question for you.
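For anyone hitting the same symptom (NaN at 0.001, no convergence at 1e-5), two generic things that are often tried are a short learning-rate warmup and gradient-norm clipping with an explicit non-finite check. The sketch below is plain Python and is not taken from this repository. The function names `warmup_lr` and `clip_by_global_norm`, and the defaults `warmup_steps=500` and `max_norm=10.0`, are hypothetical choices for illustration only. The thread does not record what actually fixed the problem.

```python
import math


def warmup_lr(step, base_lr=1e-3, warmup_steps=500):
    """Linearly ramp the learning rate up to base_lr over warmup_steps.

    All names and default values here are illustrative assumptions.
    """
    if warmup_steps <= 0 or step >= warmup_steps:
        return base_lr
    return base_lr * (step + 1) / warmup_steps


def clip_by_global_norm(grads, max_norm=10.0):
    """Rescale a flat list of gradient values so their L2 norm is <= max_norm.

    Raises FloatingPointError if the norm is NaN or inf, so a diverging
    step is caught before it is applied to the weights.
    """
    total_norm = math.sqrt(sum(g * g for g in grads))
    if not math.isfinite(total_norm):
        raise FloatingPointError("non-finite gradient norm: %r" % total_norm)
    scale = min(1.0, max_norm / (total_norm + 1e-12))
    return [g * scale for g in grads], total_norm


if __name__ == "__main__":
    # Learning rate during the first few steps versus after warmup.
    for step in (0, 249, 499, 1000):
        print(step, warmup_lr(step))

    # A gradient with norm 5.0 is scaled down to norm ~1.0.
    clipped, norm = clip_by_global_norm([3.0, 4.0], max_norm=1.0)
    print(norm, clipped)
```

In a real training loop the same idea would be applied to the framework's optimizer and parameter gradients. Whether it helps here depends on where the NaN originates, which this thread does not establish.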

xiao-song2022 · 2024-02-06T17:38:47Z

Thank you for your reply. The problem has been successfully resolved.
