In your code, if args.gradient_accumulation_steps > 1, loss.backward() will not be executed on every step. But loss.backward() should be executed on every step, so that gradients accumulate across steps; only optimizer.step() should be delayed.
The normal gradient accumulation process is as follows:
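A minimal PyTorch sketch of that standard pattern (the model, data, and `accumulation_steps` here are illustrative stand-ins; `accumulation_steps` plays the role of `args.gradient_accumulation_steps` in the repo):

```python
import torch
import torch.nn as nn

# Hypothetical tiny model and optimizer, just to show the control flow.
model = nn.Linear(4, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.MSELoss()
accumulation_steps = 4  # stand-in for args.gradient_accumulation_steps

optimizer.zero_grad()
for step in range(16):
    x = torch.randn(8, 4)
    y = torch.randn(8, 2)
    loss = loss_fn(model(x), y)
    # backward() runs on EVERY step; dividing by accumulation_steps
    # makes the accumulated gradient an average over the micro-batches.
    (loss / accumulation_steps).backward()
    # Only the optimizer update is deferred to every accumulation_steps steps.
    if (step + 1) % accumulation_steps == 0:
        optimizer.step()
        optimizer.zero_grad()
```

The key point is that only `optimizer.step()` and `zero_grad()` are gated on the step count; `backward()` itself must run unconditionally, or the intermediate micro-batches contribute nothing.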
Please correct me if I'm wrong.
Sorry for the late reply.
As you said, the gradient accumulation code is not implemented properly.
I originally intended to use it, but after getting a new GPU I no longer needed it, so I never finished the implementation.
Thank you for pointing this out. I will fix the code soon.