This repository was archived by the owner on Jun 4, 2025. It is now read-only.

Conversation

@eldarkurtic

When gradient accumulation is used, the effective batch size is `gradient_accumulation_steps` times larger.
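The point of the fix above is that with gradient accumulation, gradients from several micro-batches are summed before a single optimizer step, so each update effectively sees `micro_batch_size * gradient_accumulation_steps` examples. A minimal framework-free sketch (all names here are illustrative, not from this repository):

```python
# Illustrative sketch of gradient accumulation; `grad` stands in for a
# model's gradient from one micro-batch (plain numbers, no framework).

def train_steps(batches, gradient_accumulation_steps):
    """Accumulate gradients over N micro-batches, then take one optimizer step.

    Returns the number of optimizer steps taken."""
    accumulated = 0.0
    optimizer_steps = 0
    for i, grad in enumerate(batches, start=1):
        accumulated += grad  # sum gradients instead of stepping every batch
        if i % gradient_accumulation_steps == 0:
            optimizer_steps += 1  # one parameter update per accumulation window
            accumulated = 0.0     # reset for the next window
    return optimizer_steps

# 8 micro-batches accumulated in windows of 4 -> 2 optimizer steps,
# each update seeing an effective batch 4x the micro-batch size.
print(train_steps([1.0] * 8, 4))  # 2
```

This is why logging or scheduling logic keyed to "batch size" must multiply by the accumulation steps, which is the correction this PR makes.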

@bfineran bfineran left a comment


good catch

@markurtz markurtz merged commit c7b33f0 into neuralmagic:master Jan 24, 2022
KSGulin pushed a commit that referenced this pull request Mar 9, 2022
When gradient accumulation is used, the effective batch size is `gradient_accumulation_steps` times larger.

3 participants