fix for lr scheduler in distributed training #9103

sayantan1410 · 2024-08-06T19:11:45Z

What does this PR do?

Fix for LR in a distributed training when num_train_epoch is passed

Part of #8384

I have made changes to a single training script only.
Let me know if there are any mistakes, will be glad to fix those
@sayakpaul

sayakpaul · 2024-08-07T01:47:34Z

Thank you.

We need to fix the code quality before we can merge. Fixing instructions are available here:
https://github.com/huggingface/diffusers/actions/runs/10272588162/job/28437541407?pr=9103

HuggingFaceDocBuilderDev · 2024-08-07T01:51:04Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayantan1410 · 2024-08-07T18:59:54Z

Hi, Fixed the code quality, Can you please re-run the test.
@sayakpaul

sayakpaul · 2024-08-08T03:15:56Z

Thank you for your contribution.

* fix for lr scheduler in distributed training * Fixed the recalculation of the total training step section * Fixed lint error --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

sayantan1410 added 2 commits August 7, 2024 00:29

fix for lr scheduler in distributed training

c16b74d

Fixed the recalculation of the total training step section

cdda5e2

Fixed lint error

babaa37

sayakpaul added 2 commits August 8, 2024 08:31

Merge branch 'main' into main

6009de4

Merge branch 'main' into main

933efb2

sayakpaul merged commit 8e3affc into huggingface:main Aug 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix for lr scheduler in distributed training #9103

fix for lr scheduler in distributed training #9103

Uh oh!

sayantan1410 commented Aug 6, 2024 •

edited by sayakpaul

Loading

Uh oh!

sayakpaul commented Aug 7, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Aug 7, 2024

Uh oh!

sayantan1410 commented Aug 7, 2024

Uh oh!

sayakpaul commented Aug 8, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix for lr scheduler in distributed training #9103

fix for lr scheduler in distributed training #9103

Uh oh!

Conversation

sayantan1410 commented Aug 6, 2024 • edited by sayakpaul Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

sayakpaul commented Aug 7, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Aug 7, 2024

Uh oh!

sayantan1410 commented Aug 7, 2024

Uh oh!

sayakpaul commented Aug 8, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sayantan1410 commented Aug 6, 2024 •

edited by sayakpaul

Loading