[`distributed`] Fix early stopping and DP #254

younesbelkada · 2023-03-27T11:25:24Z

What does this PR do?

This PR fixes an issue where the training was hanging in multi-GPU setting and using earyl stopping
In fact, in a multi-GPU setting, we need to compute the average of policykl across all GPUs, and call early stopping

I can confirm this solution worked, the way I checked was to add a print after the self.optimizer_zero_grad and forcing the coefficient to be 0 :
if policykl > 0 * self.config.target_kl:

cc @edbeeching @lvwerra

4it [00:03,  1.36it/s]early stopping successfulearly stopping successful
5it [00:04,  1.21it/s]early stopping successful

HuggingFaceDocBuilderDev · 2023-03-27T11:30:09Z

The documentation is not available anymore as the PR was closed or merged.

trl/trainer/ppo_trainer.py

lvwerra · 2023-03-28T09:40:40Z

If it looks good for @edbeeching I am happy to merge it.

younesbelkada · 2023-03-28T11:41:01Z

Should have addressed the last comment by now!

edbeeching

LGTM! Thanks

younesbelkada added 2 commits March 27, 2023 11:22

fix ES DP

a52eb91

fix coef

373dc6e

younesbelkada added 2 commits March 27, 2023 11:37

wrap in a private method

833828f

fix value

7a968ac

younesbelkada requested review from lvwerra and edbeeching March 27, 2023 11:44

edbeeching reviewed Mar 28, 2023

View reviewed changes

trl/trainer/ppo_trainer.py Show resolved Hide resolved

fix trainer logic

55044d8

younesbelkada requested a review from edbeeching March 28, 2023 11:40

edbeeching approved these changes Mar 28, 2023

View reviewed changes

younesbelkada merged commit 237eb9c into main Mar 28, 2023

younesbelkada deleted the fix-early-stopping-dp branch March 28, 2023 12:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[`distributed`] Fix early stopping and DP #254

[`distributed`] Fix early stopping and DP #254

younesbelkada commented Mar 27, 2023 •

edited

HuggingFaceDocBuilderDev commented Mar 27, 2023 •

edited

lvwerra commented Mar 28, 2023

younesbelkada commented Mar 28, 2023

edbeeching left a comment

[distributed] Fix early stopping and DP #254

[distributed] Fix early stopping and DP #254

Conversation

younesbelkada commented Mar 27, 2023 • edited

What does this PR do?

HuggingFaceDocBuilderDev commented Mar 27, 2023 • edited

lvwerra commented Mar 28, 2023

younesbelkada commented Mar 28, 2023

edbeeching left a comment

Choose a reason for hiding this comment

[`distributed`] Fix early stopping and DP #254

[`distributed`] Fix early stopping and DP #254

younesbelkada commented Mar 27, 2023 •

edited

HuggingFaceDocBuilderDev commented Mar 27, 2023 •

edited