
Small changes when integrating into H4 #216

Merged
natolambert merged 2 commits into main from nol_nits on Mar 14, 2023
Conversation

@natolambert
Contributor

@natolambert natolambert commented Mar 14, 2023

Two changes:

  1. Pass the optimizer in the sentiment example (currently the variable was created but not passed into the trainer).
  2. [I think] fix the kwarg option for the wandb config of Accelerate. See this docs page, where init_kwargs is handled differently. When trying to use this with the code as is, wandb gets read as a kwarg and is not handled correctly by this line. If Tensorboard handles this differently, it may just be incompatible.

Let me know if I'm wrong!

Fixes: #215
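A minimal sketch of the bug in point 1 (the class and names here are hypothetical stand-ins, not the actual TRL API): an optimizer that is constructed but never handed to the trainer is silently ignored, and the trainer falls back to building its own default.

```python
# Hypothetical stand-in for a trainer that accepts an optional optimizer,
# illustrating why the variable must actually be passed in.

class ToyTrainer:
    def __init__(self, optimizer=None):
        # Fall back to a default optimizer when none is supplied.
        self.optimizer = optimizer if optimizer is not None else "default-adam"

custom_opt = "custom-adafactor"

# Before the fix: the optimizer is built but not passed, so the
# trainer quietly uses its default instead.
broken = ToyTrainer()
assert broken.optimizer == "default-adam"

# After the fix: pass the optimizer explicitly.
fixed = ToyTrainer(optimizer=custom_opt)
assert fixed.optimizer == "custom-adafactor"
```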

@natolambert
Contributor Author

Closes #215 if correct on point 1, @younesbelkada!

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Mar 14, 2023

The documentation is not available anymore as the PR was closed or merged.

@natolambert
Contributor Author

I tested the logging change with my code in H4 (https://github.com/huggingface/h4/pull/73), and it fixed my problem!

Contributor

@younesbelkada younesbelkada left a comment


Thanks a lot for the PR!
Agreed on the first point, great catch!
Regarding the second point, I have slight doubts that it may break things with tensorboard. If this is not vital, we can leave it for a follow-up PR to test it properly; otherwise, you can quickly test any script with log_with="tensorboard" and see if the training runs.
Thanks!

@natolambert
Contributor Author

I'll test tensorboard today. FYI this is needed for the script in H4, so I'll be motivated to get this working soon.

If tensorboard doesn't work, I'll prolly do an if statement.

@natolambert
Contributor Author

@younesbelkada I think I ran this with tensorboard (I just changed the config as follows and it didn't error). Seems good to me?

The tracker_kwargs term I changed was actually not used anywhere in TRL to date.

config = PPOConfig(
    model_name="ybelkada/gpt-j-6b-sharded-bf16",
    learning_rate=(1.47e-5) * 2,
    # log_with="wandb",
    log_with="tensorboard",
    accelerator_kwargs={"logging_dir": '/home/nathan/logs/'},
    batch_size=32,
    forward_batch_size=1,
)
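The crux of point 2 can be sketched in isolation (this is a hypothetical helper, not Accelerate's actual code): Accelerate's `init_trackers(..., init_kwargs=...)` expects a dict keyed by tracker name, e.g. `{"wandb": {...}}`, and each tracker should receive only its own sub-dict rather than the whole mapping as a stray `wandb=` keyword.

```python
# Hypothetical dispatch helper illustrating per-tracker init kwargs:
# each tracker looks up only the sub-dict registered under its own name.

def dispatch_init_kwargs(tracker_name, init_kwargs):
    # A flat dict like {"wandb": {...}} forwarded as generic kwargs would
    # instead reach the tracker as a bogus `wandb=` keyword argument;
    # nesting by tracker name avoids that.
    return init_kwargs.get(tracker_name, {})

init_kwargs = {"wandb": {"name": "ppo-run", "entity": "my-team"}}

# The wandb tracker gets its own kwargs; tensorboard gets none.
assert dispatch_init_kwargs("wandb", init_kwargs) == {"name": "ppo-run", "entity": "my-team"}
assert dispatch_init_kwargs("tensorboard", init_kwargs) == {}
```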

Contributor

@younesbelkada younesbelkada left a comment


Thanks a lot for fixing! 🔥

@younesbelkada
Contributor

Thanks a lot for experimenting @natolambert ! LGTM

@natolambert natolambert merged commit 357730f into main Mar 14, 2023
@natolambert natolambert deleted the nol_nits branch March 14, 2023 21:15
yxliu-TAMU pushed a commit to mincheolseong/ECEN743-GRPO-Project-Proposal that referenced this pull request Apr 20, 2025


Development

Successfully merging this pull request may close these issues.

Extra code in toxicity example

3 participants