Skip to content

Use uniform config#817

Merged
vwxyzjn merged 5 commits intohuggingface:mainfrom
vwxyzjn:new_config
Oct 9, 2023
Merged

Use uniform config#817
vwxyzjn merged 5 commits intohuggingface:mainfrom
vwxyzjn:new_config

Conversation

@vwxyzjn
Copy link
Copy Markdown
Contributor

@vwxyzjn vwxyzjn commented Sep 25, 2023

This PR converts various configs to the new format. Things should be backward-compatible at this PR.

I did not refactor stable_diffusion_tuning.py nor reward_trainer.py because that refactor might involve documentation changes. We should do them in separate PRs. For example, we would probably do python stable_diffusion_tuning.py --ddpo_config.train_batch_size 32 instead of python stable_diffusion_tuning.py --train_batch_size 32. Let's probably do them in separate PRs.

@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

HuggingFaceDocBuilderDev commented Sep 25, 2023

The documentation is not available anymore as the PR was closed or merged.

@vwxyzjn vwxyzjn requested a review from lvwerra September 25, 2023 20:08
Copy link
Copy Markdown
Member

@lvwerra lvwerra left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot @vwxyzjn - LGTM! I'll also let @younesbelkada have a look :)

@lvwerra lvwerra requested a review from younesbelkada October 2, 2023 09:33
Copy link
Copy Markdown
Contributor

@younesbelkada younesbelkada left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @vwxyzjn !

@vwxyzjn vwxyzjn merged commit 95aea7c into huggingface:main Oct 9, 2023
lapp0 pushed a commit to lapp0/trl that referenced this pull request May 10, 2024
* Use uniform config

* quick fix

* refactor

* update docs
yxliu-TAMU pushed a commit to mincheolseong/ECEN743-GRPO-Project-Proposal that referenced this pull request Apr 20, 2025
* Use uniform config

* quick fix

* refactor

* update docs
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants