Skip to content

Conversation

@dg845
Copy link
Collaborator

@dg845 dg845 commented Jun 23, 2023

What does this PR do?

This PR implements an example DDPO (project, paper, code) finetuning script as discussed in #3768.

Command to launch an experiment

(TODO)

Testing Machine

(TODO)

TODO

  • Get script to work
  • Add script tests
  • Add requirements.txt
  • Write README
  • Add to docs

Discussion

  • (TBD)

CC

@patrickvonplaten
@jannerm and @kvablack (authors of original paper and code)
@abhijitpal1247

@dg845 dg845 mentioned this pull request Jun 23, 2023
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@dg845 dg845 marked this pull request as draft June 23, 2023 09:54
@github-actions
Copy link
Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot added the stale Issues that haven't received updates label Jul 23, 2023
@github-actions github-actions bot closed this Jul 31, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

stale Issues that haven't received updates

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants