Fix DPOTrainer + PEFT 2 #1049

Merged: lvwerra merged 1 commit into huggingface:main from rdk31:main on Dec 1, 2023

@rdk31 (Contributor) commented Dec 1, 2023

The same problem as described in #877, just with the reference model.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@lvwerra requested a review from kashif December 1, 2023 16:00
@lvwerra added the 🏋 DPO label Dec 1, 2023
@kashif (Collaborator) left a comment

Nice! thank you! makes it clearer as well

@lvwerra merged commit a60ceef into huggingface:main Dec 1, 2023
lapp0 pushed a commit to lapp0/trl that referenced this pull request May 10, 2024
yxliu-TAMU pushed a commit to mincheolseong/ECEN743-GRPO-Project-Proposal that referenced this pull request Apr 20, 2025