Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add InstructionBacktranslation #486

Merged
merged 11 commits into from
Mar 27, 2024
Merged

Conversation

alvarobartt
Copy link
Member

@alvarobartt alvarobartt commented Mar 27, 2024

Description

This PR adds the InstructionBacktranslation task from the paper "Self Alignment with Instruction Backtranslation", that scores individual responses for given instructions, while also providing a reasoning for it.

Besides that this PR fixes a bug within ultrafeedback/helpfulness.jinja2 template, as it was using a wrong formatting for the instruction leading to it not being replaced on Template.render.

@alvarobartt alvarobartt added this to the 1.0.0 milestone Mar 27, 2024
@alvarobartt alvarobartt self-assigned this Mar 27, 2024
@alvarobartt alvarobartt marked this pull request as ready for review March 27, 2024 14:18
@gabrielmbmb gabrielmbmb merged commit 39bf306 into core-refactor Mar 27, 2024
4 checks passed
@gabrielmbmb gabrielmbmb deleted the instruction-back-translation branch March 27, 2024 15:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

None yet

2 participants