[Add Model] Pairwise Preference Model #123
Merged
Commits on May 9, 2024
- d7e664b: Added a PairPMPipeline class for running the pairwise preference model, along with an example of how to use the model.
- a374ec1
Commits on May 10, 2024
- f727b8c: The training and use of the models are similar to those in the SLiC paper, "SLiC-HF: Sequence Likelihood Calibration with Human Feedback".
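As a rough illustration of how a SLiC-HF-style pairwise ranker can be read out, the sketch below assumes the model emits one logit for the label "A" and one for the label "B" after seeing a prompt and two candidate responses. The function names and the two-logit interface are assumptions made for illustration; they are not taken from this PR's code.

```python
import math


def preference_probability(logit_a: float, logit_b: float) -> float:
    """Return P(response A is preferred) from the two label logits.

    Assumption: the ranker scores exactly two labels, "A" and "B".
    """
    # Two-way softmax; subtracting the max keeps exp() numerically stable.
    shift = max(logit_a, logit_b)
    exp_a = math.exp(logit_a - shift)
    exp_b = math.exp(logit_b - shift)
    return exp_a / (exp_a + exp_b)


def pick_preferred(logit_a: float, logit_b: float) -> str:
    """Return "A" or "B", breaking exact ties in favor of "A"."""
    return "A" if preference_probability(logit_a, logit_b) >= 0.5 else "B"
```

Equal logits give a probability of 0.5, and the decision depends only on the difference between the two logits.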
Commits on May 12, 2024
- b8b53e9: Update rewardbench/models/__init__.py
  Co-authored-by: Nathan Lambert <nathanl@allenai.org>
Commits on May 13, 2024
- a533f75
- e570d0d: The pipeline is now initialized with task, model, and tokenizer.
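The constructor shape described in that commit (task, model, tokenizer) might look roughly like the toy class below. Everything here is hypothetical: the class name, the prompt template, the task string, and the stub model and tokenizer are stand-ins chosen so the sketch runs on its own, and none of it is the actual PairPMPipeline from this PR.

```python
from typing import Callable, List, Tuple

# Hypothetical type aliases for the stubs used in this sketch.
Tokenizer = Callable[[str], List[str]]
PairModel = Callable[[List[str]], Tuple[float, float]]


class ToyPairPreferencePipeline:
    """Illustrative stand-in for a pairwise preference pipeline."""

    def __init__(self, task: str, model: PairModel, tokenizer: Tokenizer):
        self.task = task
        self.model = model
        self.tokenizer = tokenizer

    def __call__(self, prompt: str, response_a: str, response_b: str) -> bool:
        # Assumed template; a real model defines its own input format.
        text = f"[CONTEXT] {prompt} [RESPONSE A] {response_a} [RESPONSE B] {response_b}"
        logit_a, logit_b = self.model(self.tokenizer(text))
        # True means response A is preferred over response B.
        return logit_a > logit_b


def toy_tokenizer(text: str) -> List[str]:
    """Whitespace tokenizer used only to keep the example self-contained."""
    return text.split()


def toy_model(tokens: List[str]) -> Tuple[float, float]:
    """Stub that always favors A; a real model would compute both logits."""
    return (1.0, 0.0)


pipe = ToyPairPreferencePipeline("pair-preference", toy_model, toy_tokenizer)
prefers_a = pipe("What is 2 + 2?", "4", "5")
```

Passing the model and tokenizer in as arguments, rather than loading them inside the class, makes it easy to swap in stubs like these for testing.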
Commits on May 14, 2024
- d66b833
- a9f8217
- 098aef6