-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add RLHF Reward Trainer and Loss #3435
Closed
Closed
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Unit Test Results 6 files ± 0 6 suites ±0 42m 52s ⏱️ - 36m 46s For more details on these failures, see this check. Results for commit 439ec2a. ± Comparison against base commit 9112470. This pull request removes 33 and adds 2779 tests. Note that renamed tests count towards both.
This pull request removes 4 skipped tests and adds 9 skipped tests. Note that renamed tests count towards both.
♻️ This comment has been updated with latest results. |
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 13, 2023 13:23
1f51206
to
936194d
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 13, 2023 13:43
446cf0f
to
becf832
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 14, 2023 20:43
3279c22
to
316e2bf
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 14, 2023 23:34
9b64a1a
to
dd675d6
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 14, 2023 23:40
b5d61b9
to
cae42ad
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 14, 2023 23:43
336b52b
to
a0808cf
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 15, 2023 00:00
b9b379e
to
9b46959
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 15, 2023 02:01
132af98
to
a93a1b8
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 15, 2023 02:07
f6c84b1
to
e311741
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 15, 2023 02:13
55e8d2b
to
1662827
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 15, 2023 02:37
0750f2d
to
4860265
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 20, 2023 14:26
439ec2a
to
82bbffe
Compare
asdataminer
force-pushed
the
rlhf_reward_loss
branch
from
June 20, 2023 15:04
c2527f0
to
4f852ea
Compare
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Code Pull Requests
Please provide the following:
Documentation Pull Requests
Note that the documentation HTML files are in
docs/
while the Markdown sources are inmkdocs/docs
.If you are proposing a modification to the documentation you should change only the Markdown files.
api.md
is automatically generated from the docstrings in the code, so if you want to change something in that file, first modifyludwig/api.py
docstring, then runmkdocs/code_docs_autogen.py
, which will createmkdocs/docs/api.md
.