[RLlib] PPO runs with EnvRunner w/o old Policy API (also solves KL issues with PPORLModules). #39732