[RLlib] Policies get/set_state fixes and enhancements. #16354

sven1977 · 2021-06-10T12:11:13Z

Policies currently do not properly return their exploration state when calling Policy.get_state(). This PR adds the exploration state to the return value of policy.get_state(). Exploration.get_info() has been renamed into get_state() (backward compatible). A new Exploration.set_state() method has been added, which is used by Policy.set_state().

This is in preparation of:

making policies addable to/deletable from a worker's policy_map in-flight
self-play with 100s of policy snapshots
league-based training

This may also fix:
#16065

Why are these changes needed?

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

…cy_fix_get_set_state

wip

86c96df

sven1977 requested a review from michaelzhiluo June 10, 2021 12:11

sven1977 assigned michaelzhiluo Jun 10, 2021

sven1977 added 7 commits June 10, 2021 14:12

wip

49475aa

fix

8547b70

fix

db752dd

fix.

d39a270

fix and LINT.

339b46a

Merge branch 'master' of https://github.com/ray-project/ray into poli…

6584e97

…cy_fix_get_set_state

fix

7c63068

sven1977 added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Jun 14, 2021

sven1977 mentioned this pull request Jun 14, 2021

[RLlib] Restored DQNTrainer cannot solve the environment it was trained on. #16065

Closed

2 tasks

michaelzhiluo approved these changes Jun 15, 2021

View reviewed changes

sven1977 merged commit d0014cd into ray-project:master Jun 15, 2021

laphang mentioned this pull request Jun 29, 2021

[rllib] test reward much lower than training reward on parametric DQN #15162

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Policies get/set_state fixes and enhancements. #16354

[RLlib] Policies get/set_state fixes and enhancements. #16354

sven1977 commented Jun 10, 2021 •

edited

Loading

[RLlib] Policies get/set_state fixes and enhancements. #16354

[RLlib] Policies get/set_state fixes and enhancements. #16354

Conversation

sven1977 commented Jun 10, 2021 • edited Loading

Why are these changes needed?

Related issue number

Checks

sven1977 commented Jun 10, 2021 •

edited

Loading