[rllib] Improve test learning check, fix flaky two step qmix #16843

krfricke · 2021-07-02T14:01:34Z

Why are these changes needed?

two step qmix was previously very flaky.

The core test has been fixed by providing a seed. For this to work, passing the seed in a GroupAgentWrapper had to be enabled.

Secondly, learning test checks recently used analysis.trials[0].last_result to check for success. This PR introduces a change, that collects all trials results and picks the best achieved reward. This means that the test will always succeed if at least one of the trials achieved the desired performance.

This won't make a difference for single trial runs.
This is usually good for parameter sweeps, as we expect at least one trial to achieve good performance, not all
This might not be suitable for some tests that require that all trials achieve good performance. However, since the test was previously just about testing that the first trial achieved good performance, this doesn't seem like a regression to me.

cc @ericl, @sven1977 and @richardliaw for your opinion on this change.

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

# Conflicts: # rllib/BUILD

richardliaw · 2021-07-06T18:08:38Z

This change looks fine to me.

krfricke · 2021-07-06T18:39:06Z

I'll merge because I don't see how this should lead to any regressions. We can revert if someone has concerns.

Kai Fricke added 2 commits July 2, 2021 14:56

[rllib] Improve test learning check, fix flaky two step qmix

438983d

Merge branch 'master' into rllib-flaky

eb06778

# Conflicts: # rllib/BUILD

krfricke requested a review from sven1977 July 2, 2021 14:01

krfricke assigned sven1977 Jul 2, 2021

Check for seed attribute in child env

dff3190

richardliaw approved these changes Jul 6, 2021

View reviewed changes

krfricke merged commit 10fd711 into ray-project:master Jul 6, 2021

krfricke deleted the rllib-flaky branch July 6, 2021 18:39

jiaodong pushed a commit that referenced this pull request Jul 11, 2021

[rllib] Improve test learning check, fix flaky two step qmix (#16843)

6cf4727

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[rllib] Improve test learning check, fix flaky two step qmix #16843

[rllib] Improve test learning check, fix flaky two step qmix #16843

krfricke commented Jul 2, 2021

richardliaw commented Jul 6, 2021

krfricke commented Jul 6, 2021

[rllib] Improve test learning check, fix flaky two step qmix #16843

[rllib] Improve test learning check, fix flaky two step qmix #16843

Conversation

krfricke commented Jul 2, 2021

Why are these changes needed?

Related issue number

Checks

richardliaw commented Jul 6, 2021

krfricke commented Jul 6, 2021