[RLlib] Unify pattern of examples and learning tests. #45023

sven1977 · 2024-04-28T18:09:21Z

Unify examples, tuned_examples (learning tests), and release tests into a common pattern of execution.

Config files should become python executable scripts.
All these scripts/config files should have a common command line arg schema, supporting common arguments such as --wandb-key, --num-env-runners, --num-gpus, --env, etc..
All these scripts/config should be runnable as tests, meaning they should all define a) one or more stopping criteria, and b) (optionsl) a passing criterium. If b) is not provided, RLlib will automatically try to find a good passing criterium in the stopping criterium (try eval/episode_reward_mean first, then try episode_reward_mean)
Cleanup all BUILD learning_tests and tag old API stack ones as such. Remove the "_envrunner" suffix from the new API stack tuned_examples, b/c new stack should become more and more the norm.
Move a single release test (pong_ppo) into this new format as well (for testing purposes).

Why are these changes needed?

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…y_pattern_of_examples_and_learning_tests

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…y_pattern_of_examples_and_learning_tests

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…y_pattern_of_examples_and_learning_tests

simonsays1980

LGTM. The great thing about it is we can finally test all tuned examples without alw<s putting a tune.Tuner in :)

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…y_pattern_of_examples_and_learning_tests

Signed-off-by: sven1977 <svenmika1977@gmail.com>

aslonnie

approval for docs change.

simonsays1980

LGTM.

simonsays1980 · 2024-05-08T10:12:43Z

rllib/tuned_examples/ppo/multi_agent_pendulum_ppo.py

+parser = add_rllib_example_script_args()
+# Use `parser` to add your own custom command line options to this script
+# and (if needed) use their values toset up `config` below.
+args = parser.parse_args()

 register_env("multi_agent_pendulum", lambda _: MultiAgentPendulum({"num_agents": 2}))


@sven1977 I think we must use here the "num_agents": args.num_agents?

…y_pattern_of_examples_and_learning_tests Signed-off-by: sven1977 <svenmika1977@gmail.com> # Conflicts: # rllib/BUILD

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…y_pattern_of_examples_and_learning_tests

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…y_pattern_of_examples_and_learning_tests

…y_pattern_of_examples_and_learning_tests Signed-off-by: sven1977 <svenmika1977@gmail.com> # Conflicts: # rllib/tuned_examples/dqn/cartpole_dqn.py # rllib/tuned_examples/ppo/cartpole-ppo-fake-gpus.yaml # rllib/tuned_examples/ppo/cartpole-ppo-grid-search-example.yaml # rllib/tuned_examples/ppo/cartpole-ppo-hyperband.yaml # rllib/tuned_examples/ppo/cartpole_ppo.py # rllib/tuned_examples/ppo/cartpole_truncated_ppo.py # rllib/tuned_examples/ppo/multi_agent_pendulum_ppo.py # rllib/tuned_examples/ppo/pendulum_ppo.py # rllib/tuned_examples/ppo/recomm-sys001-ppo.yaml # rllib/tuned_examples/sac/pendulum_sac.py

…y_pattern_of_examples_and_learning_tests

…y_pattern_of_examples_and_learning_tests Signed-off-by: sven1977 <svenmika1977@gmail.com> # Conflicts: # rllib/tuned_examples/dqn/cartpole_dqn.py # rllib/tuned_examples/ppo/cartpole-ppo-fake-gpus.yaml # rllib/tuned_examples/ppo/cartpole-ppo-grid-search-example.yaml # rllib/tuned_examples/ppo/cartpole-ppo-hyperband.yaml # rllib/tuned_examples/ppo/cartpole_ppo.py # rllib/tuned_examples/ppo/cartpole_truncated_ppo.py # rllib/tuned_examples/ppo/multi_agent_pendulum_ppo.py # rllib/tuned_examples/ppo/pendulum_ppo.py # rllib/tuned_examples/ppo/recomm-sys001-ppo.yaml # rllib/tuned_examples/sac/pendulum_sac.py

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…y_pattern_of_examples_and_learning_tests

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…y_pattern_of_examples_and_learning_tests Signed-off-by: sven1977 <svenmika1977@gmail.com> # Conflicts: # rllib/env/env_runner_group.py # rllib/utils/test_utils.py

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 added 2 commits April 28, 2024 19:28

wip

06a1b93

Signed-off-by: sven1977 <svenmika1977@gmail.com>

wip

4d7e75e

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 assigned simonsays1980 Apr 28, 2024

sven1977 marked this pull request as ready for review April 28, 2024 18:09

sven1977 requested review from avnishn, ArturNiederfahrenhorst, maxpumperla, kouroshHakha and simonsays1980 as code owners April 28, 2024 18:09

sven1977 added 7 commits April 29, 2024 09:20

wip

6f23a60

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Merge branch 'master' of https://github.com/ray-project/ray into unif…

02a07cc

…y_pattern_of_examples_and_learning_tests

wip

2372430

Signed-off-by: sven1977 <svenmika1977@gmail.com>

wip

5a7e7b3

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Merge branch 'master' of https://github.com/ray-project/ray into unif…

6a3a089

…y_pattern_of_examples_and_learning_tests

wip

69339dd

Signed-off-by: sven1977 <svenmika1977@gmail.com>

fixes

58d3265

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 added rllib RLlib related issues rllib-newstack labels May 2, 2024

sven1977 added 2 commits May 3, 2024 09:20

fixes

2fad65e

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Merge branch 'master' of https://github.com/ray-project/ray into unif…

6a2d73d

…y_pattern_of_examples_and_learning_tests

simonsays1980 approved these changes May 3, 2024

View reviewed changes

sven1977 added 3 commits May 3, 2024 12:12

fixes

8416167

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Merge branch 'master' of https://github.com/ray-project/ray into unif…

7303f14

…y_pattern_of_examples_and_learning_tests

wip

e734f2f

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 requested a review from a team as a code owner May 3, 2024 13:29

aslonnie approved these changes May 3, 2024

View reviewed changes

simonsays1980 approved these changes May 8, 2024

View reviewed changes

sven1977 added 3 commits May 10, 2024 21:27

Merge branch 'master' of https://github.com/ray-project/ray into unif…

90f87bf

…y_pattern_of_examples_and_learning_tests Signed-off-by: sven1977 <svenmika1977@gmail.com> # Conflicts: # rllib/BUILD

fixes

8df141e

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Merge branch 'master' of https://github.com/ray-project/ray into unif…

86d5a5f

…y_pattern_of_examples_and_learning_tests

sven1977 added 13 commits May 13, 2024 14:33

Merge branch 'master' of https://github.com/ray-project/ray into unif…

37d26de

…y_pattern_of_examples_and_learning_tests

Merge branch 'master' of https://github.com/ray-project/ray into unif…

51711d7

…y_pattern_of_examples_and_learning_tests

Merge branch 'master' of https://github.com/ray-project/ray into unif…

b6bd054

…y_pattern_of_examples_and_learning_tests

fixes

ffac4a1

Signed-off-by: sven1977 <svenmika1977@gmail.com>

fix

e8f8874

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Merge branch 'master' of https://github.com/ray-project/ray into unif…

e3bb696

…y_pattern_of_examples_and_learning_tests

Merge branch 'master' of https://github.com/ray-project/ray into unif…

cd5fb61

…y_pattern_of_examples_and_learning_tests

wip

0c94ed4

Signed-off-by: sven1977 <svenmika1977@gmail.com>

wip

78d9c81

Signed-off-by: sven1977 <svenmika1977@gmail.com>

wip

6144115

Signed-off-by: sven1977 <svenmika1977@gmail.com>

wip

1cec156

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 enabled auto-merge (squash) May 16, 2024 02:29

github-actions bot added the go add ONLY when ready to merge, run all tests label May 16, 2024

sven1977 added 2 commits May 16, 2024 16:57

Merge branch 'master' of https://github.com/ray-project/ray into unif…

37ba669

…y_pattern_of_examples_and_learning_tests

test buildkite no file size limit for error stack printouts

b2d4196

Signed-off-by: sven1977 <svenmika1977@gmail.com>

github-actions bot disabled auto-merge May 16, 2024 14:59

sven1977 added 4 commits May 16, 2024 19:17

fixes

f610c90

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Merge branch 'master' of https://github.com/ray-project/ray into unif…

a073cb9

…y_pattern_of_examples_and_learning_tests Signed-off-by: sven1977 <svenmika1977@gmail.com> # Conflicts: # rllib/env/env_runner_group.py # rllib/utils/test_utils.py

fix

427a393

Signed-off-by: sven1977 <svenmika1977@gmail.com>

fix

5ac423a

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 enabled auto-merge (squash) May 16, 2024 21:51

sven1977 merged commit 2329466 into ray-project:master May 16, 2024
7 checks passed

sven1977 deleted the unify_pattern_of_examples_and_learning_tests branch May 17, 2024 03:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Unify pattern of examples and learning tests. #45023

[RLlib] Unify pattern of examples and learning tests. #45023

sven1977 commented Apr 28, 2024 •

edited

Loading

simonsays1980 left a comment

aslonnie left a comment

simonsays1980 left a comment

simonsays1980 May 8, 2024

sven1977 May 13, 2024

[RLlib] Unify pattern of examples and learning tests. #45023

[RLlib] Unify pattern of examples and learning tests. #45023

Conversation

sven1977 commented Apr 28, 2024 • edited Loading

Why are these changes needed?

Related issue number

Checks

simonsays1980 left a comment

Choose a reason for hiding this comment

aslonnie left a comment

Choose a reason for hiding this comment

simonsays1980 left a comment

Choose a reason for hiding this comment

simonsays1980 May 8, 2024

Choose a reason for hiding this comment

sven1977 May 13, 2024

Choose a reason for hiding this comment

sven1977 commented Apr 28, 2024 •

edited

Loading