[RLlib] Add simple curriculum learning API and example script. #15740

sven1977 · 2021-05-11T16:19:32Z

This PR adds:

A simple curriculum learning API to execute a configurable function (optional) at the end of each training iteration
that determines, whether the env should be set to a new task.
Formalizes the already existing "task-get/set" API used in MAML to be generalized for curriculum learning (and e.g. MAML-style) task setting.
Adds example script and test case.

Why are these changes needed?

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

…iculum_learning_api

richardliaw · 2021-05-14T16:42:17Z

rllib/agents/trainer_template.py

+
+            # Check `env_task_fn` for possible update of the env's task.
+            if self.config["env_task_fn"] is not None:
+                assert callable(self.config["env_task_fn"])


assertions in code can be a bit unwieldy, especially when the users see it.

I would consider writing a special check function that also prints a user friendly message.

richardliaw · 2021-05-14T16:43:13Z

rllib/examples/curriculum_learning.py

+parser.add_argument("--stop-iters", type=int, default=50)
+parser.add_argument("--stop-timesteps", type=int, default=200000)
+parser.add_argument("--stop-reward", type=float, default=10000.0)


add helpstring?

Done. I'll do this for all the other example scripts in a separate PR.

richardliaw · 2021-05-14T16:44:07Z

rllib/examples/curriculum_learning.py

+
+    if args.as_test:
+        check_learning_achieved(results, args.stop_reward)
+    ray.shutdown()


you don't need this right?

in local case, ray will shutdown when process exits

in remote case, ray will disconnect when process exits

I remember a time when not calling shutdown at the end of tests would lead to re-init errors when we run these tests in e.g. the CI.

richardliaw

nice!

richardliaw

consider highlighting this in a FAQ

…iculum_learning_api

sven1977 added 5 commits May 4, 2021 11:33

wip

6a4a0d4

wip.

864b3ad

Merge branch 'master' of https://github.com/ray-project/ray into curr…

aa6087d

…iculum_learning_api

LINT.

2c65bc7

Merge branch 'master' of https://github.com/ray-project/ray into curr…

92a5686

…iculum_learning_api

sven1977 requested a review from michaelzhiluo May 11, 2021 16:19

sven1977 assigned michaelzhiluo May 11, 2021

sven1977 added 3 commits May 11, 2021 18:22

wip

90826d5

wip

61768c1

Merge branch 'master' of https://github.com/ray-project/ray into curr…

b4cba75

…iculum_learning_api

sven1977 added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label May 13, 2021

richardliaw reviewed May 14, 2021

View reviewed changes

richardliaw approved these changes May 14, 2021

View reviewed changes

sven1977 added 5 commits May 16, 2021 12:23

Merge branch 'master' of https://github.com/ray-project/ray into curr…

8140870

…iculum_learning_api

wip.

d685d37

Merge branch 'master' of https://github.com/ray-project/ray into curr…

afe1356

…iculum_learning_api

wip.

2e58b40

wip.

d18c21e

sven1977 merged commit d89fb82 into ray-project:master May 16, 2021

sven1977 deleted the curriculum_learning_api branch June 2, 2023 20:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Add simple curriculum learning API and example script. #15740

[RLlib] Add simple curriculum learning API and example script. #15740

sven1977 commented May 11, 2021 •

edited

Loading

richardliaw May 14, 2021

sven1977 May 16, 2021

richardliaw May 14, 2021

sven1977 May 16, 2021

richardliaw May 14, 2021

sven1977 May 16, 2021

richardliaw left a comment

richardliaw left a comment

[RLlib] Add simple curriculum learning API and example script. #15740

[RLlib] Add simple curriculum learning API and example script. #15740

Conversation

sven1977 commented May 11, 2021 • edited Loading

Why are these changes needed?

Related issue number

Checks

richardliaw May 14, 2021

Choose a reason for hiding this comment

sven1977 May 16, 2021

Choose a reason for hiding this comment

richardliaw May 14, 2021

Choose a reason for hiding this comment

sven1977 May 16, 2021

Choose a reason for hiding this comment

richardliaw May 14, 2021

Choose a reason for hiding this comment

sven1977 May 16, 2021

Choose a reason for hiding this comment

richardliaw left a comment

Choose a reason for hiding this comment

richardliaw left a comment

Choose a reason for hiding this comment

sven1977 commented May 11, 2021 •

edited

Loading