[RLlib] Activate DreamerV3 weekly release test (on Pong-v5 with the 100k setup). #45654
Conversation
LGTM: Very excited about the tests.
)
.env_runners(
    num_env_runners=(args.num_env_runners or 0),
Why is it that we use only a single env runner to collect new samples?
The training ratio for this setup (Atari 100k) is 1024, which is huge anyway.
For every single env step sampled, you update all models on one batch of 1024 timesteps (B=16 x T=64) drawn from the buffer.
So parallelizing env collection doesn't make sense; you don't gain any performance from it. DreamerV3 is all about parallelizing on the learner side.
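A quick back-of-the-envelope sketch of the cadence described above (the variable names mirror the B=16 x T=64 batch and the 1024 training ratio from the comment; they are illustrative, not the exact RLlib config fields):

```python
# Back-of-the-envelope check of the Atari-100k DreamerV3 cadence
# described above (values taken from the comment, names illustrative).
batch_size_B = 16      # parallel sequences per training batch
batch_length_T = 64    # timesteps per sequence
training_ratio = 1024  # replayed timesteps per sampled env timestep

# One training batch already contains 16 * 64 = 1024 replayed timesteps.
replayed_per_update = batch_size_B * batch_length_T
print(replayed_per_update)  # 1024

# At a training ratio of 1024, a single sampled env step "pays for"
# one full update, so one env runner keeps the learner saturated and
# additional runners would add no throughput.
env_steps_per_update = replayed_per_update / training_ratio
print(env_steps_per_update)  # 1.0
```

This is why the learner side, not sampling, is the bottleneck in this setup.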
Interesting. I would have thought that rolling out the new policy more broadly would teach you more about the environment, which in turn would improve the dynamics model faster and let it dream better.
Thanks for the clarification @sven1977
runtime_env:
  - RLLIB_TEST_NO_JAX_IMPORT=1
  - LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/ray/.mujoco/mujoco210/bin
cluster_compute: 1gpu_4cpus.yaml
This is using p2.xlarge,
which is about $1/hour; from a cost perspective, I think it's absolutely fine to use this node for 12 hours per week.
From the weekly-release perspective, we are thinking about reducing many 24-hour tests to 8-hour tests, but there's no concrete plan yet, so this should not be a blocker.
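A rough cost check of the figures quoted above (the $1/hour rate is the approximate number from the comment, not a quoted AWS price):

```python
# Rough weekly cost of running the release test on a p2.xlarge,
# using the approximate $1/hour figure from the comment above.
hourly_rate_usd = 1.0
hours_per_week = 12
weekly_cost_usd = hourly_rate_usd * hours_per_week
print(weekly_cost_usd)  # 12.0 (USD per week)
```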
Signed-off-by: sven1977 <svenmika1977@gmail.com>
(leaving for @can-anyscale to review the release test)
…mer_v3_add_release_test Signed-off-by: sven1977 <svenmika1977@gmail.com> # Conflicts: # rllib/tuned_examples/dreamerv3/atari_100k.py
…mer_v3_add_release_test
Signed-off-by: sven1977 <svenmika1977@gmail.com>
…00k setup). (ray-project#45654) Signed-off-by: Ryan O'Leary <ryanaoleary@google.com>
…00k setup). (ray-project#45654) Signed-off-by: Richard Liu <ricliu@google.com>
Activate DreamerV3 weekly release test (on Pong-v5 with the 100k setup).
Why are these changes needed?
Related issue number
Checks
- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- If I have added a method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.