[RLlib] Fixed 'rollout_fragment_length' in pong-example by setting it to 'auto'. #39552

simonsays1980 · 2023-09-11T17:55:44Z

Why are these changes needed?

Pong example did not run due to a rollout_fragment_length that did not fit the train_batch_size. By setting the rollout_fragment_length to auto the rollout_fragment_length adapts to the train_batch_size.

Related issue number

Closes #38968

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Simon Zehnder <simon.zehnder@gmail.com>

ArturNiederfahrenhorst

Great! Thanks for the fix

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>

rllib/BUILD

Signed-off-by: Simon Zehnder <simon.zehnder@gmail.com>

simonsays1980 · 2023-09-13T07:47:36Z

@ArturNiederfahrenhorst Looking at the failed tests I think that the test is running on the wrong cluster: it requests a GPU (in the YAML) but there is none.

Shall we add "gpu" to the tags of the Pong test?

rllib/BUILD

Signed-off-by: Simon Zehnder <simon.zehnder@gmail.com>

sven1977

LGTM. Thanks! :)

ArturNiederfahrenhorst · 2023-09-13T18:07:43Z

@sven1977 I was the one who added the test. The reason we stumbled across this issue was that we don't test this code today - the issue was raised by a user who read the docs. In fact, all of our fine-tuned examples listed under doc/source/rllib/rllib-algorithms.rst are not tested.
Imho, they should be release tests given that RLlib's most used Algorithm is PPO by a good margin and that we refer to these as examples of it in our documentation.

ArturNiederfahrenhorst · 2023-09-13T18:09:28Z

I agree that the non-gpu learning test is not a good place, but instead, there should be another place where we execute this test. After all, if we can't prove learning on it, it should not be referred to as a fine-tuned example

ArturNiederfahrenhorst · 2023-09-13T18:22:53Z

I've opened an issue #39639

… to 'auto'. (ray-project#39552) Signed-off-by: Simon Zehnder <simon.zehnder@gmail.com>

… to 'auto'. (ray-project#39552) Signed-off-by: Victor <vctr.y.m@example.com>

Fixed 'rollout_fragment_length' in pong-example by setting it to 'auto'.

96ccc7f

Signed-off-by: Simon Zehnder <simon.zehnder@gmail.com>

simonsays1980 requested review from sven1977, gjoliver, avnishn, ArturNiederfahrenhorst, smorad, maxpumperla, kouroshHakha and krfricke as code owners September 11, 2023 17:55

ArturNiederfahrenhorst approved these changes Sep 12, 2023

View reviewed changes

Test pong learning PPO

0a9b440

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>

ArturNiederfahrenhorst reviewed Sep 12, 2023

View reviewed changes

rllib/BUILD Outdated Show resolved Hide resolved

simonsays1980 requested review from edoakes, shrekris-anyscale, sihanwang41, zcin, architkulkarni, a team, richardliaw, xwjiang2010, amogkam, matthewdeng, Yard1, wuisawesome, DmitriGekhtman, pcmoritz, kevin85421, ericl, scv119 and c21 as code owners September 12, 2023 22:36

simonsays1980 requested review from scottjlee, bveeramani, raulchen and a team as code owners September 12, 2023 22:36

simonsays1980 force-pushed the pong-benchmark-value-error branch 2 times, most recently from 8663b28 to 0a9b440 Compare September 12, 2023 23:06

simonsays1980 added 2 commits September 13, 2023 01:09

Fixed little typo in regression tests for PPO Pong.

1f7398c

Signed-off-by: Simon Zehnder <simon.zehnder@gmail.com>

Merge branch 'master' into pong-benchmark-value-error

cf85d33

Signed-off-by: Simon Zehnder <simon.zehnder@gmail.com>

sven1977 reviewed Sep 13, 2023

View reviewed changes

rllib/BUILD Outdated Show resolved Hide resolved

sven1977 changed the title ~~Fixed 'rollout_fragment_length' in pong-example by setting it to 'auto'.~~ [RLlib] Fixed 'rollout_fragment_length' in pong-example by setting it to 'auto'. Sep 13, 2023

Removed pong test from BUILD, following @sven1977 's comment.

ea9ec5c

Signed-off-by: Simon Zehnder <simon.zehnder@gmail.com>

sven1977 approved these changes Sep 13, 2023

View reviewed changes

sven1977 merged commit 8d80377 into ray-project:master Sep 13, 2023
37 of 41 checks passed

simonsays1980 added a commit to simonsays1980/ray that referenced this pull request Sep 15, 2023

[RLlib] Fixed 'rollout_fragment_length' in pong-example by setting it…

96ead13

… to 'auto'. (ray-project#39552) Signed-off-by: Simon Zehnder <simon.zehnder@gmail.com>

vymao pushed a commit to vymao/ray that referenced this pull request Oct 11, 2023

[RLlib] Fixed 'rollout_fragment_length' in pong-example by setting it…

a745c21

… to 'auto'. (ray-project#39552) Signed-off-by: Victor <vctr.y.m@example.com>

rickyyx mentioned this pull request Oct 24, 2023

[core] microbenchmark regression #40606

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Fixed 'rollout_fragment_length' in pong-example by setting it to 'auto'. #39552

[RLlib] Fixed 'rollout_fragment_length' in pong-example by setting it to 'auto'. #39552

simonsays1980 commented Sep 11, 2023 •

edited

ArturNiederfahrenhorst left a comment

simonsays1980 commented Sep 13, 2023 •

edited

sven1977 left a comment

ArturNiederfahrenhorst commented Sep 13, 2023 •

edited

ArturNiederfahrenhorst commented Sep 13, 2023

ArturNiederfahrenhorst commented Sep 13, 2023

[RLlib] Fixed 'rollout_fragment_length' in pong-example by setting it to 'auto'. #39552

[RLlib] Fixed 'rollout_fragment_length' in pong-example by setting it to 'auto'. #39552

Conversation

simonsays1980 commented Sep 11, 2023 • edited

Why are these changes needed?

Related issue number

Checks

ArturNiederfahrenhorst left a comment

Choose a reason for hiding this comment

simonsays1980 commented Sep 13, 2023 • edited

sven1977 left a comment

Choose a reason for hiding this comment

ArturNiederfahrenhorst commented Sep 13, 2023 • edited

ArturNiederfahrenhorst commented Sep 13, 2023

ArturNiederfahrenhorst commented Sep 13, 2023

simonsays1980 commented Sep 11, 2023 •

edited

simonsays1980 commented Sep 13, 2023 •

edited

ArturNiederfahrenhorst commented Sep 13, 2023 •

edited