Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic #11056

Merged

Conversation

sven1977
Copy link
Contributor

@sven1977 sven1977 commented Sep 27, 2020

Trajectory view API:

  • Simple List Collector (on by default for PPO);
  • LSTM-agnostic

This finishes the following part of the Traj. View milestone:
Issue #10879

Closes Issue #10879

Checks

  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

@@ -59,6 +59,14 @@ def validate_config(config):
logger.warning("`sample_async=True` is not supported for PyTorch! "
"Multithreading can lead to crashes.")

# Switch on trajectory view API by default for this algo (if None).
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Switch on by default for A2/3C.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please set this in the A2C config instead of in the code.

@@ -33,6 +36,24 @@
# yapf: enable


def validate_config(config: TrainerConfigDict) -> None:
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Switch on by default for PG.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please set this in the algorithm config instead of in the code, here and elsewhere.

@ericl ericl added the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label Sep 28, 2020
@ericl
Copy link
Contributor

ericl commented Sep 28, 2020

LGTM, let's get it merged!

@@ -89,6 +89,9 @@
# Whether to fake GPUs (using CPUs).
# Set this to True for debugging on non-GPU machines (set `num_gpus` > 0).
"_fake_gpus": False,
# Switch on Trajectory View API for A2/3C by default.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

PPO

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

@@ -27,6 +30,9 @@
"num_workers": 0,
# Learning rate.
"lr": 0.0004,
# Switch on Trajectory View API for A2/3C by default.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

PG

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

@sven1977
Copy link
Contributor Author

Fixing last failing tests. ...

@sven1977 sven1977 merged commit 36bda84 into ray-project:master Oct 1, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
@author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants