-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[RLlib] Unify pattern of examples and learning tests. #45023
[RLlib] Unify pattern of examples and learning tests. #45023
Conversation
…y_pattern_of_examples_and_learning_tests
…y_pattern_of_examples_and_learning_tests
…y_pattern_of_examples_and_learning_tests
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. The great thing about it is we can finally test all tuned examples without alw<s putting a tune.Tuner
in :)
…y_pattern_of_examples_and_learning_tests
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
approval for docs change.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
parser = add_rllib_example_script_args() | ||
# Use `parser` to add your own custom command line options to this script | ||
# and (if needed) use their values toset up `config` below. | ||
args = parser.parse_args() | ||
|
||
register_env("multi_agent_pendulum", lambda _: MultiAgentPendulum({"num_agents": 2})) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@sven1977 I think we must use here the "num_agents": args.num_agents
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fixed
…y_pattern_of_examples_and_learning_tests Signed-off-by: sven1977 <svenmika1977@gmail.com> # Conflicts: # rllib/BUILD
…y_pattern_of_examples_and_learning_tests
…y_pattern_of_examples_and_learning_tests
…y_pattern_of_examples_and_learning_tests
…y_pattern_of_examples_and_learning_tests
…y_pattern_of_examples_and_learning_tests
…y_pattern_of_examples_and_learning_tests Signed-off-by: sven1977 <svenmika1977@gmail.com> # Conflicts: # rllib/tuned_examples/dqn/cartpole_dqn.py # rllib/tuned_examples/ppo/cartpole-ppo-fake-gpus.yaml # rllib/tuned_examples/ppo/cartpole-ppo-grid-search-example.yaml # rllib/tuned_examples/ppo/cartpole-ppo-hyperband.yaml # rllib/tuned_examples/ppo/cartpole_ppo.py # rllib/tuned_examples/ppo/cartpole_truncated_ppo.py # rllib/tuned_examples/ppo/multi_agent_pendulum_ppo.py # rllib/tuned_examples/ppo/pendulum_ppo.py # rllib/tuned_examples/ppo/recomm-sys001-ppo.yaml # rllib/tuned_examples/sac/pendulum_sac.py
…y_pattern_of_examples_and_learning_tests
…y_pattern_of_examples_and_learning_tests Signed-off-by: sven1977 <svenmika1977@gmail.com> # Conflicts: # rllib/tuned_examples/dqn/cartpole_dqn.py # rllib/tuned_examples/ppo/cartpole-ppo-fake-gpus.yaml # rllib/tuned_examples/ppo/cartpole-ppo-grid-search-example.yaml # rllib/tuned_examples/ppo/cartpole-ppo-hyperband.yaml # rllib/tuned_examples/ppo/cartpole_ppo.py # rllib/tuned_examples/ppo/cartpole_truncated_ppo.py # rllib/tuned_examples/ppo/multi_agent_pendulum_ppo.py # rllib/tuned_examples/ppo/pendulum_ppo.py # rllib/tuned_examples/ppo/recomm-sys001-ppo.yaml # rllib/tuned_examples/sac/pendulum_sac.py
…y_pattern_of_examples_and_learning_tests
Signed-off-by: sven1977 <svenmika1977@gmail.com>
…y_pattern_of_examples_and_learning_tests Signed-off-by: sven1977 <svenmika1977@gmail.com> # Conflicts: # rllib/env/env_runner_group.py # rllib/utils/test_utils.py
Unify examples, tuned_examples (learning tests), and release tests into a common pattern of execution.
--wandb-key
,--num-env-runners
,--num-gpus
,--env
, etc..Why are these changes needed?
Related issue number
Checks
git commit -s
) in this PR.scripts/format.sh
to lint the changes in this PR.method in Tune, I've added it in
doc/source/tune/api/
under thecorresponding
.rst
file.