
[RLlib] Fix env rendering and recording options (for non-local mode; >0 workers; +evaluation-workers). #14796

Merged Mar 23, 2021 (15 commits)

Conversation

@sven1977 (Contributor) commented Mar 19, 2021

This PR addresses the following problem:
RLlib's env rendering and video recording options are currently buggy.

  • Allows custom gym.Envs to be rendered in an automatic window by simply returning an np.array RGB image from the render() method (see the sketch after this list).
  • Alternatively, custom Envs can take care of their own rendering mechanism via their own window handling.
  • Fixes video recording for non-local mode and num_workers > 0.
  • Adds an example script that shows how to use both options in a simple corridor env.
  • Soft-obsoletes the "monitor" config option for clarity.
  • Also works for evaluation-only rendering/recording (via the evaluation config, as shown in the new example script and in the config sketch below).
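
For illustration, here is a minimal sketch of such a render()-returning-an-RGB-array env. This is a hypothetical toy env (not the corridor env from the example script), written against the pre-0.26 gym API used at the time:

import gym
import numpy as np

class RenderableCorridor(gym.Env):
    """Toy env whose render() returns an RGB frame as an np.array.

    Returning an HxWx3 uint8 array from render() is what lets RLlib
    pop up an automatic window (or record the frames). An env that
    manages its own window can instead render internally.
    """

    def __init__(self, config=None):
        self.observation_space = gym.spaces.Discrete(10)
        self.action_space = gym.spaces.Discrete(2)
        self.pos = 0

    def reset(self):
        self.pos = 0
        return self.pos

    def step(self, action):
        # Action 1 moves one step right; reaching the end terminates.
        self.pos = min(self.pos + int(action), 9)
        done = self.pos == 9
        return self.pos, 1.0 if done else 0.0, done, {}

    def render(self, mode="rgb_array"):
        # A red bar that grows as the agent walks down the corridor.
        frame = np.zeros((64, 64, 3), dtype=np.uint8)
        frame[:, : 6 * (self.pos + 1), 0] = 255
        return frame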

IMPORTANT NOTE:
A recent bug in OpenAI gym prevents RLlib's "record_env" option from recording videos properly: the produced mp4 files are only 1kb in size and corrupted. A simple fix is described here:
openai/gym#1925
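
For completeness, a minimal config sketch of how the two options fit together for evaluation-only recording. This is illustrative, not the actual example script from this PR; the algorithm and env names ("PPO", "CartPole-v0") are placeholders:

from ray import tune

tune.run(
    "PPO",
    stop={"training_iteration": 2},
    config={
        "env": "CartPole-v0",
        "num_workers": 1,
        # Record evaluation episodes as mp4 files into the "videos"
        # directory (subject to the gym bug mentioned above).
        "evaluation_interval": 1,
        "evaluation_num_workers": 1,
        "evaluation_config": {
            "record_env": "videos",
            "render_env": True,
        },
    },
)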

Checks

  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

@sven1977 added the tests-ok label on Mar 22, 2021
@rfali (Contributor) commented Jun 1, 2021

I was trying to use the rendering and recording options with PettingZoo environments, but only rendering works (and the pygame window crashes at the end of the episode); the recorder doesn't record anything at all (no videos folder is created either). Has this patch been verified to work with custom multi-agent envs?

I installed the nightly wheels from here and also upgraded gym to the latest version:
ray: 2.0.0.dev0
gym: 0.18.3
pettingzoo: 1.8.2

First, I verified that rllib/examples/env_rendering_and_recording.py works: it renders and saves the videos, and prints a helpful message with the path where each recorded video is saved.

I then tried two PettingZoo environments (waterworld and space_invaders); both rendered, but the pygame window crashes. If rendering is set to False, training completes, but there is no videos folder nor any message that videos are being saved. Here is the code I tried, adapted from one of the RLlib examples, rllib/examples/multi_agent_parameter_sharing.py:

from ray import tune
from ray.tune.registry import register_env
from ray.rllib.env.wrappers.pettingzoo_env import PettingZooEnv
from pettingzoo.sisl import waterworld_v3

if __name__ == "__main__":

    def env_creator(args):
        return PettingZooEnv(waterworld_v3.env())

    env = env_creator({})
    register_env("waterworld", env_creator)

    obs_space = env.observation_space
    act_space = env.action_space

    policies = {"shared_policy": (None, obs_space, act_space, {})}

    # for all methods
    policy_ids = list(policies.keys())

    tune.run(
        "APEX_DDPG",
        stop={"episodes_total": 10},
        checkpoint_freq=10,
        local_dir="my_results",
        config={

            # Environment specific
            "env": "waterworld",

            # General
            "num_gpus": 1,
            "num_workers": 2,
            "num_envs_per_worker": 8,
            "learning_starts": 1000,
            "buffer_size": int(1e5),
            "compress_observations": True,
            "rollout_fragment_length": 20,
            "train_batch_size": 512,
            "gamma": .99,
            "n_step": 3,
            "lr": .0001,
            "prioritized_replay_alpha": 0.5,
            "final_prioritized_replay_beta": 1.0,
            "target_network_update_freq": 50000,
            "timesteps_per_iteration": 25000,

            # Method specific
            "multiagent": {
                "policies": policies,
                "policy_mapping_fn": (lambda agent_id: "shared_policy"),
            },
            "evaluation_interval": 1,
            "evaluation_num_episodes": 2,
            "evaluation_num_workers": 1,
            "evaluation_config": {
                "record_env": "videos",
                "render_env": False,
            },
        },
    )

This one uses the space_invaders game, and I also moved the render and record options out of the evaluation config, but the outcome did not change.

from ray import tune
from ray.tune.registry import register_env
from ray.rllib.env.wrappers.pettingzoo_env import PettingZooEnv
from pettingzoo.atari import space_invaders_v1

if __name__ == "__main__":

    def env_creator(args):
        return PettingZooEnv(space_invaders_v1.env())

    env = env_creator({})
    register_env("space_invaders", env_creator)

    obs_space = env.observation_space
    act_space = env.action_space

    policies = {"shared_policy": (None, obs_space, act_space, {})}

    # for all methods
    policy_ids = list(policies.keys())

    tune.run(
        "PPO",
        stop={"episodes_total": 10},
        checkpoint_freq=10,
        local_dir="my_results",
        config={
            # Environment specific
            "env": "space_invaders",

            # General
            "num_gpus": 1,
            "num_workers": 1,
            "num_envs_per_worker": 2,
            "record_env": "videos",
            "render_env": False,
        
        },
    )

Please let me know if I should open this as a separate issue. Thanks
