Introduce benchmark utilities #165

vwxyzjn · 2022-04-18T02:13:02Z

Description

This PR introduces utilities that help us to run the benchmark experiments more smoothly.

Previously, we relied on a very simple mechanism for conducting benchmark experiments such as

Lines 3 to 7 in 443bb14

    
for env_id in "HalfCheetah-v2" "Walker2d-v2" "Hopper-v2"; do
    for seed in 1 2 3; do
        poetry run python cleanrl/sac_continuous_action.py --track --capture-video --wandb-project-name cleanrl --wandb-entity openrlbenchmark --env-id $env_id --seed $seed
    done
done

Such bash script usage is pretty straightforward but lacks flexibility and configurability. This PR introduces a command such as

OMP_NUM_THREADS=1 python -m cleanrl_utils.benchmark \
    --env-ids CartPole-v1 Acrobot-v1 MountainCar-v0 \
    --command "poetry run python cleanrl/ppo.py --track --cuda False" \
    --num-seeds 3 \
    --workers 3

which will automatically run the experiments using up to 3 subprocesses at a time (--workers 3). A full example can be seen here:

# export WANDB_ENTITY=openrlbenchmark
# export WANDB_PROJECT=cleanrl

OMP_NUM_THREADS=1 python -m cleanrl_utils.benchmark \
    --env-ids CartPole-v1 Acrobot-v1 MountainCar-v0 \
    --command "poetry run python cleanrl/ppo.py --track --cuda False" \
    --num-seeds 3 \
    --workers 3

poetry install -E atari
python -m cleanrl_utils.benchmark \
    --env-ids PongNoFrameskip-v4 BeamRiderNoFrameskip-v4 BreakoutNoFrameskip-v4 \
    --command "poetry run python cleanrl/ppo_atari.py --track" \
    --num-seeds 3 \
    --workers 1

python -m cleanrl_utils.benchmark \
    --env-ids PongNoFrameskip-v4 BeamRiderNoFrameskip-v4 BreakoutNoFrameskip-v4 \
    --command "poetry run python cleanrl/ppo_atari_lstm.py --track" \
    --num-seeds 3 \
    --workers 1

python -m cleanrl_utils.benchmark \
    --env-ids PongNoFrameskip-v4 BeamRiderNoFrameskip-v4 BreakoutNoFrameskip-v4 \
    --command "poetry run python cleanrl/ppo_atari_envpool.py --track" \
    --num-seeds 3 \
    --workers 1

poetry install -E "mujoco pybullet"
python -c "import mujoco_py"
OMP_NUM_THREADS=1 python -m cleanrl_utils.benchmark \
    --env-ids HalfCheetah-v2 Walker2d-v2 Hopper-v2 \
    --command "poetry run python cleanrl/ppo_continuous_action.py --track --cuda False" \
    --num-seeds 3 \
    --workers 3

poetry install -E procgen
python -m cleanrl_utils.benchmark \
    --env-ids starpilot bossfight bigfish \
    --command "poetry run python cleanrl/ppo_procgen.py --track" \
    --num-seeds 3 \
    --workers 1
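
As a rough illustration of what the utility does with these flags, the sketch below mirrors the loop in cleanrl_utils/benchmark.py: every seed from 1 to --num-seeds is paired with every environment id, and --env-id and --seed are appended to the base command. The helper name build_commands is hypothetical and only used here for illustration; it is not part of this PR.

```python
from typing import List


def build_commands(command: str, env_ids: List[str], num_seeds: int) -> List[str]:
    """Expand a base command into one command per (seed, env_id) pair.

    Hypothetical helper for illustration. Seeds start at 1 and the outer
    loop runs over seeds, matching the loop in the benchmark script.
    """
    commands = []
    for seed in range(1, num_seeds + 1):
        for env_id in env_ids:
            commands.append(" ".join([command, "--env-id", env_id, "--seed", str(seed)]))
    return commands


if __name__ == "__main__":
    for cmd in build_commands(
        "poetry run python cleanrl/ppo.py --track --cuda False",
        ["CartPole-v1", "Acrobot-v1", "MountainCar-v0"],
        3,
    ):
        print(cmd)
```

With three environments and three seeds this yields nine commands, which the workers then pick up in submission order.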

Types of changes

Bug fix
New feature
New algorithm
Documentation

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have updated the documentation and previewed the changes via mkdocs serve.

vercel · 2022-04-18T02:13:05Z

This pull request is being automatically deployed with Vercel (learn more).
To see the status of your deployment, click below or on the icon next to each commit.

🔍 Inspect: https://vercel.com/vwxyzjn/cleanrl/9tPNxQNMcQfs1X6PgsoMTTb7YCW8
✅ Preview: https://cleanrl-git-benchmark-utilities-vwxyzjn.vercel.app

gitpod-io · 2022-04-18T02:13:06Z

dosssman

Quite useful addition. Will try to properly use it for the next benchmarks.

vwxyzjn · 2022-04-20T02:47:23Z

Added more documentation. Going to merge this soon.

yooceii · 2022-04-20T04:37:03Z

cleanrl_utils/benchmark.py

+import argparse
+import shlex
+import subprocess
+
+
+def parse_args():
+    # fmt: off
+    parser = argparse.ArgumentParser()
+    parser.add_argument("--env-ids", nargs="+", default=["CartPole-v1", "Acrobot-v1", "MountainCar-v0"],
+        help="the ids of the environment to benchmark")
+    parser.add_argument("--command", type=str, default="poetry run python cleanrl/ppo.py",
+        help="the command to run")
+    parser.add_argument("--num-seeds", type=int, default=3,
+        help="the number of random seeds")
+    parser.add_argument('--workers', type=int, default=0,
+        help='the number of workers to run benchmark experiments (commands are only printed when set to 0)')
+    args = parser.parse_args()
+    # fmt: on
+    return args
+
+
+def run_experiment(command: str):
+    command_list = shlex.split(command)
+    print(f"running {command}")
+    fd = subprocess.Popen(command_list)
+    return_code = fd.wait()
+    assert return_code == 0
+
+
+if __name__ == "__main__":
+    args = parse_args()
+    commands = []
+    for seed in range(1, args.num_seeds + 1):
+        for env_id in args.env_ids:
+            commands += [" ".join([args.command, "--env-id", env_id, "--seed", str(seed)])]
+
+    print(commands)
+
+    if args.workers > 0:
+        from concurrent.futures import ThreadPoolExecutor
+
+        executor = ThreadPoolExecutor(max_workers=args.workers, thread_name_prefix="cleanrl-benchmark-worker-")
+        for command in commands:
+            executor.submit(run_experiment, command)
+        executor.shutdown(wait=True, cancel_futures=False)


If it simply spawns multiple processes, why don't we instead write a bash script?

vwxyzjn · 2022-04-20

I’m open to using a bash script. But the main benefit here is that Python code is a bit easier to read and also allows me to set a maximum number of workers. I don’t know if I can set a maximum number of workers with bash.

yooceii · 2022-04-21

I think you can use poetry run python cleanrl/ppo.py & to spawn a new process and run it in the background.

vwxyzjn · 2022-04-21

True. The issue is setting a maximum number of workers. Imagine I have 60 commands to run; poetry run python cleanrl/ppo.py & is just gonna overload my CPU.
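
To make the point about capping concurrency concrete, here is a minimal sketch (not the code in this PR) of a bounded pool: ThreadPoolExecutor(max_workers=N) keeps at most N subprocesses alive at once no matter how many commands are queued. Unlike the script in the diff, it also keeps the futures and reads back each return code, since an exception raised inside a submitted task is otherwise stored on the future and never surfaced. The function name run_all and the placeholder commands built from sys.executable are assumptions, chosen so the example runs without CleanRL installed.

```python
import shlex
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor
from typing import List, Tuple


def run_experiment(command: str) -> int:
    """Run one command in a subprocess and return its exit code."""
    return subprocess.run(shlex.split(command)).returncode


def run_all(commands: List[str], workers: int) -> List[Tuple[str, int]]:
    """Run all commands with at most `workers` subprocesses alive at once."""
    with ThreadPoolExecutor(max_workers=workers) as executor:
        futures = [executor.submit(run_experiment, command) for command in commands]
        # result() blocks until the task finishes and re-raises any
        # exception that occurred inside the worker thread.
        return [(command, future.result()) for command, future in zip(commands, futures)]


if __name__ == "__main__":
    # Placeholder commands standing in for real training runs.
    python = shlex.quote(sys.executable)
    commands = [f'{python} -c "pass"' for _ in range(6)]
    results = run_all(commands, workers=3)
    failed = [command for command, code in results if code != 0]
    print(f"{len(results) - len(failed)} succeeded, {len(failed)} failed")
```

Returning the exit codes instead of asserting inside the worker lets the caller decide whether one failed run should abort the whole benchmark or just be reported at the end.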

docs/get-started/benchmark-utility.md

vwxyzjn · 2022-04-23T23:53:10Z

Merging now.

vwxyzjn requested review from dosssman and yooceii April 18, 2022 02:13

vwxyzjn force-pushed the benchmark-utilities branch from 5b3a5ff to 8c8dc56 Compare April 18, 2022 02:24

vwxyzjn added 3 commits April 17, 2022 22:37

Introduce benchmark utilities

25ac2bf

Add PPO benchmark script

3d62bbc

Fix typo

2c85aba

vwxyzjn force-pushed the benchmark-utilities branch from 8c8dc56 to 2c85aba Compare April 18, 2022 02:37

update benchmark scripts

594ab1a

Fix issues with headless mode

e45e9a9

Fix workers

1a21a5a

dosssman approved these changes Apr 19, 2022

add documentation

d165a5e

vwxyzjn mentioned this pull request Apr 20, 2022

Add docs for c51.py and c51_atari.py #159

Merged

19 tasks

yooceii reviewed Apr 20, 2022

vwxyzjn merged commit 5184afc into master Apr 23, 2022
