Refactor to use tyro #424

vwxyzjn · 2023-10-16T01:55:50Z

Description

Better code, IDE support via tyro. Also refactors PPO and closes #206

Gonna do #408 separately.

I also got to redocument ppo_atari_multigpu about the scaling log as well.

Types of changes

Bug fix
New feature
New algorithm
Documentation

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have updated the tests accordingly (if applicable).
I have updated the documentation and previewed the changes via mkdocs serve.
- I have explained note-worthy implementation details.
- I have explained the logged metrics.
- I have added links to the original paper and related papers.

If you need to run benchmark experiments for a performance-impacting changes:

I have contacted @vwxyzjn to obtain access to the openrlbenchmark W&B team.
I have used the benchmark utility to submit the tracked experiments to the openrlbenchmark/cleanrl W&B project, optionally with --capture-video.
I have performed RLops with python -m openrlbenchmark.rlops.
- For new feature or bug fix:
  - I have used the RLops utility to understand the performance impact of the changes and confirmed there is no regression.
- For new algorithm:
  - I have created a table comparing my results against those from reputable sources (i.e., the original paper or other reference implementation).
- I have added the learning curves generated by the python -m openrlbenchmark.rlops utility to the documentation.
- I have added links to the tracked experiments in W&B, generated by python -m openrlbenchmark.rlops ....your_args... --report, to the documentation.

vercel · 2023-10-16T01:55:55Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
cleanrl	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Nov 28, 2023 0:32am

sdpkjc

Elegant code! There are also some test case commands that need to be modified.

sdpkjc · 2023-10-16T04:39:06Z

cleanrl/ppo_rnd_envpool.py

+    args = tyro.cli(Args)
+    args.batch_size = int(args.num_envs * args.num_steps)
+    args.minibatch_size = int(args.batch_size // args.num_minibatches)
+    args.num_iterations = args.total_timesteps // args.batch_size


args.num_iterations does not appear to replace the previous num_updates. Defined but not used.

…to refactor-tyro

vwxyzjn · 2023-11-28T01:32:08Z

Closes #418 too (added CI for 3.8, 3.9, 3.10)

Refactor to use tyro

cd4851e

vwxyzjn added 2 commits October 15, 2023 22:05

push

b97d54f

psuh

b87a015

vercel bot deployed to Preview October 16, 2023 02:06 View deployment

refactor

896f346

vercel bot deployed to Preview October 16, 2023 02:16 View deployment

vwxyzjn added 2 commits October 15, 2023 22:16

fix pre-commit

6220645

fix pre-commit

adbf836

vercel bot deployed to Preview October 16, 2023 02:17 View deployment

vwxyzjn requested a review from sdpkjc October 16, 2023 02:18

sdpkjc reviewed Oct 16, 2023

View reviewed changes

vwxyzjn mentioned this pull request Oct 16, 2023

ppo_continuous_action huggingface integration #423

Merged

18 tasks

vwxyzjn and others added 2 commits October 16, 2023 09:06

fix commend

8af1e13

Merge branch 'master' into refactor-tyro

0b61550

vercel bot deployed to Preview October 16, 2023 14:40 View deployment

vwxyzjn added 3 commits October 16, 2023 17:02

refactor

96a56b8

Merge branch 'refactor-tyro' of https://github.com/vwxyzjn/cleanrl in…

a8795a9

…to refactor-tyro

update poetry

cb6b47a

vercel bot deployed to Preview October 16, 2023 21:03 View deployment

fix test case

cfeedb0

vercel bot deployed to Preview October 16, 2023 22:04 View deployment

quick fix

9c0959c

vercel bot deployed to Preview October 16, 2023 22:06 View deployment

fix

5f3f716

vercel bot deployed to Preview October 17, 2023 00:38 View deployment

update optuna

08f4392

vercel bot deployed to Preview October 17, 2023 00:51 View deployment

quick change

de6c829

vercel bot deployed to Preview October 17, 2023 00:52 View deployment

vwxyzjn mentioned this pull request Nov 19, 2023

Pyyaml error on poetry install #418

Closed

3 tasks

update docs

60b71f7

vercel bot deployed to Preview November 27, 2023 22:06 View deployment

update ppo docs

7a96de2

vercel bot deployed to Preview November 27, 2023 22:30 View deployment

vwxyzjn added 2 commits November 27, 2023 17:30

bump version

89846df

bump version

4f0dc48

vercel bot deployed to Preview November 27, 2023 22:30 View deployment

vwxyzjn marked this pull request as ready for review November 27, 2023 22:32

bump test cases

d821748

vercel bot deployed to Preview November 27, 2023 22:34 View deployment

add benchmark utility docs

7880155

vercel bot deployed to Preview November 27, 2023 22:44 View deployment

bump test

50ec155

vercel bot deployed to Preview November 27, 2023 22:46 View deployment

fix #418

940595a

vercel bot deployed to Preview November 27, 2023 23:23 View deployment

update requirements.txt

b0caf45

vercel bot deployed to Preview November 27, 2023 23:25 View deployment

test

aaf7dd0

vercel bot deployed to Preview November 27, 2023 23:27 View deployment

add numpy

2fb4814

vercel bot deployed to Preview November 28, 2023 00:32 View deployment

vwxyzjn merged commit 35896b1 into master Nov 28, 2023
52 checks passed

This was referenced Nov 28, 2023

numpy version issue with python 3.10 #417

Closed

Upgrade gym version to 0.26.1 #263

Closed

Various minor PPO refactors #167

Closed

Video upload Issue - wandb #397

Closed

Liberate the requirements.txt #387

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor to use tyro #424

Refactor to use tyro #424

vwxyzjn commented Oct 16, 2023 •

edited

Loading

vercel bot commented Oct 16, 2023 •

edited

Loading

sdpkjc left a comment

sdpkjc Oct 16, 2023

vwxyzjn commented Nov 28, 2023

Refactor to use tyro #424

Refactor to use tyro #424

Conversation

vwxyzjn commented Oct 16, 2023 • edited Loading

Description

Types of changes

Checklist:

vercel bot commented Oct 16, 2023 • edited Loading

sdpkjc left a comment

Choose a reason for hiding this comment

sdpkjc Oct 16, 2023

Choose a reason for hiding this comment

vwxyzjn commented Nov 28, 2023

vwxyzjn commented Oct 16, 2023 •

edited

Loading

vercel bot commented Oct 16, 2023 •

edited

Loading