Add PPO documentation #163

vwxyzjn · 2022-04-18T02:04:27Z

Description

Types of changes

Documentation

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have updated the documentation and previewed the changes via mkdocs serve.
I have updated the tests accordingly (if applicable).

If you are adding new algorithms or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.

vercel · 2022-04-18T02:04:29Z

This pull request is being automatically deployed with Vercel (learn more).
To see the status of your deployment, click below or on the icon next to each commit.

🔍 Inspect: https://vercel.com/vwxyzjn/cleanrl/5CkswQxbRMkZh3yzhki7iWUgT4Dc
✅ Preview: https://cleanrl-git-ppo-docs-continued-vwxyzjn.vercel.app

gitpod-io · 2022-04-18T02:04:30Z

vwxyzjn · 2022-04-24T01:43:12Z

@yooceii @dosssman this is ready for review. I didn't do the I have added links to the PR related to the algorithm because I found adding the PR to be less helpful than I anticipated... Ideally, the users should be able to reproduce the same results in the latest master without checking out each PR.

yooceii · 2022-04-24T03:46:52Z

docs/rl-algorithms/ppo.md

@@ -88,12 +110,14 @@ Learning curves:

 Tracked experiments and game play videos:


nit, if it's classic control env, then probably use videos instead game play videos.

yooceii

lgtm

dosssman

All clear on my side too.

cache more documentation

817ef22

vwxyzjn mentioned this pull request Apr 18, 2022

Change ppo.py's default timesteps #164

Merged

4 tasks

Add documentation

9bafe30

vercel bot deployed to Preview April 24, 2022 00:26 View deployment

Add ppo_continuous_action.py documentation

6991ac2

vercel bot deployed to Preview April 24, 2022 00:52 View deployment

Quick fix

c1cef3b

vercel bot deployed to Preview April 24, 2022 00:52 View deployment

vwxyzjn added 2 commits April 23, 2022 21:05

Add ppo_atari_lstm.py docs

8c2f7dd

Add explanation of metrics

2621795

vercel bot deployed to Preview April 24, 2022 01:22 View deployment

Add ppo_atari_envpool.py docs

9082ad1

vercel bot deployed to Preview April 24, 2022 01:30 View deployment

Add ppo_procgen.py docs

cbc725f

vercel bot deployed to Preview April 24, 2022 01:37 View deployment

Fix docs

754627b

vercel bot deployed to Preview April 24, 2022 01:41 View deployment

vwxyzjn marked this pull request as ready for review April 24, 2022 01:41

vwxyzjn requested review from dosssman and yooceii April 24, 2022 01:41

vwxyzjn mentioned this pull request Apr 24, 2022

Refactor documentation #121

Closed

10 tasks

yooceii reviewed Apr 24, 2022

View reviewed changes

yooceii approved these changes Apr 24, 2022

View reviewed changes

dosssman approved these changes Apr 26, 2022

View reviewed changes

vwxyzjn merged commit 2fff248 into master Apr 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add PPO documentation #163

Add PPO documentation #163

vwxyzjn commented Apr 18, 2022 •

edited

Loading

vercel bot commented Apr 18, 2022 •

edited

Loading

gitpod-io bot commented Apr 18, 2022

vwxyzjn commented Apr 24, 2022

yooceii Apr 24, 2022

yooceii left a comment

dosssman left a comment

		@@ -88,12 +110,14 @@ Learning curves:

		Tracked experiments and game play videos:

Add PPO documentation #163

Add PPO documentation #163

Conversation

vwxyzjn commented Apr 18, 2022 • edited Loading

Description

Types of changes

Checklist:

vercel bot commented Apr 18, 2022 • edited Loading

gitpod-io bot commented Apr 18, 2022

vwxyzjn commented Apr 24, 2022

yooceii Apr 24, 2022

Choose a reason for hiding this comment

yooceii left a comment

Choose a reason for hiding this comment

dosssman left a comment

Choose a reason for hiding this comment

vwxyzjn commented Apr 18, 2022 •

edited

Loading

vercel bot commented Apr 18, 2022 •

edited

Loading