DDPG documnetation tweaks; added Q loss equations and light explanation #145

dosssman · 2022-03-23T02:34:00Z

Description

Fixed a few typos and proposed some reformulations of a few sentences.
Added a little bit more details regarding DDPG's Q loss.

Other comments

Regarding the hard time reproducing ddpg on Mujoco-v1, I was wondering how feasible it would be to run fujimoto's DDPG.py etc.. on free-mujoco

Other than that, great job on the pretty complete documentation for DDPG @vwxyzjn @yooceii , and sorry for being late to the party 🙇

Types of changes

Bug fix
New feature
New algorithm
Documentation

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have updated the documentation and previewed the changes via mkdocs serve.
I have updated the tests accordingly (if applicable).

If you are adding new algorithms or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.

vercel · 2022-03-23T02:34:05Z

This pull request is being automatically deployed with Vercel (learn more).
To see the status of your deployment, click below or on the icon next to each commit.

🔍 Inspect: https://vercel.com/vwxyzjn/cleanrl/CSE1uakxpjPwtLa1Dm9cmjwxxE4g
✅ Preview: https://cleanrl-git-fork-dosssman-ddpg-docs-tweaks-vwxyzjn.vercel.app

gitpod-io · 2022-03-23T02:34:07Z

vwxyzjn · 2022-03-23T02:40:23Z

This PR is a follow-up on #137. Thanks @dosssman for this fix! I will take a look at it tomorrow :)

Regarding the hard time reproducing ddpg on Mujoco-v1, I was wondering how feasible it would be to run fujimoto's DDPG.py etc.. on free-mujoco

There it is: https://wandb.ai/openrlbenchmark/openrlbenchmark/reports/MuJoCo-sfujim-TD3--VmlldzoxNzIyODIz

dosssman · 2022-03-23T02:42:02Z

Thanks. The report seems privated though:

vwxyzjn · 2022-03-23T02:47:59Z

Could you try it again?

dosssman · 2022-03-23T02:55:24Z

All good now

vwxyzjn

LGTM. Thanks!

DDPG documnetation tweaks; added Q loss equations and light explanation

7854e0c

vercel bot deployed to Preview March 23, 2022 02:34 View deployment

dosssman requested a review from vwxyzjn March 23, 2022 02:40

vwxyzjn approved these changes Mar 23, 2022

View reviewed changes

vwxyzjn merged commit cfed3dd into vwxyzjn:master Mar 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DDPG documnetation tweaks; added Q loss equations and light explanation #145

DDPG documnetation tweaks; added Q loss equations and light explanation #145

dosssman commented Mar 23, 2022 •

edited

vercel bot commented Mar 23, 2022 •

edited

gitpod-io bot commented Mar 23, 2022

vwxyzjn commented Mar 23, 2022

dosssman commented Mar 23, 2022

vwxyzjn commented Mar 23, 2022

dosssman commented Mar 23, 2022

vwxyzjn left a comment

DDPG documnetation tweaks; added Q loss equations and light explanation #145

DDPG documnetation tweaks; added Q loss equations and light explanation #145

Conversation

dosssman commented Mar 23, 2022 • edited

Description

Other comments

Types of changes

Checklist:

vercel bot commented Mar 23, 2022 • edited

gitpod-io bot commented Mar 23, 2022

vwxyzjn commented Mar 23, 2022

dosssman commented Mar 23, 2022

vwxyzjn commented Mar 23, 2022

dosssman commented Mar 23, 2022

vwxyzjn left a comment

Choose a reason for hiding this comment

dosssman commented Mar 23, 2022 •

edited

vercel bot commented Mar 23, 2022 •

edited