fix: ddpg action bias #299
Thanks for the PR. Running some benchmark experiments now.
Using the following snippet from #307:

    python rlops.py --exp-name ddpg_continuous_action \
        --wandb-project-name cleanrl \
        --wandb-entity openrlbenchmark \
        --tags pr-299 rlops-pilot \
        --env-ids Hopper-v2 Walker2d-v2 HalfCheetah-v2 \
        --output-filename compare.png \
        --report

we generate the following image.

Discussion
What remains is to update the documentation and optionally run more experiments in more envs.
Experiments were done, and the docs were updated. Running the command from #307 generated the following figure and table.
Thanks @sdpkjc for this PR and raising the issue.
Description
Fixes the first part of #297
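The diff itself is not reproduced in this conversation, so the following is only a minimal sketch of what an "action bias" correction usually means for a deterministic actor. It assumes the actor output is squashed with tanh and that the environment's action bounds may not be symmetric around zero. The function name `scale_action` and its arguments are hypothetical and do not come from the PR.

```python
import math


def scale_action(raw, low, high):
    """Map unbounded actor outputs into [low, high], one dimension at a time.

    Hypothetical helper for illustration; not taken from the PR's diff.
    """
    scaled = []
    for x, lo, hi in zip(raw, low, high):
        action_scale = (hi - lo) / 2.0  # half-width of the action range
        action_bias = (hi + lo) / 2.0   # midpoint of the action range
        # tanh(x) lies in [-1, 1]; scale and shift it into [lo, hi].
        scaled.append(math.tanh(x) * action_scale + action_bias)
    return scaled
```

Multiplying by the upper bound alone would centre actions on zero, which is only correct when `low == -high`. Adding the midpoint as a bias term is what places a zero pre-activation at the centre of an asymmetric range.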
Types of changes
Checklist:
... pre-commit run --all-files passes (required).
... mkdocs serve.
If you are adding new algorithms or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.
... --capture-video flag toggled on (required).
... mkdocs serve.
... (... width=500 and height=300).