My DDPG implementation as PL system

The Algorithm

Pseudocode

`LunarLanderContinuous-v2` parameters:

Bad implementation

DDPG net:

`BipedalWalker-v3` parameters:

Bad implementation

DDPG net:

Credits:

Papers:

@report{Silver2014, author = {David Silver and Nicolas Heess and Thomas Degris and Daan Wierstra and Martin Riedmiller}, keywords = {ICML,boring formatting information,machine learning}, title = {2014 - Silver - Deterministic Policy Gradient Algorithms.pdf}, year = {2014}, } (paper)
@article{Sutton1999, author = {Richard S Sutton and David Mcallester and Satinder Singh and Yishay Mansour}, title = {Policy Gradient Methods for Reinforcement Learning with Function Approximation}, year = {1999}, } (paper)
@article{Lillicrap2016, author = {Timothy P Lillicrap and Jonathan J Hunt and Alexander Pritzel and Nicolas Heess and Tom Erez and Yuval Tassa and David Silver and Daan Wierstra}, title = {CONTINUOUS CONTROL WITH DEEP REINFORCEMENT LEARNING}, url = { https://goo.gl/J4PIAz }, year = {2016}, } (paper)
@article{weng2018PG, title = "Policy Gradient Algorithms", author = "Weng, Lilian", journal = "lilianweng.github.io/lil-log", year = "2018", url = "https://lilianweng.github.io/lil-log/2018/04/08/policy-gradient-algorithms.html" }

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
static		static
.gitignore		.gitignore
README.md		README.md
alg_constrants_amd_packages.py		alg_constrants_amd_packages.py
alg_general_functions.py		alg_general_functions.py
alg_logger.py		alg_logger.py
alg_memory.py		alg_memory.py
alg_module.py		alg_module.py
alg_net.py		alg_net.py
alg_train.py		alg_train.py
drafts1.py		drafts1.py
play_game.py		play_game.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

static

static

.gitignore

.gitignore

README.md

README.md

alg_constrants_amd_packages.py

alg_constrants_amd_packages.py

alg_general_functions.py

alg_general_functions.py

alg_logger.py

alg_logger.py

alg_memory.py

alg_memory.py

alg_module.py

alg_module.py

alg_net.py

alg_net.py

alg_train.py

alg_train.py

drafts1.py

drafts1.py

play_game.py

play_game.py

Repository files navigation

My DDPG implementation as PL system

The Algorithm

Pseudocode

`LunarLanderContinuous-v2` parameters:

`BipedalWalker-v3` parameters:

Credits:

Papers:

About

Releases

Packages

Languages

Arseni1919/PL_DDPG

Folders and files

Latest commit

History

Repository files navigation

My DDPG implementation as PL system

The Algorithm

Pseudocode

LunarLanderContinuous-v2 parameters:

BipedalWalker-v3 parameters:

Credits:

Papers:

About

Resources

Stars

Watchers

Forks

Languages

`LunarLanderContinuous-v2` parameters:

`BipedalWalker-v3` parameters: