DeepDPG-TensorFlow

TensorFlow Implementation of Deep Deterministic Policy Gradients

Intro

Replay buffers and target networks, first introduced in the Atari-playing DQN paper, made it possible to train deep value networks on complicated environments. This works well, but DQN is limited to discrete action domains, since it relies on finding the action that maximizes the action-value function, which is intractable when the action space is continuous. To handle continuous control, largely the same group of authors proposed Deep Deterministic Policy Gradients (DDPG), a model-free, off-policy actor-critic algorithm that reuses the ingredients that made DQN work. This repository implements that algorithm in TensorFlow for continuous OpenAI Gym environments.
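As a rough sketch of the idea (the functions below are hypothetical stand-ins, not this repo's actual TensorFlow graphs): the critic is regressed toward a bootstrapped target computed with the target networks, and the actor is then updated by following the critic's gradient with respect to the action.

```python
import numpy as np

# Hypothetical stand-ins for the learned target networks; in this repo they are
# TensorFlow networks whose layer sizes come from config.py.
def target_actor(next_states):                 # mu'(s'): deterministic target policy
    return np.tanh(next_states @ np.ones((3, 1)))

def target_critic(next_states, next_actions):  # Q'(s', a'): target action-value function
    return next_states.sum(axis=1, keepdims=True) + next_actions

def ddpg_critic_targets(rewards, next_states, dones, gamma=0.99):
    """Bootstrapped targets y = r + gamma * Q'(s', mu'(s')) for non-terminal transitions."""
    next_actions = target_actor(next_states)
    next_q = target_critic(next_states, next_actions)
    return rewards + gamma * (1.0 - dones) * next_q

# Tiny batch, just to show the shapes involved.
rewards = np.zeros((4, 1))
next_states = np.random.randn(4, 3)
dones = np.zeros((4, 1))
print(ddpg_critic_targets(rewards, next_states, dones).shape)   # (4, 1)
```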

Overview

This code contains:

  1. Deep Q-Network (Critic) Training and Policy Improvement
  2. Easy Network Settings and Batch Normalization at Will
    • changing your network architecture reduces to editing a list (see the first sketch after this list)
  3. Experience Replay Memory
    • makes the algorithm off-policy (sketched below, together with the target-network update)
  4. Target Networks for Both the Action-Value and Policy Functions
    • stabilize the learning process
  5. Ornstein-Uhlenbeck Action Noise for Exploration (see the last sketch after this list)
  6. Modular Design
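A minimal sketch of the "architecture as a list" idea, with hypothetical names (the actual fields live in config.py and may be named differently): the hidden layers are described by a plain Python list, so changing the architecture is a one-line edit.

```python
import numpy as np

# Hypothetical config entry: hidden layer sizes as a plain list.
ACTOR_HIDDEN_LAYERS = [400, 300]   # edit this list to change the architecture

def build_mlp_weights(input_dim, hidden_layers, output_dim, seed=0):
    """Allocate weight matrices for a simple fully connected network."""
    rng = np.random.default_rng(seed)
    sizes = [input_dim] + hidden_layers + [output_dim]
    return [rng.normal(scale=0.1, size=(n_in, n_out))
            for n_in, n_out in zip(sizes[:-1], sizes[1:])]

weights = build_mlp_weights(input_dim=3, hidden_layers=ACTOR_HIDDEN_LAYERS, output_dim=1)
print([w.shape for w in weights])   # [(3, 400), (400, 300), (300, 1)]
```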
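The two DQN ingredients above can be sketched in a few lines (class and function names here are hypothetical, not this repo's exact API): a fixed-size replay memory sampled uniformly at random, and target parameters that slowly track the learned parameters through Polyak averaging with a small factor tau.

```python
import random
from collections import deque

class ReplayBuffer:
    """Fixed-size experience replay memory with uniform sampling."""
    def __init__(self, capacity=int(1e6)):
        self.memory = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.memory.append((state, action, reward, next_state, done))

    def sample(self, batch_size=64):
        batch = random.sample(self.memory, batch_size)
        return list(zip(*batch))  # columns: states, actions, rewards, next_states, dones

def soft_update(target_params, online_params, tau=1e-3):
    """Polyak averaging: target <- tau * online + (1 - tau) * target."""
    return [tau * w + (1.0 - tau) * w_t
            for w_t, w in zip(target_params, online_params)]
```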
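Exploration noise comes from an Ornstein-Uhlenbeck process. A minimal sketch, using the theta = 0.15 and sigma = 0.2 values from the DDPG paper (this repo's settings and discretization may differ):

```python
import numpy as np

class OrnsteinUhlenbeckNoise:
    """Temporally correlated noise added to the actor's action for exploration."""
    def __init__(self, action_dim, mu=0.0, theta=0.15, sigma=0.2, dt=1e-2, seed=0):
        self.mu, self.theta, self.sigma, self.dt = mu, theta, sigma, dt
        self.rng = np.random.default_rng(seed)
        self.state = np.full(action_dim, mu)

    def reset(self):
        self.state = np.full_like(self.state, self.mu)

    def sample(self):
        # Euler discretization of the OU stochastic differential equation.
        dx = self.theta * (self.mu - self.state) * self.dt \
             + self.sigma * np.sqrt(self.dt) * self.rng.standard_normal(self.state.shape)
        self.state = self.state + dx
        return self.state

noise = OrnsteinUhlenbeckNoise(action_dim=1)
print([noise.sample() for _ in range(3)])   # correlated samples drifting back toward mu
```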

A Playground for Controlling OpenAI Gym Environments

You can play with and tune the network settings in config.py and point the agent at other continuous Gym environments.
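For instance, with the classic (pre-0.26) Gym API and a continuous-action environment such as Pendulum (the environment id and the random-action placeholder below are assumptions, not this repo's entry point):

```python
import gym

env = gym.make('Pendulum-v0')           # any continuous-action Gym environment
state = env.reset()                      # classic Gym reset/step signatures
for step in range(200):
    action = env.action_space.sample()   # replace with the trained actor's output
    state, reward, done, info = env.step(action)
    if done:
        state = env.reset()
```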

TODOS

  • extend it to MuJoCo environments
  • save and load checkpoints (network weights)
  • add proper (TensorBoard) summaries

References
