Asynchronous-DDPG_distributed_tensorflow

Distributed Tensorflow Implementation of asynchronous ddpg.

Implementation is on Tensorflow 1.2.1.

DDPG script is based on songrotek's repo. https://github.com/songrotek/DDPG.git

One of popular pain-points of reinforcement learning is too long learning time. Thus, A3C was proposed for parallel learning to efficiently learn the agent. However, for DDPG, one of strong alogrithm for continuous action episode, there are a few research for parallel learning. One of them is intentional unintentional agent, which is to learn several tasks simultaneously (https://arxiv.org/abs/1707.03300). In here, I validate parallel learning of ddpg for simpler experiment than IU agent's one. Each workers learn just one task. After learning several episodes, their training information is merged with parameter server.

GYM Reacher-v1 game

./auto_run.sh

You need to set your hostname and port number in gym_addpg.py code. The number of parameter servers and workers can be set in auto_run.sh script file.

Settings

Almost Settings are same to songrotek's ones, except learning rate of critic networks. The number of parameter server and workers is 1 and 4, repectively.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commits
figures		figures
LICENSE		LICENSE
README.md		README.md
actor_network_bn.py		actor_network_bn.py
actor_network_bn.pyc		actor_network_bn.pyc
auto_run.sh		auto_run.sh
critic_network.py		critic_network.py
critic_network.pyc		critic_network.pyc
ddpg.py		ddpg.py
ddpg.pyc		ddpg.pyc
filter_env.py		filter_env.py
filter_env.pyc		filter_env.pyc
gym_addpg.py		gym_addpg.py
ou_noise.py		ou_noise.py
ou_noise.pyc		ou_noise.pyc
replay_buffer.py		replay_buffer.py
replay_buffer.pyc		replay_buffer.pyc

License

jsikyoon/Asynchronous-DDPG_distributed_tensorflow

Folders and files

Latest commit

History

Repository files navigation

Asynchronous-DDPG_distributed_tensorflow

GYM Reacher-v1 game

Settings

Results

About

Topics

Resources

License

Stars

Watchers

Forks

Languages