Standard implementation of the DDPG algorithm
https://arxiv.org/abs/1509.02971
Coupled with DeepMind's newest ditributional bellman equation update, chekout critic_network.py (loss function) and ddpg.py (train function) for details.
https://arxiv.org/pdf/1707.06887.pdf
FILES:
actor_network.py: The code for structure of the actor network
critic_network.py: The code for structure of the critic network
ou_noise.py:The random noise generator
ddpg.py:The code for the ddpg algorithm
gym_ddpg.py:Running ddpg on the environment
-
Notifications
You must be signed in to change notification settings - Fork 1
yhyu13/C51-DDPG
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DDPG)
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published