Skip to content
/ ACER_tf Public

Implementation for ACER in tensorflow and sonnet by deepmind

Notifications You must be signed in to change notification settings

hercky/ACER_tf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Implementation of ACER (Actor-Critic with Experience replay)

Contains the tensorflow and sonnet implementation for SAMPLE EFFICIENT ACTOR-CRITIC WITH EXPERIENCE REPLAY by Ziyu Wang, Victor Bapst et al from Deepmind (https://arxiv.org/abs/1611.01224).

The current version is tested only for MuJoCo gym environments,.

Major dependencies

Running

python train.py --model_dir ./tmp_model/ --env InvertedPendulum-v1 --eval_every_sec 60 --num_agents 4

See python train.py --help for a full list of options.

You can monitor training progress in Tensorboard:

tensorboard --logdir=/tmp_model/

Components

  • train.py contains the main method to start training.
  • agent.py contains the code for the agent threads and actual ACER algortihm
  • advantage_net.py contains code for building the stochasitic dueling net
  • policy_net.py contains code for building the policy network
  • memory.py contains the memory class for experience replay

References

About

Implementation for ACER in tensorflow and sonnet by deepmind

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages