zencoding/ACER

 
 

ACER

MIT License

An implementation of actor-critic with experience replay (ACER) [1]. The agent also receives the previous action and reward as inputs [2]. Batch off-policy updates are used to improve stability.
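To illustrate what feeding the previous action and reward to the agent can look like, here is a minimal sketch. It assumes a discrete action space, a flat observation vector, and a one-hot encoding of the previous action; the function name `augment_observation` and its parameters are hypothetical, and the repository may wire these inputs in differently.

```python
from typing import List, Sequence


def augment_observation(
    observation: Sequence[float],
    prev_action: int,
    prev_reward: float,
    num_actions: int,
) -> List[float]:
    """Append a one-hot previous action and the previous reward to an observation.

    Hypothetical helper for illustration only; not taken from the repository.
    """
    if not 0 <= prev_action < num_actions:
        raise ValueError("prev_action must be in [0, num_actions)")
    # One-hot encode the previous discrete action (an assumed encoding).
    one_hot = [0.0] * num_actions
    one_hot[prev_action] = 1.0
    # Network input = raw observation + previous action + previous reward.
    return list(observation) + one_hot + [float(prev_reward)]
```

With a 2-dimensional observation and 3 actions, the augmented input has 2 + 3 + 1 = 6 entries. At the first step of an episode there is no previous action or reward, so some convention is needed there, for example a zero reward and a fixed placeholder action.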

Run with python main.py <options>. To run asynchronous advantage actor-critic (A3C) [3] (but with a Q-value head), use the --on-policy option.
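As a rough sketch of how a boolean switch such as `--on-policy` could select between the two modes described above, the snippet below uses the standard `argparse` module. Only the flag name comes from this README; `build_parser`, `training_mode`, and the returned labels are hypothetical, and the actual `main.py` may define and use its options differently.

```python
import argparse
from typing import List


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="ACER")
    # Hypothetical declaration: only the flag name --on-policy is from the README.
    parser.add_argument(
        "--on-policy",
        action="store_true",
        help="Train on-policy (A3C-style, with a Q-value head)",
    )
    return parser


def training_mode(argv: List[str]) -> str:
    """Return a label for the mode selected by the command-line arguments."""
    args = build_parser().parse_args(argv)
    # argparse exposes --on-policy as the attribute on_policy.
    return "on-policy (A3C-style)" if args.on_policy else "off-policy (ACER)"
```

Under these assumptions, `python main.py` with no flag would use the off-policy ACER updates, while `python main.py --on-policy` would switch to the A3C-style mode.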

Requirements

Acknowledgements

References

[1] Sample Efficient Actor-Critic with Experience Replay
[2] Learning to Navigate in Complex Environments
[3] Asynchronous Methods for Deep Reinforcement Learning
