Skip to content

siemanko/a3c

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A3C implementation in Python

THIS IS STILL WORK IN PROGRESS - IT DOES NOT WORK YET!

Lessons learned

  1. Hogwild - turns out that to run hogwild in Tensorflow one needs to create separate tensorflow graphs with shared variables. Example in hogwild/mnist.py
  2. Parallel simulators with OpenAI gym - turns out that running more than 4 threads with Atari simulator in parallel is not supported. Check out parallel_simulator/thread_test.py. On second try, I forced gym to be run on a separate process, this way the bug does not occur (parallel_simulator/process_test.py)
  3. To make sure that the preprocessing makes sense, I visualize it in preproc/preproc.py. It behaves a bit weird for Pong, but should be possible to recover policy from it.

About

Asynchronous Advantage Actor Critic

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages