A3C implementation in Python

THIS IS STILL WORK IN PROGRESS - IT DOES NOT WORK YET!

Hogwild - turns out that to run hogwild in Tensorflow one needs to create separate tensorflow graphs with shared variables. Example in hogwild/mnist.py
Parallel simulators with OpenAI gym - turns out that running more than 4 threads with Atari simulator in parallel is not supported. Check out parallel_simulator/thread_test.py. On second try, I forced gym to be run on a separate process, this way the bug does not occur (parallel_simulator/process_test.py)
To make sure that the preprocessing makes sense, I visualize it in preproc/preproc.py. It behaves a bit weird for Pong, but should be possible to recover policy from it.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
hogwild		hogwild
parallel_simulator		parallel_simulator
preproc		preproc
the_thing		the_thing
.gitignore		.gitignore
README.md		README.md

Provide feedback