This is an MXNet implementation of A3C as described in "Asynchronous Methods for Deep Reinforcement Learning".
Dependencies:
- OpenAI Gym
- MXNet
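If you don't have them yet, the Python dependencies can usually be installed with pip (assuming the standard package names):
pip install gym mxnet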
The Flappy Bird game source comes from "Using Deep Q-Network to Learn How To Play Flappy Bird".
If you don't want to run Flappy Bird, you can ignore it.
To run an experiment:
python a3c.py --game-source=flappybird --num-threads=16 --save-model-prefix=a3c-flappybird --save-every=1000
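For reference, here is a minimal sketch (plain Python, not code from a3c.py; the function name and arguments are illustrative assumptions) of the n-step return and advantage computation at the heart of A3C as described in the paper:

def nstep_advantages(rewards, values, bootstrap, gamma=0.99):
    # rewards/values: per-step data from one worker's rollout;
    # bootstrap: the critic's estimate V(s_{t+n}), or 0 if the episode ended.
    R = bootstrap
    advantages = []
    for r, v in zip(reversed(rewards), reversed(values)):
        R = r + gamma * R          # discounted n-step return
        advantages.append(R - v)   # advantage estimate A = R - V(s)
    advantages.reverse()
    return advantages

# Example: a 3-step rollout.
print(nstep_advantages([1.0, 0.0, 1.0], [0.5, 0.4, 0.6], bootstrap=0.0))

Each worker thread uses these advantages to scale its policy-gradient update and the returns as regression targets for the value head.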
To evaluate, I have uploaded one of my checkpoints; you can also try your own parameters:
python a3c.py --test --model-prefix=a3ce-8 --load-epoch=305000 --game-source=flappybird
If you train on a machine without GPUs, change "devs = gpu(1)" to "devs = cpu()".
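Alternatively, a small guard like the sketch below can make the switch automatic (this assumes MXNet's standard context API; the helper name is made up, and your GPU index may differ from the gpu(1) used in the script):

import mxnet as mx

def pick_device():
    # Try a tiny allocation on the GPU and force evaluation with asnumpy();
    # MXNet raises MXNetError when no usable GPU/CUDA is present.
    try:
        mx.nd.zeros((1,), ctx=mx.gpu(0)).asnumpy()
        return mx.gpu(0)
    except mx.base.MXNetError:
        return mx.cpu()

devs = pick_device()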