Application of different Reinforcement Learning algorithms on the Atari game Pong.
-
A2C is a synchronous variant of the A3C algorithm (https://arxiv.org/pdf/1708.05144)
$ python -m run_reinforce_fc
$ python -m run_reinforce_lstm
$ python -m run_a2c
$ python -m run_ppo_fc