Authors: Furrer Stanislas, Luis Carvahlo
Date : 15.05.2019
In this project we trained an agent to play Pong from the PyGame Learning Environment (PLE). We compare two policy gradient approaches on this task: Actor-Critic and Advantage Actor-Critic (A2C).
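
To make the "advantage" in A2C concrete, here is a minimal sketch in plain Python. It is not taken from the project's code: the function names `discounted_returns` and `advantages`, the discount factor `gamma`, and the use of Monte Carlo returns as the target are all assumptions made for illustration. In one common formulation, the policy gradient is weighted by the advantage A_t = G_t - V(s_t), that is, how much better the observed return was than the critic's estimate, rather than by the raw return or value.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return the discounted return G_t for every time step of one episode.

    `gamma` is an assumed discount factor, not a value from the project.
    """
    returns = []
    running = 0.0
    # Walk the episode backwards so each step reuses the return of the next one.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(returns, values):
    """Advantage estimate A_t = G_t - V(s_t), with the critic's values as baseline."""
    return [g - v for g, v in zip(returns, values)]


if __name__ == "__main__":
    # Hypothetical episode: the agent scores a point on the last step.
    rewards = [0.0, 0.0, 1.0]
    critic_values = [0.25, 0.25, 0.25]  # made-up critic predictions V(s_t)
    g = discounted_returns(rewards, gamma=0.5)
    print(g)
    print(advantages(g, critic_values))
```

Subtracting the critic's value as a baseline leaves the expected gradient unchanged but typically reduces its variance, which is the usual motivation for using the advantage.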