Uses Caffe2 to implement reinforcement learning via gradient descent.
The problem solved is the openAI Gym Cartpole problem.
Currently the script manages the forward pass, but I don't get how to do the backward pass within the context of Caffe2.
This script is inspired by this Pytorch example