GitHub - keithmgould/caffe2_cartpole_reinforce: Caffe2 Reinforcement Learning via Gradient Descent on Cartpole

Uses Caffe2 to implement reinforcement learning via gradient descent.

The problem solved is the OpenAI Gym CartPole problem.

Currently, the script handles only the forward pass; I haven't figured out how to do the backward pass in Caffe2.
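For reference, here is a minimal sketch of what the backward pass has to compute for REINFORCE, written in plain NumPy rather than Caffe2. It assumes a linear softmax policy (4 CartPole state features, 2 actions), which is not necessarily the network that reinforce.py builds. The function names (`discounted_returns`, `reinforce_loss`, `reinforce_gradient`) are made up for this illustration. A Caffe2 version would need to produce the same gradient from its own operators.

```python
import numpy as np


def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = np.zeros(len(rewards), dtype=np.float64)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def softmax(logits):
    """Row-wise softmax, shifted by the row max for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def reinforce_loss(weights, states, actions, returns):
    """L = -sum_t G_t * log pi(a_t | s_t) for a linear softmax policy (assumed)."""
    probs = softmax(states @ weights)
    chosen = probs[np.arange(len(actions)), actions]
    return -np.sum(returns * np.log(chosen))


def reinforce_gradient(weights, states, actions, returns):
    """Gradient of reinforce_loss with respect to weights."""
    probs = softmax(states @ weights)              # shape (T, n_actions)
    one_hot = np.eye(weights.shape[1])[actions]    # shape (T, n_actions)
    # d(-log pi(a|s)) / d(logits) = probs - one_hot, scaled by the return G_t
    dlogits = (probs - one_hot) * returns[:, None]
    return states.T @ dlogits                      # shape (n_features, n_actions)
```

One gradient descent step on this loss is then `weights -= learning_rate * reinforce_gradient(weights, states, actions, returns)`, where `states` and `actions` are the observations and sampled actions recorded during the forward pass of one episode.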

This script is inspired by this PyTorch example.

