PPO and AWR

Algorithm 1: Proximal Policy Optimization
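
As a rough illustration of the core of this algorithm, the clipped surrogate objective can be sketched in plain C++ as below. This is only a minimal sketch and not this project's actual code: the function name `ppo_clipped_objective`, its parameters, and the default `clip_eps = 0.2` are assumptions made for the example, and it works on precomputed log-probabilities and advantages rather than on Caffe blobs.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of this project): batch-averaged PPO
// clipped surrogate objective, to be maximized.
//   logp_new   : log pi_theta(a|s) under the current policy
//   logp_old   : log pi_theta_old(a|s) under the policy that collected the data
//   advantages : advantage estimates for the same samples
//   clip_eps   : clipping range epsilon (0.2 is an assumed default)
double ppo_clipped_objective(const std::vector<double>& logp_new,
                             const std::vector<double>& logp_old,
                             const std::vector<double>& advantages,
                             double clip_eps = 0.2) {
    assert(logp_new.size() == logp_old.size());
    assert(logp_new.size() == advantages.size());
    if (logp_new.empty()) return 0.0;

    double sum = 0.0;
    for (std::size_t i = 0; i < logp_new.size(); ++i) {
        // Probability ratio r = pi_theta(a|s) / pi_theta_old(a|s).
        const double ratio = std::exp(logp_new[i] - logp_old[i]);
        // Ratio restricted to [1 - eps, 1 + eps].
        const double clipped =
            std::max(1.0 - clip_eps, std::min(ratio, 1.0 + clip_eps));
        // Pessimistic (lower) bound of the unclipped and clipped terms.
        sum += std::min(ratio * advantages[i], clipped * advantages[i]);
    }
    return sum / static_cast<double>(logp_new.size());
}
```

In a training loop, the negative of this value would typically be used as the policy loss for each minibatch.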

Algorithm 2: Advantage-Weighted Regression
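
As a rough illustration of the policy step of this algorithm, the exponentiated-advantage weights and the weighted log-likelihood loss can be sketched in plain C++ as below. This is only a minimal sketch and not this project's actual code: the names `awr_weights` and `awr_policy_loss`, the temperature default `beta = 1.0`, and the weight cap `max_weight = 20.0` are assumptions made for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of this project): per-sample AWR weights
//   w_i = min(exp((R_i - V(s_i)) / beta), max_weight)
// beta is the temperature; max_weight caps the weights for numerical
// stability (both defaults are assumed values).
std::vector<double> awr_weights(const std::vector<double>& returns,
                                const std::vector<double>& values,
                                double beta = 1.0,
                                double max_weight = 20.0) {
    assert(returns.size() == values.size());
    assert(beta > 0.0);
    std::vector<double> weights(returns.size());
    for (std::size_t i = 0; i < returns.size(); ++i) {
        const double advantage = returns[i] - values[i];
        weights[i] = std::min(std::exp(advantage / beta), max_weight);
    }
    return weights;
}

// Hypothetical helper: policy loss as the negative weighted log-likelihood,
// i.e. -mean(w_i * log pi(a_i|s_i)). Minimizing it regresses the policy
// toward actions with high advantage.
double awr_policy_loss(const std::vector<double>& log_probs,
                       const std::vector<double>& weights) {
    assert(log_probs.size() == weights.size());
    if (log_probs.empty()) return 0.0;
    double sum = 0.0;
    for (std::size_t i = 0; i < log_probs.size(); ++i) {
        sum += weights[i] * log_probs[i];
    }
    return -sum / static_cast<double>(log_probs.size());
}
```

The value function would be fitted separately by regression onto the observed returns; that step is omitted here.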

The reinforcement learning algorithms are written in C++, using Box2D for the physics simulation and Caffe for the neural networks.

platform: Android
tool: CIDE3