PPO and AWR

Algorithm 1: Proximal Policy Optimization

Algorithm 2: Advantage-Weighted Regression

The reinforcement learning algorithms are written in C++ and use Box2D (a 2D physics engine) and Caffe (a deep learning framework).

  • platform: Android
  • tool: CIDE3
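
For readers new to the two algorithms named above, here is a minimal sketch of their core formulas in plain C++. It is not taken from this repository: the function names (`ppo_clipped_objective`, `awr_weights`, `clamp_value`) and the default values (`clip_eps = 0.2`, `beta = 1.0`, `w_max = 20.0`) are assumptions chosen for illustration, and the sketch does not use Box2D or Caffe.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: restrict x to the interval [lo, hi].
inline double clamp_value(double x, double lo, double hi) {
    return std::max(lo, std::min(x, hi));
}

// PPO clipped surrogate objective (to be maximized), averaged over samples.
// ratio[i] = pi_new(a_i | s_i) / pi_old(a_i | s_i)
// adv[i]   = advantage estimate for sample i
// Assumes ratio and adv have the same length.
double ppo_clipped_objective(const std::vector<double>& ratio,
                             const std::vector<double>& adv,
                             double clip_eps = 0.2) {
    if (ratio.empty()) return 0.0;
    double sum = 0.0;
    for (std::size_t i = 0; i < ratio.size(); ++i) {
        const double clipped =
            clamp_value(ratio[i], 1.0 - clip_eps, 1.0 + clip_eps);
        // Take the more pessimistic of the unclipped and clipped terms.
        sum += std::min(ratio[i] * adv[i], clipped * adv[i]);
    }
    return sum / static_cast<double>(ratio.size());
}

// AWR regression weights: w_i = min(exp(adv_i / beta), w_max).
// beta is the temperature; w_max caps the weight to avoid overflow.
std::vector<double> awr_weights(const std::vector<double>& adv,
                                double beta = 1.0,
                                double w_max = 20.0) {
    std::vector<double> w;
    w.reserve(adv.size());
    for (double a : adv) {
        w.push_back(std::min(std::exp(a / beta), w_max));
    }
    return w;
}
```

In a full training loop, the PPO objective would be maximized with respect to the policy network's parameters, and the AWR weights would scale the per-sample log-likelihood loss of the policy. How this repository wires those steps into Caffe is not shown here.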

Demo: a car running in Box2D