Merge pull request #168 from chainer/muupan-patch-1
Add PPO to README as an implemented algorithm
muupan committed Nov 13, 2017
2 parents 04e938e + 36110ce commit 6027634
Showing 1 changed file with 1 addition and 0 deletions.

README.md
@@ -52,6 +52,7 @@ The following algorithms have been implemented in ChainerRL:
- DDPG (Deep Deterministic Policy Gradients) (including SVG(0))
- PGT (Policy Gradient Theorem)
- PCL (Path Consistency Learning)
- PPO (Proximal Policy Optimization)

Q-function based algorithms such as DQN can utilize a Normalized Advantage Function (NAF) to tackle continuous-action problems, in addition to DQN-like discrete-output networks.
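
As a sketch of that point, the snippet below runs DQN with a NAF-style quadratic Q-function on a continuous-action task, following the pattern of chainerrl's train_dqn_gym example. The class name FCQuadraticStateQ and the exact keyword arguments reflect the library around this commit and should be treated as assumptions, not part of this change.

```python
import chainer
import gym
import chainerrl
from chainerrl import explorers, q_functions, replay_buffer

env = gym.make('Pendulum-v0')
obs_size = env.observation_space.low.size
action_size = env.action_space.low.size

# NAF parameterizes Q(s, a) = V(s) - (a - mu(s))^T P(s) (a - mu(s)),
# so the greedy action argmax_a Q(s, a) = mu(s) has a closed form
# even though the action space is continuous.
q_func = q_functions.FCQuadraticStateQ(
    obs_size, action_size,
    n_hidden_channels=100, n_hidden_layers=2,
    action_space=env.action_space)

opt = chainer.optimizers.Adam(eps=1e-2)
opt.setup(q_func)

# Plain DQN agent; the NAF structure lives entirely in the Q-function,
# with additive Gaussian noise for exploration in the continuous space.
agent = chainerrl.agents.DQN(
    q_func, opt,
    replay_buffer.ReplayBuffer(capacity=10 ** 5),
    gamma=0.99,
    explorer=explorers.AdditiveGaussian(scale=0.1),
    replay_start_size=1000)

obs = env.reset()
action = agent.act(obs)  # greedy continuous action, i.e. mu(obs)
```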

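Since this commit's sole change is listing PPO as implemented, a minimal usage sketch may help. It assumes chainerrl.agents.PPO consumes an A3C-style model exposing pi_and_v, as in the library's PPO example of the time; the model class below and the exact keyword arguments are illustrative assumptions, not code from this commit.

```python
import chainer
import chainer.functions as F
import chainer.links as L
import gym
import chainerrl
from chainerrl.agents import a3c


class PolicyValueModel(chainer.Chain, a3c.A3CModel):
    """Shared-body model exposing pi_and_v, the interface PPO expects."""

    def __init__(self, obs_size, n_actions, n_hidden=64):
        super().__init__()
        with self.init_scope():
            self.l1 = L.Linear(obs_size, n_hidden)
            self.pi = chainerrl.policies.SoftmaxPolicy(
                model=L.Linear(n_hidden, n_actions))
            self.v = L.Linear(n_hidden, 1)

    def pi_and_v(self, obs):
        h = F.tanh(self.l1(obs))
        return self.pi(h), self.v(h)


env = gym.make('CartPole-v0')
model = PolicyValueModel(env.observation_space.low.size, env.action_space.n)
opt = chainer.optimizers.Adam(alpha=3e-4)
opt.setup(model)

agent = chainerrl.agents.PPO(
    model, opt,
    gamma=0.99, lambd=0.95,   # discount factor and GAE lambda
    clip_eps=0.2,             # epsilon of the clipped surrogate objective
    update_interval=2048, minibatch_size=64, epochs=10)
```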
