Minor PPO documentation clarification (#192)

* minor documentation clarification * fix flake8
cpnota · Jan 14, 2021 · 21ed3b3 · 21ed3b3
1 parent 235598f
commit 21ed3b3
Showing 1 changed file with 2 additions and 1 deletion.
diff --git a/all/presets/continuous/ppo.py b/all/presets/continuous/ppo.py
@@ -52,7 +52,8 @@ class PPOContinuousPreset(Preset):
         eps (float): Stability parameters for the Adam optimizer.
         entropy_loss_scaling (float): Coefficient for the entropy term in the total loss.
         value_loss_scaling (float): Coefficient for the value function loss.
-        clip_grad (float): The maximum magnitude of the gradient for any given parameter. Set to 0 to disable.
+        clip_grad (float): Clips the gradient during training so that its L2 norm (calculated over all parameters) 
+        # is no greater than this bound. Set to 0 to disable.
         clip_initial (float): Value for epsilon in the clipped PPO objective function at the beginning of training.
         clip_final (float): Value for epsilon in the clipped PPO objective function at the end of training.
         epochs (int): Number of times to iterature through each batch.