Skip to content

Commit

Permalink
Update ppo2 documentation (#686)
Browse files Browse the repository at this point in the history
* Update ppo2.rst

* Updated contributor list in changelog.rst

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
  • Loading branch information
kvenkman and araffin committed Apr 4, 2020
1 parent 950c2a5 commit 31d9c83
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 3 deletions.
3 changes: 2 additions & 1 deletion docs/misc/changelog.rst
Original file line number Diff line number Diff line change
Expand Up @@ -33,6 +33,7 @@ Others:
Documentation:
^^^^^^^^^^^^^^
- Fix typos (@caburu)
- Fix typos in PPO2 (@kvenkman)


Release 2.10.0 (2020-03-11)
Expand Down Expand Up @@ -688,4 +689,4 @@ Thanks to @bjmuld @iambenzo @iandanforth @r7vme @brendenpetersen @huvar @abhiskk
@XMaster96 @kantneel @Pastafarianist @GerardMaggiolino @PatrickWalter214 @yutingsz @sc420 @Aaahh @billtubbs
@Miffyli @dwiel @miguelrass @qxcv @jaberkow @eavelardev @ruifeng96150 @pedrohbtp @srivatsankrishnan @evilsocket
@MarvineGothic @jdossgollin @SyllogismRXS @rusu24edward @jbulow @Antymon @seheevic @justinkterry @edbeeching
@flodorner @KuKuXia @NeoExtended @solliet @mmcenta @richardwu @tirafesi @caburu @johannes-dornheim
@flodorner @KuKuXia @NeoExtended @solliet @mmcenta @richardwu @tirafesi @caburu @johannes-dornheim @kvenkman
4 changes: 2 additions & 2 deletions docs/modules/ppo2.rst
Original file line number Diff line number Diff line change
Expand Up @@ -8,8 +8,8 @@ PPO2
The `Proximal Policy Optimization <https://arxiv.org/abs/1707.06347>`_ algorithm combines ideas from A2C (having multiple workers)
and TRPO (it uses a trust region to improve the actor).

The main idea is that after an update, the new policy should be not too far form the old policy.
For that, ppo uses clipping to avoid too large update.
The main idea is that after an update, the new policy should be not too far from the old policy.
For that, PPO uses clipping to avoid too large update.

.. note::

Expand Down

0 comments on commit 31d9c83

Please sign in to comment.