Skip to content

Commit

Permalink
Merge pull request #516 from prabhatnagarajan/policy_docs
Browse files Browse the repository at this point in the history
Adds policies to the documentation
  • Loading branch information
muupan committed Aug 20, 2019
2 parents 8bee9da + c01ae56 commit b0f003e
Show file tree
Hide file tree
Showing 3 changed files with 39 additions and 0 deletions.
2 changes: 2 additions & 0 deletions chainerrl/policies/gaussian_policy.py
Expand Up @@ -31,8 +31,10 @@ class FCGaussianPolicy(chainer.ChainList, Policy):
Let y as the output of the mean layer.
If bound_mean=False:
mean = y (if bound_mean=False)
If bound_mean=True:
mean = min_action + tanh(y) * (max_action - min_action) / 2
The variance of the Gaussian is computed as follows:
Let y as the output of the variance layer.
variance = softplus(y) + min_var
Expand Down
36 changes: 36 additions & 0 deletions docs/policies.rst
@@ -0,0 +1,36 @@
========
Policies
========

Policy interfaces
=================

.. autoclass:: chainerrl.policy.Policy
:members:

.. automethod:: __call__

Policy implementations
======================

.. autoclass:: chainerrl.policies.ContinuousDeterministicPolicy

.. autoclass:: chainerrl.policies.FCDeterministicPolicy

.. autoclass:: chainerrl.policies.FCBNDeterministicPolicy

.. autoclass:: chainerrl.policies.FCLSTMDeterministicPolicy

.. autoclass:: chainerrl.policies.FCGaussianPolicy

.. autoclass:: chainerrl.policies.FCGaussianPolicyWithStateIndependentCovariance

.. autoclass:: chainerrl.policies.FCGaussianPolicyWithFixedCovariance

.. autoclass:: chainerrl.policies.GaussianHeadWithStateIndependentCovariance

.. autoclass:: chainerrl.policies.MellowmaxPolicy

.. autoclass:: chainerrl.policies.SoftmaxPolicy

.. autoclass:: chainerrl.policies.FCSoftmaxPolicy
1 change: 1 addition & 0 deletions docs/reference.rst
Expand Up @@ -11,5 +11,6 @@ API Reference
experiments
explorers
links
policies
recurrent
replay_buffers

0 comments on commit b0f003e

Please sign in to comment.