stable_baselines.common.policies

Policy Networks

Stable-baselines provides a set of default policies, that can be used with most action spaces. To customize the default policies, you can specify the policy_kwargs parameter to the model class you use. Those kwargs are then passed to the policy on instantiation (see custom_policy for an example). If you need more control on the policy architecture, you can also create a custom policy (see custom_policy).

Note

CnnPolicies are for images only. MlpPolicies are made for other type of features (e.g. robot joints)

Warning

For all algorithms (except DDPG, TD3 and SAC), continuous actions are clipped during training and testing (to avoid out of bound error).

Available Policies

MlpPolicy MlpLstmPolicy MlpLnLstmPolicy CnnPolicy CnnLstmPolicy CnnLnLstmPolicy

Base Classes

BasePolicy

ActorCriticPolicy

FeedForwardPolicy

LstmPolicy

MLP Policies

MlpPolicy

MlpLstmPolicy

MlpLnLstmPolicy

CNN Policies

CnnPolicy

CnnLstmPolicy

CnnLnLstmPolicy

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

policies.rst

policies.rst

Policy Networks

Base Classes

MLP Policies

CNN Policies

Files

policies.rst

Latest commit

History

policies.rst

File metadata and controls

Policy Networks

Base Classes

MLP Policies

CNN Policies