stable_baselines.common.policies
Stable-baselines provides a set of default policies, that can be used with most action spaces. To customize the default policies, you can specify the policy_kwargs
parameter to the model class you use. Those kwargs are then passed to the policy on instantiation (see custom_policy
for an example). If you need more control on the policy architecture, you can also create a custom policy (see custom_policy
).
Note
CnnPolicies are for images only. MlpPolicies are made for other type of features (e.g. robot joints)
Warning
For all algorithms (except DDPG, TD3 and SAC), continuous actions are clipped during training and testing (to avoid out of bound error).
Available Policies
MlpPolicy MlpLstmPolicy MlpLnLstmPolicy CnnPolicy CnnLstmPolicy CnnLnLstmPolicy
BasePolicy
ActorCriticPolicy
FeedForwardPolicy
LstmPolicy
MlpPolicy
MlpLstmPolicy
MlpLnLstmPolicy
CnnPolicy
CnnLstmPolicy
CnnLnLstmPolicy