Skip to content
#

stable

Here are 43 public repositories matching this topic...

Experimental version of Stable Baslines3 which expands SB3 2.2.1 to be able to define a multi algorithm training. Usage will be based on defer actions, observation space and rewards between its inner algorithms (PPO, DQN, SAC...). It is thought for projects which may rely on different strategies for different actions with a focused training

  • Updated Jun 20, 2024
  • Python

Improve this page

Add a description, image, and links to the stable topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the stable topic, visit your repo's landing page and select "manage topics."

Learn more