v2.8.0: Fix for `MaskablePPO` and markdown doc

@araffin araffin released this 01 Apr 11:30
· 4 commits to master since this release
50d8a8f

Breaking Changes:

  • Removed support for Python 3.9; please upgrade to Python >= 3.10
  • Upgraded to Stable-Baselines3 >= 2.8.0
  • Set strict=True for every call to zip(...)

New Features:

  • Added official support for Python 3.13

Bug Fixes:

  • Fixed inaccurate n_updates counting in MaskablePPO and RecurrentPPO when target_kl triggers an early exit from the training loop
  • Fixed forward and predict in RecurrentPPO and MaskablePPO not reshaping the action before clipping it (@immortal-boy)
  • Do not call forward() method directly in RecurrentPPO (@immortal-boy)
  • Fixed MaskableCategorical.apply_masking() crashing with ValueError: Simplex when the cached probs no longer sum exactly to 1 in float32, which can happen with large action spaces (torch 2.9+) (@kirann-05)
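The n_updates fix in the list above can be pictured with a toy loop. This is only a sketch under stated assumptions, not the library's training code: count_updates and kl_per_epoch are hypothetical names, and the 1.5 * target_kl threshold is assumed here for illustration. It contrasts adding n_epochs unconditionally with counting only the epochs the loop actually enters.

```python
def count_updates(kl_per_epoch, n_epochs, target_kl=None):
    """Return (naive_count, actual_count) for one toy training call.

    kl_per_epoch: hypothetical approximate KL value observed in each epoch.
    """
    actual = 0
    for epoch in range(n_epochs):
        # Count the epoch as soon as it is entered.
        actual += 1
        # Assumed early-stop rule: leave the loop once KL exceeds the threshold.
        if target_kl is not None and kl_per_epoch[epoch] > 1.5 * target_kl:
            break
    # Naive accounting adds n_epochs regardless of any early exit.
    naive = n_epochs
    return naive, actual


early = count_updates([0.01, 0.05, 0.01, 0.01], n_epochs=4, target_kl=0.02)
full = count_updates([0.01, 0.05, 0.01, 0.01], n_epochs=4)
print(early, full)
```

In this toy run the loop exits during the second epoch, so the naive count overstates the number of updates; without target_kl the two counts agree.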

Documentation:

  • Switched to markdown documentation (using MyST parser)

Full Changelog: v2.7.1...v2.8.0