Skip to content

PPOSIL, fix continuous actions and PPO

Compare
Choose a tag to compare
@kengz kengz released this 08 Aug 06:19
· 2403 commits to master since this release
fb617ae

This release adds PPOSIL, fixes some small issues with continuous actions, and PPO ratio computation.

Implementations

#145 Implement PPOSIL. Improve debug logging
#143 add Arch installer thanks to @angel-ayala

Bug Fixes

#138 kill hanging processes of Electron for plotting
#145 fix PPO wrong graph update sequence causing ratio to be 1. Fix continuous action output construction. add guards.
#146 fix continuous actions and add full tests