Skip to content

Latest commit

 

History

History
66 lines (58 loc) · 1.75 KB

CHANGELOG.md

File metadata and controls

66 lines (58 loc) · 1.75 KB

Changes

v5.0.0 (XX 04, 2023)

Features 🔊

  • Gymnasium
  • DeepMind Control Suite wrapper
  • ELU activation
  • Orthogonal initializer
  • Optional state-action merging layer index (Critic model)

Bug fixes 🛠️

  • Optimized critic
  • Optimized server
  • backend.epsilon() from Keras backend

v4.1.1 (September 2, 2022)

Bug fixes 🛠️

  • update default config.yaml

v4.1.0 (February 9, 2022)

Features 🔊

  • .fit()
  • AgentCallback

v4.0.0 (February 5, 2022)

Features 🔊

  • Render environments to WanDB
  • Grouping of runs in WanDB
  • SampleToInsertRatio rate limiter
  • Global Gradient Clipping to avoid exploding gradients
  • Softplus for numerical stability
  • YAML configuration file
  • LogCosh instead of Huber loss
  • Critic network with Add layer applied on state & action branches
  • Custom uniform initializer
  • XLA (Accelerated Linear Algebra) compiler
  • Optimized Replay Buffer (google-deepmind/reverb#90)
  • split into Agent, Learner, Tester and Server

Bug fixes 🛠️

  • Fixed creating of saving path for models
  • Fixed model's summary()

v3.2.4 (July 7, 2021)

Features 🔊

  • Reverb
  • setup.py (package is available on PyPI)
  • split into Agent, Learner and Tester
  • Use custom model and layer for defining Actor-Critic
  • MultiCritic - concatenating multiple critic networks into one network
  • Truncated Quantile Critics

v2.0.2 (May 23, 2021)

Features 🔊

  • update Dockerfile
  • update README.md
  • formatted code by Black & Flake8

v2.0.1 (April 27, 2021)

Bug fixes 🛠️

  • fixed Critic model

v2.0.0 (April 22, 2021)

Features 🔊

  • Add Huber loss
  • In test mode, rendering to the video file
  • Normalized observation by Min-max method
  • Remove TD3 algorithm