Commit
Adds fixed alpha version of Soft Actor Critic algorithm (#178)
* Fixes a bug where, in sac.py, self._alpha was not being re-computed after loading self._log_alpha from an optim_state_dict.
* Adds fixed_alpha option for SAC, which sets alpha to be a constant and does not adapt the alpha value.

Co-authored-by: jordan-schneider <jordan.jack.schneider@gmail.com>
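The two changes described above can be sketched roughly as follows. This is not the code from sac.py: the class name `TemperatureSketch`, the `update` method, the plain-float state, and the hand-written gradient step are all hypothetical, chosen so the example runs without a deep learning framework. It assumes the common SAC formulation in which the temperature loss is `-log_alpha * (mean_log_prob + target_entropy)`. Only `_alpha`, `_log_alpha`, and `fixed_alpha` are names taken from the commit message.

```python
import math


class TemperatureSketch:
    """Hypothetical stand-in for the alpha (temperature) handling in an SAC agent."""

    def __init__(self, init_alpha=1.0, fixed_alpha=False,
                 target_entropy=-1.0, lr=0.1):
        self.fixed_alpha = fixed_alpha
        self.target_entropy = target_entropy
        self.lr = lr
        self._log_alpha = math.log(init_alpha)
        # Cached value derived from _log_alpha; must be kept in sync with it.
        self._alpha = math.exp(self._log_alpha)

    def state_dict(self):
        return {"log_alpha": self._log_alpha}

    def load_state_dict(self, state):
        self._log_alpha = state["log_alpha"]
        # The kind of fix the first bullet describes: without this line,
        # _alpha would keep its pre-load value while _log_alpha changed.
        self._alpha = math.exp(self._log_alpha)

    def update(self, mean_log_prob):
        """One gradient step on log_alpha, skipped when alpha is fixed."""
        if self.fixed_alpha:
            return self._alpha  # constant temperature, no adaptation
        # d/d(log_alpha) of -log_alpha * (mean_log_prob + target_entropy)
        grad = -(mean_log_prob + self.target_entropy)
        self._log_alpha -= self.lr * grad
        self._alpha = math.exp(self._log_alpha)
        return self._alpha
```

With `fixed_alpha=True`, repeated calls to `update` return the initial alpha unchanged. With the default, alpha falls when the policy's entropy is above the target and rises when it is below.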
1 parent 59bc259, commit a9ac84f
Showing 1 changed file with 14 additions and 7 deletions.