Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient.
arXiv link
Yufei Wang*, Tianwei Ni*. In 7th ICML AutoML workshop, 2020.
(* indicates equal contribution)
PyTorch implementation of Meta-SAC and baselines.
- PyTorch 1.4+
- OpenAI Gym
- Mujoco
- Meta-SAC:
meta_sac
directory - SAC-v1, SAC-v2:
sac
directory - TD3:
TD3
directory