Soft actor-critic Yet another SAC implementation, for continuous and discrete action spaces. Reference implementation: https://github.com/rail-berkeley/softlearning