Entropy-Regularized-RL

A3C with Proper Entropy Bonuses (also GAE)

Run it with:

python A3C.py
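As a rough illustration of the two ingredients named above, here is a minimal sketch of an entropy bonus folded into the reward, followed by GAE. It is not taken from A3C.py. The function names, the temperature `tau`, and the choice to add the entropy to the reward (rather than only to the policy loss) are assumptions made for this example.

```python
import math


def categorical_entropy(probs):
    """Shannon entropy (in nats) of a discrete action distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def entropy_augmented_rewards(rewards, action_probs, tau=0.01):
    """Add tau * H(pi(.|s_t)) to each reward; tau is a hypothetical temperature."""
    return [r + tau * categorical_entropy(p) for r, p in zip(rewards, action_probs)]


def gae_advantages(rewards, values, bootstrap_value, gamma=0.99, lam=0.95):
    """Generalized Advantage Estimation over one rollout.

    rewards[t] and values[t] belong to step t; bootstrap_value is V(s_T)
    for the state after the last step (0.0 if the episode terminated).
    """
    advantages = [0.0] * len(rewards)
    next_value = bootstrap_value
    running = 0.0
    for t in reversed(range(len(rewards))):
        # One-step TD error, then exponentially weighted sum of later errors.
        delta = rewards[t] + gamma * next_value - values[t]
        running = delta + gamma * lam * running
        advantages[t] = running
        next_value = values[t]
    return advantages


if __name__ == "__main__":
    probs = [[0.5, 0.5], [0.9, 0.1], [0.25, 0.75]]
    rewards = entropy_augmented_rewards([1.0, 0.0, 1.0], probs, tau=0.01)
    print(gae_advantages(rewards, [0.5, 0.4, 0.3], bootstrap_value=0.0))
```

With `lam=0` the estimator reduces to one-step TD errors, and with `lam=1` it reduces to the discounted return minus the value baseline.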

Soft Q-Learning

The Gaussian kernel is from Haarnoja. This version is for Atari and still has some problems, which I will fix soon.
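For readers unfamiliar with the kernel mentioned above, the following is a minimal pure-Python sketch of an isotropic Gaussian kernel with a median-heuristic bandwidth, the kind typically used for the SVGD update in soft Q-learning. It is not a copy of gaussian_kernel.py. The function name, the `min_bandwidth` floor, and the exact bandwidth rule are assumptions for illustration.

```python
import math
from statistics import median


def gaussian_kernel(xs, ys, min_bandwidth=1e-6):
    """Return (kappa, grad) for two sets of particles.

    xs, ys: lists of points, each a list of floats of the same dimension.
    kappa[i][j] = exp(-||xs[i] - ys[j]||^2 / h)
    grad[i][j]  = d kappa[i][j] / d xs[i]  (a list with one entry per dimension)
    """
    sq_dists = [[sum((a - b) ** 2 for a, b in zip(x, y)) for y in ys] for x in xs]

    # Median heuristic (assumed here): h = median squared distance / log(n + 1).
    flat = [d for row in sq_dists for d in row]
    h = max(median(flat) / math.log(len(xs) + 1), min_bandwidth)

    kappa = [[math.exp(-d / h) for d in row] for row in sq_dists]
    grad = [
        [[-2.0 * (a - b) / h * kappa[i][j] for a, b in zip(x, y)]
         for j, y in enumerate(ys)]
        for i, x in enumerate(xs)
    ]
    return kappa, grad


if __name__ == "__main__":
    particles = [[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]]
    kappa, grad = gaussian_kernel(particles, particles)
    print(kappa[0])
```

The gradient term is what pushes particles apart in SVGD, so it is returned together with the kernel values.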

Soft Actor-Critic

After reading the paper, I think SAC can be made almost like A3C. Because of the entropy term, it did not converge faster than A3C in my experiments.

Run it with:

python sac.py

sac_new.py is DDPG-style, with a fixed alpha. Just for fun.
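To make the role of a fixed alpha concrete, here is a minimal sketch of the entropy-regularized Q target commonly used in SAC. It is not taken from sac.py or sac_new.py. The function name, the twin-critic minimum, and the default values of `alpha` and `gamma` are assumptions for illustration.

```python
def soft_q_target(reward, done, next_q1, next_q2, next_log_prob,
                  alpha=0.2, gamma=0.99):
    """Entropy-regularized TD target for one transition.

    next_q1, next_q2: target-critic estimates for (s', a'), with a' sampled
                      from the current policy.
    next_log_prob:    log pi(a' | s').
    alpha:            fixed entropy temperature (not learned in this sketch).
    """
    # Soft state value: pessimistic Q estimate plus the entropy bonus.
    soft_value = min(next_q1, next_q2) - alpha * next_log_prob
    # No bootstrapping past a terminal state.
    return reward + gamma * (1.0 - float(done)) * soft_value


if __name__ == "__main__":
    print(soft_q_target(1.0, False, 2.0, 3.0, -1.0, alpha=0.5, gamma=0.9))
```

A larger alpha weights the entropy term more heavily, which encourages exploration but can slow convergence.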

Papers

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

Equivalence Between Policy Gradients and Soft Q-Learning

Soft Actor-Critic Algorithms and Applications

LihaoR/Entropy-Regularized-RL
