Stein Variational Policy Gradient(SVPG)

Tensorflow Implementation of SVPG

Implementation is on Tensorflow r1.3

"Policy gradient methods have been successfully applied to many complex reinforcement learning problems. However, policy gradient methods suffer from high variance, slow convergence, and inefficient exploration. In this work, we introduce a maximum entropy policy optimization framework which explicitly encourages parameter exploration, and show that this framework can be reduced to a Bayesian inference problem." From Paper

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
svpg_cont_action		svpg_cont_action
svpg_ddpg		svpg_ddpg
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Stein Variational Policy Gradient(SVPG)

About

Releases

Packages

Languages

License

jsikyoon/svpg_tensorflow

Folders and files

Latest commit

History

Repository files navigation

Stein Variational Policy Gradient(SVPG)

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages