Skip to content

tehZevo/pset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Policy Search with Eligibility Traces

A finite difference-ish approach to policy gradients. It's like PGET, but exploring in parameter space instead of action space.

Why?

Because, why search action space and then perform gradient descent -- which requires an expensive gradient tape/graph -- when you can just search in parameter space instead?

(because it's easier to search in action space than it is to search in parameter space, but it's a method worth exploring regardless)

About

Policy Search with Eligibility Traces

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages