
qihongl/demo-random-search-RL


demo-random-search-RL

A toy demo based on the following paper:

Mania, H., Guy, A., & Recht, B. (2018). Simple random search provides a competitive approach to reinforcement learning. Retrieved from http://arxiv.org/abs/1803.07055

Method:

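This demo implements basic random search (Algorithm 1 in Mania et al., 2018). The method is essentially a finite-difference scheme: it perturbs the policy parameters in random directions, compares the rewards obtained with the positive and negative perturbations, and moves the parameters along the directions that increased the reward.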
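To make the update concrete, here is a minimal sketch of basic random search in plain Python. It is not the code from this repository: the function name `basic_random_search`, the stand-in objective `toy_reward`, and all hyperparameter values are assumptions chosen for illustration. In the demo itself, the reward would come from rolling out the policy on the grid world rather than from a closed-form function.

```python
import random


def basic_random_search(reward_fn, theta, n_iters=200, n_dirs=8,
                        step_size=0.05, noise_std=0.1, seed=0):
    """Sketch of basic random search (finite-difference ascent on reward_fn).

    Each iteration applies
        theta <- theta + step_size / n_dirs
                 * sum_k [r(theta + nu * d_k) - r(theta - nu * d_k)] * d_k
    where d_k are standard normal directions and nu is noise_std.
    """
    rng = random.Random(seed)
    theta = list(theta)
    dim = len(theta)
    for _ in range(n_iters):
        update = [0.0] * dim
        for _ in range(n_dirs):
            # Sample one random search direction.
            delta = [rng.gauss(0.0, 1.0) for _ in range(dim)]
            plus = [t + noise_std * d for t, d in zip(theta, delta)]
            minus = [t - noise_std * d for t, d in zip(theta, delta)]
            # Reward difference between the two antithetic perturbations.
            diff = reward_fn(plus) - reward_fn(minus)
            for i in range(dim):
                update[i] += diff * delta[i]
        # Step along the reward-weighted average of the directions.
        theta = [t + step_size / n_dirs * u for t, u in zip(theta, update)]
    return theta


def toy_reward(theta, target=(1.0, -2.0, 0.5)):
    """Hypothetical objective: negative squared distance to a fixed target."""
    return -sum((t - g) ** 2 for t, g in zip(theta, target))


theta0 = [0.0, 0.0, 0.0]
theta_star = basic_random_search(toy_reward, theta0)
```

Because the update only needs reward evaluations, `reward_fn` can be any black-box function of the parameters, for example the total return of one episode. No gradient of the policy or the environment is required.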

Results:

Here's the learning curve on a 5x5 grid world, where the agent is trained to find the goal while avoiding the punishment.

Here's a sample path on the grid world.

  • red dot: reward
  • black dot: punishment

About

A minimal implementation of the random search algorithm for reinforcement learning.
