ranking is not done correctly #20

adam-katona · 2018-12-12T14:26:41Z

In ES, compute_ranks() uses an argsort, which assigns different ranks to individuals that have the same fitness.
This introduces noise into the gradient estimate. It is not a big issue, since the expected value of the noise is zero, but it can slow down convergence.
It is really only a problem in environments where rewards are sparse, so that many individuals end up with the same fitness.
Solution: average the ranks of individuals with equal fitness.
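
To make the proposal concrete, here is a minimal, dependency-free sketch of both ranking schemes. The function names argsort_ranks and averaged_ranks are made up for this illustration and are not taken from the repository; the first only mimics what an argsort-based ranking does, the second shows one way the averaging could look.

```python
def argsort_ranks(fitness):
    """Ranks 0..n-1 from an argsort. Tied individuals still get distinct
    ranks, decided only by their position in the list."""
    order = sorted(range(len(fitness)), key=lambda i: fitness[i])
    ranks = [0] * len(fitness)
    for rank, idx in enumerate(order):
        ranks[idx] = rank
    return ranks


def averaged_ranks(fitness):
    """Ranks 0..n-1 where individuals with equal fitness share the mean
    of the ranks they would otherwise occupy."""
    order = sorted(range(len(fitness)), key=lambda i: fitness[i])
    ranks = [0.0] * len(fitness)
    start = 0
    while start < len(order):
        end = start
        # Extend the group while the next individual has the same fitness.
        while (end + 1 < len(order)
               and fitness[order[end + 1]] == fitness[order[start]]):
            end += 1
        mean_rank = (start + end) / 2.0
        for pos in range(start, end + 1):
            ranks[order[pos]] = mean_rank
        start = end + 1
    return ranks


# Sparse-reward example: three individuals scored 0, one scored 1.
fitness = [0.0, 0.0, 1.0, 0.0]
print(argsort_ranks(fitness))   # [0, 1, 3, 2]
print(averaged_ranks(fitness))  # [1.0, 1.0, 3.0, 1.0]
```

With argsort, the three individuals with fitness 0 get ranks 0, 1 and 2 purely because of their order in the list; with averaging they all get rank 1, and the sum of the ranks stays the same. If SciPy is available, scipy.stats.rankdata with method='average' computes the same kind of tie-averaged ranks, starting from 1 instead of 0.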

adam-katona · 2018-12-20T04:30:34Z

I just realized that this might be intentional, so that there is some extra exploration in flat regions of the fitness function.

adam-katona closed this as completed Dec 20, 2018
