Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ranking is not done correctly #20

Closed
adam-katona opened this issue Dec 12, 2018 · 1 comment
Closed

ranking is not done correctly #20

adam-katona opened this issue Dec 12, 2018 · 1 comment

Comments

@adam-katona
Copy link

In ES, compute_ranks() does an argsort, which will give different ranks to individuals with the same fitness.
This introduces a noise in the gradient estimate. This is not a big issue since the expected value of the noise is zero, but it can slow down convergence.
This is really only a problem in environments, where rewards are sparse, so a lot of individuals will have the same fitness.
Solution: Average ranks for individuals with equal fitness.

@adam-katona
Copy link
Author

I just realized, that this might be intentional, so there is some extra exploration in flat regions of the fitness function.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant