Divide by zero #1
Comments
I have run into this kind of difficulty in every sparse-reward setting I have tried. Is ARS a good way to go for these optimization landscapes?
Can we use a |
In my case, adding 1e-8 to the divisor did the trick...
Yeah @pedronahum, that would do it too!
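For anyone landing here later, a minimal sketch of the 1e-8 fix mentioned above. It assumes the divisor in question is the standard deviation of the rollout rewards, as in the ARS update rule; the thread does not confirm which line of the repository fails. The function `ars_step` and its parameters are hypothetical names used for illustration, not the repository's API.

```python
import numpy as np

EPS = 1e-8  # the small constant suggested in the comment above


def ars_step(r_pos, r_neg, deltas, step_size=0.02, eps=EPS):
    """Hypothetical ARS-style update: reward-weighted sum of perturbations,
    scaled by the standard deviation of the collected rewards."""
    r_pos = np.asarray(r_pos, dtype=np.float64)
    r_neg = np.asarray(r_neg, dtype=np.float64)
    deltas = np.asarray(deltas, dtype=np.float64)

    # Standard deviation over all rewards; exactly 0 when every rollout
    # returns the same reward (e.g. all zeros in a sparse-reward task).
    sigma_r = np.concatenate([r_pos, r_neg]).std()

    # Sum over directions of (r+ - r-) * delta.
    weighted = ((r_pos - r_neg)[:, None] * deltas).sum(axis=0)

    # Adding eps keeps the divisor strictly positive.
    return step_size / (len(r_pos) * (sigma_r + eps)) * weighted


# Early-training case from the issue: zero reward in every rollout.
deltas = np.random.default_rng(0).standard_normal((4, 3))
step = ars_step(np.zeros(4), np.zeros(4), deltas)
```

With all-zero rewards the weighted sum is zero as well, so the guarded update is a zero step rather than NaN. Passing `eps=0.0` in the same call reproduces the division by zero.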
Hi,
First and foremost, thanks for sharing the code. This is greatly appreciated.
I am currently testing ARS in other learning environments and found that, in very difficult environments, users of the code might hit a divide-by-zero error, particularly in the early stages of the learning process (i.e., zero reward in all the initial rollouts).
Thanks,