New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
for discrete env #21
Comments
I'm not too familiar with the DIAYN implementation, maybe @ben-eysenbach can help. |
Thank you, haarnoja. can SAC for discrete actions env? if it can, how? |
Yeah you can use SAC with discrete actions too, but this implementation does not support them. You would need to replace the policy with softmax distribution \pi(.,s) \propto \exp Q(s,.), which you can compute exactly for finite action space. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I read the paper DIAYN just now, and can't understand how to train the DIAYN in an env with discrete actions, because SAC is for continuous env. But in the paper, some experiments are based on mountain car and inverted pendulum. Thank you
The text was updated successfully, but these errors were encountered: