for discrete env #21

Closed
ccplxx opened this issue Nov 9, 2018 · 3 comments

Comments

@ccplxx

ccplxx commented Nov 9, 2018

I just read the DIAYN paper, and I can't understand how to train DIAYN in an env with discrete actions, because SAC is for envs with continuous actions. But in the paper, some experiments are based on mountain car and inverted pendulum. Thank you.

@haarnoja
Owner

haarnoja commented Nov 9, 2018

I'm not too familiar with the DIAYN implementation; maybe @ben-eysenbach can help.

@ccplxx
Author

ccplxx commented Nov 10, 2018

Thank you, haarnoja. Can SAC be used for envs with discrete actions? If it can, how?

@haarnoja
Owner

Yeah, you can use SAC with discrete actions too, but this implementation does not support them. You would need to replace the policy with a softmax distribution over the Q-values, \pi(\cdot | s) \propto \exp Q(s, \cdot), which you can compute exactly for a finite action space.
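
For illustration, here is a minimal sketch of what such a softmax policy could look like for a single state, in plain Python. It is not part of this repository. The function names (`softmax_policy`, `soft_value`, `sample_action`) and the `temperature` parameter are assumptions made up for this example. With `temperature=1.0` the policy reduces to \pi(\cdot | s) \propto \exp Q(s, \cdot) as written above. The Q-values are assumed to come from some learned Q-function with one output per discrete action.

```python
import math
import random


def softmax_policy(q_values, temperature=1.0):
    """Return action probabilities proportional to exp(Q(s, a) / temperature).

    `q_values` is a list with one Q-value per discrete action (hypothetical
    input; in practice it would come from a learned Q-function).
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [q / temperature for q in q_values]
    # Subtract the maximum before exponentiating for numerical stability.
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def soft_value(q_values, temperature=1.0):
    """Soft state value: temperature * log sum_a exp(Q(s, a) / temperature)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [q / temperature for q in q_values]
    m = max(scaled)
    return temperature * (m + math.log(sum(math.exp(x - m) for x in scaled)))


def sample_action(q_values, temperature=1.0, rng=random):
    """Sample a discrete action index from the softmax policy."""
    probs = softmax_policy(q_values, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


if __name__ == "__main__":
    q = [1.0, 2.0, 0.5]  # made-up Q-values for a 3-action state
    print(softmax_policy(q))
    print(soft_value(q))
    print(sample_action(q))
```

Because the action space is finite, both the normalizer and the soft value are exact sums over actions, so no reparameterized sampling from the policy is needed for these quantities.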
