-
Notifications
You must be signed in to change notification settings - Fork 341
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About the action space #4
Comments
For DDPG, the action space is For PolicyGradient, the action space is still So I implemented method for test Thank you very much. |
@Ceruleanacg Your explanation is very detailed, thank you very much. So you define an action as an operation (buy, sell or hold) on a stock. In |
In the method So for PolicyGradient that uses |
I am sorry I did not read the code carefully. I have the last two questions:
|
For question 1, if the For question 2, actually we will get local optimum, you can also set it true if you want, but, how to say, i found PolicyGradient performs very bad if I set it true :) If you have further questions, you can add my WeChat 17392810723, we could learn more from each other. |
Alright, thank you very much 👍 |
I think the action space should be
3 ** self.codes_count
rather thanself.codes_count * 3
?The text was updated successfully, but these errors were encountered: