A question about critic-loss in discrete sac? #85

Open
outshine-J opened this issue Oct 8, 2022 · 5 comments

Comments

@outshine-J

I applied the discrete SAC code to a custom environment with discrete actions. During training, the critic loss increased instead of decreasing, and it grew very large, even exceeding 200. What causes this, and how can I fix it? Thanks.

@fry404006308

fry404006308 commented Oct 8, 2022 via email

@outshine-J
Author

To add: the same thing happens even if I clip the reward.

@Mengyu-Messic

@outshine-J
Hello, I have encountered the same problem. Have you solved it?

@fry404006308

fry404006308 commented Nov 4, 2022 via email

@outshine-J
Author

outshine-J commented Nov 4, 2022

@Mengyu-Messic
You can find the answer in the linked comment: toshikwa/sac-discrete.pytorch#12 (comment). Alternatively, you can work around the problem by setting a fixed temperature.
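
For anyone reading later, here is a rough sketch of what a fixed temperature means for the critic target in discrete SAC. It assumes "temperature" refers to the entropy coefficient alpha, and it uses the usual soft value over all discrete actions with the minimum of two Q estimates. The function names and the default values (`alpha=0.2`, `gamma=0.99`) are illustrative assumptions, not taken from this repository or from the linked comment.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of action logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def soft_state_value(q1, q2, logits, alpha):
    """Soft value of a state: sum_a pi(a) * (min(Q1, Q2)(a) - alpha * log pi(a))."""
    probs = softmax(logits)
    return sum(
        p * (min(a, b) - alpha * math.log(p))
        for p, a, b in zip(probs, q1, q2)
        if p > 0.0  # skip actions whose probability underflowed to zero
    )


def critic_target(reward, done, next_q1, next_q2, next_logits,
                  alpha=0.2, gamma=0.99):
    """Bootstrapped target with alpha held constant (hypothetical defaults)."""
    v_next = soft_state_value(next_q1, next_q2, next_logits, alpha)
    return reward + gamma * (0.0 if done else 1.0) * v_next


def critic_loss(q_pred, target):
    """Squared error for one transition; average this over a batch."""
    return (q_pred - target) ** 2
```

Because alpha is a constant here, the entropy bonus in the target is bounded by alpha * log(number of actions). If alpha is learned instead and grows during training, that bonus grows with it, which is one possible way the targets and the critic loss can blow up.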
