We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hi, thanks for your PG implementation. I found it's difficult to understand this episode code:
#train model def log_prob(self, policy_param, acs): if self.is_discrete: logits = policy_param log_prob = tf.keras.losses.sparse_categorical_crossentropy(\ y_true = acs, y_pred = logits, from_logits = True)
I think the log_prob function will just return tf.math.log(policy_param) and I do not understand why you calculate crossentropy loss here? Thank you.
tf.math.log(policy_param)
The text was updated successfully, but these errors were encountered:
No branches or pull requests
Hi, thanks for your PG implementation.
I found it's difficult to understand this episode code:
I think the log_prob function will just return
tf.math.log(policy_param)
and I do not understand why you calculate crossentropy loss here? Thank you.The text was updated successfully, but these errors were encountered: