Is the get_kl() function correct? #13

zzzxxxttt · 2018-09-18T13:42:03Z

Thanks for your great code!
I noticed that in get_kl(), the policy net produces the mean, log_std, and std. The function then copies these three values and computes the KL divergence between the originals and the copies, which is obviously always zero. Is this a bug or intended behavior?
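
No answer is recorded in this thread. As a minimal sketch of the situation described above, assuming scalar Gaussians and hypothetical helper names (`gaussian_kl`, `kl_to_fixed_copy`, and `second_derivative` are illustrative only, not the repository's code), the following shows that the KL value between a distribution and its copy is zero, while the curvature of that KL with respect to the live parameter can still be nonzero:

```python
import math


def gaussian_kl(mean0, log_std0, mean1, log_std1):
    """KL( N(mean0, std0^2) || N(mean1, std1^2) ) for scalar Gaussians."""
    std0 = math.exp(log_std0)
    std1 = math.exp(log_std1)
    return (
        log_std1 - log_std0
        + (std0 ** 2 + (mean0 - mean1) ** 2) / (2.0 * std1 ** 2)
        - 0.5
    )


def kl_to_fixed_copy(mean, fixed_mean=1.0, log_std=0.0):
    """KL from a frozen copy (fixed_mean) to the live parameter (mean).

    Hypothetical stand-in for the pattern in the question: the copy is
    treated as a constant, and only `mean` is allowed to vary.
    """
    return gaussian_kl(fixed_mean, log_std, mean, log_std)


def second_derivative(f, x, h=1e-3):
    """Central finite-difference estimate of f''(x)."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)


# Evaluate at the point where the live parameter equals its copy.
kl_value = kl_to_fixed_copy(1.0)
kl_curvature = second_derivative(kl_to_fixed_copy, 1.0)

print("KL value at the copied parameters:", kl_value)
print("Estimated second derivative there:", kl_curvature)
```

Whether get_kl() relies on this curvature rather than on the value itself cannot be determined from the thread; the sketch only shows that a zero value does not by itself imply zero derivatives.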

zzzxxxttt closed this as completed Sep 18, 2018
