Skip to content

About value_norm in ppo #172

Answered by puyuan1996
t13m asked this question in Q&A
Jan 2, 2022 · 2 comments · 4 replies
Discussion options

You must be logged in to vote

First of all, thank you very much for your question.

The key insight of value normalization is that neural networks can more easily fit normalized data. Regarding the principle and experimental results of value normalization,

We use the normalizaed value in critic loss calculation and use the original unormalized value in advantage calc…

Replies: 2 comments 4 replies

Comment options

You must be logged in to vote
1 reply
@t13m
Comment options

Answer selected by PaParaZz1
Comment options

You must be logged in to vote
3 replies
@t13m
Comment options

@PaParaZz1
Comment options

@t13m
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
algo Add new algorithm or improve old one
3 participants