```python
def normalize_reward(self, reward: np.ndarray) -> np.ndarray:
    """
    Normalize rewards using this VecNormalize's rewards statistics.
    Calling this method does not update statistics.
    """
    if self.norm_reward:
        reward = np.clip(reward / np.sqrt(self.ret_rms.var + self.epsilon), -self.clip_reward, self.clip_reward)
    return reward
```
I wonder why we do not subtract the mean in `normalize_reward`? I have run some experiments, and they indeed show that subtracting the mean reduces performance, but could you tell me why exactly? Is there any research or explanation validating this?
stable-baselines3/stable_baselines3/common/vec_env/vec_normalize.py
Line 177 in 237223f