Describe the bug
The default value of normalize_advantage in ClipPPOLoss and KLPENPPOLoss is set to True even though the documentation states that it should be False.
The base class PPOLoss uses the correct default value.
Expected behavior
Change the default value to False to match the documentation and the base class.
Checklist