-
Notifications
You must be signed in to change notification settings - Fork 406
Closed
Labels
enhancementNew feature or requestNew feature or request
Description
Motivation
It would nice to add a new option to compute the clipped value loss as used in OpenAI's Baselines for PPO:
Currently, the PPOLoss().loss_critic
method calls the torchrl.objectives.utils.distance_loss
function which supports the "l1"
, "l2"
and "smooth_l1"
loss functions. Perhaps this new clipped value loss function could be implemented as a new loss type within distance_loss
.
The clipping fraction is also commonly reported as a metric by OpenAI's Baselines and this could be useful to report from PPOLoss
for the clipped value loss, as well as for ClipPPOLoss
for the loss_objective
loss term.
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request