How to use custom loss? #58
That's a good question. Currently, you can either inherit the policy class (as you mentioned) or change the original framework's code to meet your expectations. It can be discussed further. Some existing frameworks (like RLlib) modularize the loss function part, but in my opinion this could be inconvenient for further development: since the loss function is highly customizable, abstracting it would double the code complexity.
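For the first option, here is a minimal sketch (assuming the on-policy `learn(batch, batch_size, repeat)` signature, which may differ across versions; `my_extra_term` is a placeholder, not an existing API):

```python
from tianshou.policy import PPOPolicy

class CustomLossPPO(PPOPolicy):
    """Sketch of inheriting the policy class to customize the loss."""

    def learn(self, batch, batch_size, repeat, **kwargs):
        # Copy the parent's learn() body here and add the extra term to
        # the total loss right before loss.backward(), e.g.:
        #   loss = clip_loss + vf_loss + ... + my_extra_term(batch)
        raise NotImplementedError("paste and adapt the original PPO loop")
```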
Ok, I got your point and I agree with you. But what about adding a dedicated hook? In this case, it is not a custom loss strictly speaking, but rather an additional component of the original loss function (a regularization term) that may depend on the actor, so it only consists of an extra function call before calling backward. I don't know if doing so is usual or not.
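Something like this, where `extra_loss_fn` is a hypothetical hook name (not an existing Tianshou API), just to illustrate the mechanism:

```python
import torch

def update_step(loss, optim, extra_loss_fn=None, **hook_inputs):
    # The framework would call the hook just before backward(), so the
    # extra term is a plain additive component of the original loss and
    # autograd differentiates through it automatically.
    if extra_loss_fn is not None:
        loss = loss + extra_loss_fn(**hook_inputs)
    optim.zero_grad()
    loss.backward()
    optim.step()
```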
@Trinkle23897 Up!
I have no time after #106 before this Friday... many things to do.
No problem! I can do it! But what do you think about the idea?
I think that add
@duburcqa It's a great idea to make it easier to use a customized loss. I was wondering if you have made any progress on that. Thanks!
The loss is an integral part of the algorithm, so maybe inheriting and overriding is better than allowing users to pass in custom losses. It's a central design question; I don't see it being necessary for the 1.0.0 release, but I would keep the issue open.
I would like to add the following extra term to the loss function,

$\| y_{pred} - y_{ref} \|_2^2$

where $y_{pred}$ is the action sampled by the distribution, and $y_{ref}$ can be computed by the actor.
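In plain PyTorch, the term itself would be something like this (a sketch; the function name is mine):

```python
import torch

def action_matching_term(y_pred: torch.Tensor, y_ref: torch.Tensor) -> torch.Tensor:
    # Squared L2 distance between sampled and reference actions, averaged
    # over the batch; autograd then provides the analytical gradient.
    return ((y_pred - y_ref) ** 2).sum(dim=-1).mean()
```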
What is the best way to do it using your framework? The point is to be able to take advantage of the analytical gradient computation.
The only way I can think of is to overwrite the whole `learn` method of the policy (i.e. the PPO algorithm), but it feels inconvenient just to add an extra line of code...

Thank you in advance,
Best,
Alexis