
Loss function that takes a data dependent 3rd input #21

Closed · prabhuteja12 opened this issue Feb 18, 2019 · 12 comments

@prabhuteja12
Is there any way to implement the functionality described in this post? https://stackoverflow.com/questions/46464549/keras-custom-loss-function-accessing-current-input-pattern

Gist: implementing something like the code below (Keras), where the loss has access to the model's input tensor:

    from keras import backend as K

    def custom_loss_wrapper(input_tensor):
        def custom_loss(y_true, y_pred):
            return K.binary_crossentropy(y_true, y_pred) + K.mean(input_tensor)
        return custom_loss

This can then be called with loss=custom_loss_wrapper(model.input).

@prabhuteja12 (Author)

@freud14 Just tagging you because you made the last commit on the repo. Please loop in anyone else who might be a better fit to answer this question.

@freud14 (Collaborator) commented Mar 4, 2019

Hi,
Sorry for the delay, I didn't get a notification from your initial post. The way to do this in PyToune is to simply return the input (or the input-dependent quantity) as part of the output of your module. Then, in your loss function, you compute your loss and add the input-dependent quantity to it.

    import torch.nn as nn
    import torch.nn.functional as F

    def custom_loss(output, y_true):
        # The module's output is a tuple; unpack the prediction and the
        # input-dependent quantity.
        y_pred, input_tensor = output
        return F.binary_cross_entropy(y_pred, y_true) + input_tensor.mean()

    class MyPyTorchModule(nn.Module):
        def forward(self, input):
            ...
            return y_pred, input_tensor  # or y_pred, input_tensor.mean() in your example

Thank you.

Frédérik
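
A minimal sketch, not from the thread, of how this wires into PyToune's Model wrapper; the tiny network and random data below are hypothetical placeholders standing in for MyPyTorchModule:

    # Hypothetical example: custom_loss is the function defined above.
    import torch
    import torch.nn as nn
    from pytoune.framework import Model

    class TinyModule(nn.Module):
        def __init__(self):
            super().__init__()
            self.fc = nn.Linear(10, 1)

        def forward(self, input):
            y_pred = torch.sigmoid(self.fc(input))
            return y_pred, input  # the loss receives this whole tuple

    network = TinyModule()
    model = Model(network, torch.optim.Adam(network.parameters()), custom_loss)

    x = torch.rand(32, 10)
    y = torch.randint(2, (32, 1)).float()
    loss = model.train_on_batch(x, y)  # custom_loss gets (y_pred, x) and y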

@prabhuteja12 (Author)

Awesome! Definitely felt there would be a simple solution to this. Thank you :)

@prabhuteja12 (Author)

Oops! Looks like I closed it by accident.

I'm looking at a case that is slightly different from the examples above. I have a Dataset that outputs 3 tensors (say x, y, mask), and I'm trying to compute the loss with something like this:

    F.cross_entropy(input=x, target=y, reduction='none')[mask]

Right now, I'm implementing it as

    local_model.loss_function = partial(loss, ignore_mask=mask)
    local_model.train_on_batch(x.to(device), y.long().to(device))

Is there a better way to do this?

@freud14 (Collaborator) commented Mar 4, 2019

I'm not sure I fully understand your case, but something like this should do it:

    import torch.nn.functional as F

    def custom_loss(y_pred, input):
        y_true, mask = input
        # Select the masked entries and reduce to a scalar for backprop.
        return F.cross_entropy(input=y_pred, target=y_true, reduction='none')[mask].mean()

This assumes your dataset outputs a tuple like (x, (y, mask)). You will also need to implement the collate_fn of the DataLoader.

By the way, if you did local_model.to(device), you don't need to call .to(device) on your input tensors; PyToune will do it for you.

@prabhuteja12 (Author)

The output of the dataset is a tuple like (x, y, mask), not in the structure (x, (y, mask)). Would I have to write a Lambda dataset for that?

Also, would PyToune be able to infer the device from the original PyTorch model?

@freud14 (Collaborator) commented Mar 4, 2019

> The output of the dataset is a tuple like (x, y, mask), not in the structure (x, (y, mask)). Would I have to write a Lambda dataset for that?

Yes. More specifically, you have to pass a function for the collate_fn parameter of the DataLoader that transforms your (x, y, mask) tuples into an (x, (y, mask)) batch.

> Also, would PyToune be able to infer the device from the original PyTorch model?

Yes, if you've called local_model.to(device) beforehand.
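
Concretely, a small sketch of this behavior (reusing the hypothetical TinyModule, custom_loss, x, and y from the earlier sketch; assumes PyToune's Model.to(device) method):

    import torch
    from pytoune.framework import Model

    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

    network = TinyModule()
    model = Model(network, torch.optim.SGD(network.parameters(), lr=0.01), custom_loss)
    model.to(device)  # moves the network; incoming batches are moved automatically too

    loss = model.train_on_batch(x, y)  # no manual x.to(device) / y.to(device) needed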

@freud14 (Collaborator) commented Mar 5, 2019

Here is a code skeleton for what I mean:

    import torch

    class ClientDataset(torch.utils.data.Dataset):
        def __getitem__(self, index):
            ...
            return x, y, mask

    def my_collate_function(samples):
        # Transform the list of (x, y, mask) samples into one batched
        # tensor per input.
        x, y, mask = (torch.stack(seq) for seq in zip(*samples))
        return x, (y, mask)

    loader = torch.utils.data.DataLoader(my_dataset_instance, collate_fn=my_collate_function)
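
Putting the pieces together, a hedged end-to-end sketch (assumes PyToune's fit_generator method; my_dataset_instance, my_collate_function, and the masked custom_loss are from the comments above, and network is a hypothetical module producing y_pred):

    import torch
    from pytoune.framework import Model

    train_loader = torch.utils.data.DataLoader(
        my_dataset_instance, batch_size=32, collate_fn=my_collate_function)

    model = Model(network, torch.optim.Adam(network.parameters()), custom_loss)
    model.to(torch.device('cuda' if torch.cuda.is_available() else 'cpu'))

    # Each batch arrives as (x, (y, mask)); PyToune feeds x to the network
    # and passes (y, mask) as the second argument of custom_loss.
    model.fit_generator(train_loader, epochs=10)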

@prabhuteja12 (Author)

Hi,

Thank you for the suggestion :) I had something quite similar too.
By the way, do you have any profiling information on how much overhead PyToune adds over plain PyTorch?

@freud14 (Collaborator) commented Mar 6, 2019

Hi,
I haven't done any profiling, but it shouldn't add any observable overhead, since it only does the things you would do anyway in your own training loop.
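
For anyone who wants to verify this, a rough, hypothetical timing sketch (not from the thread) comparing PyToune's train_on_batch with a hand-written step on a toy network:

    import time
    import torch
    import torch.nn as nn
    import torch.nn.functional as F
    from pytoune.framework import Model

    net = nn.Linear(10, 2)
    opt = torch.optim.SGD(net.parameters(), lr=0.01)
    x, y = torch.rand(64, 10), torch.randint(2, (64,))

    def manual_step():
        # The usual zero_grad / forward / backward / step cycle.
        opt.zero_grad()
        loss = F.cross_entropy(net(x), y)
        loss.backward()
        opt.step()

    model = Model(net, opt, F.cross_entropy)

    for name, step in [('manual', manual_step),
                       ('pytoune', lambda: model.train_on_batch(x, y))]:
        start = time.perf_counter()
        for _ in range(1000):
            step()
        print(name, time.perf_counter() - start)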

@freud14 (Collaborator) commented Mar 12, 2019

Hi, have I answered all your questions?

@prabhuteja12 (Author)

Hi @freud14,
I was just about to reply when I saw your message. Yes! Thanks for clearing up all of my questions.
I'll close this now.
