nnf_nll_loss - ignore_index #53

jwijffels · 2020-06-16T15:20:00Z

Would be great if nnf_nll_loss would have a default value for ignore_index
https://github.com/mlverse/torch/blob/master/R/nnf-loss.R#L343

dfalbel · 2020-06-16T15:37:05Z

Yeah should be -100 follwing the pytorch impl:

def nll_loss(input, target, weight=None, size_average=None, ignore_index=-100,
             reduce=None, reduction='mean'):

jwijffels · 2020-06-16T15:39:22Z

Yes. I tested that also but it said boom on my Windows machine

jwijffels · 2020-06-16T18:54:46Z

What I meant to say is that this crashes my session at the call of cpp_torch_namespace_nll_loss_self_Tensor_target_Tensor

library(torch)
m = nn_log_softmax(dim=1)
input = torch_randn(3, 5, requires_grad=TRUE)
target = torch_tensor(c(1L, 0L, 4L))
input = m(input)
output = nnf_nll_loss(input, target, ignore_index=-100L)
output

while it should be calling https://github.com/mlverse/torch/blob/master/src/lantern/lantern.h#L1649

dfalbel · 2020-06-16T20:09:59Z

ok, I'll take a look ASAP

dfalbel · 2020-06-16T20:21:59Z

This works for me if I do:

target = torch_tensor(c(1L, 0L, 4L), dtype = torch_long())

I could consider making torch_long() the default dtype when converting from R integers to torch tensors. We did something similar for R doubles that are converted to Tensors with dtype = torch_float(). What do you think?

jwijffels · 2020-06-17T07:03:19Z

Indeed, works with long instead of int. Don't know enough about the C API of lantern/libtorch to give advice. I don't mind specifyng that it is a long. Don't know currently if this impacts speed of anything.

dfalbel added the nn Related to nn API label Jun 16, 2020

dfalbel closed this as completed in 219526e Jun 17, 2020

dfalbel mentioned this issue Jun 17, 2020

Consider making torch_long() the default integer type #57

Closed

emauryg mentioned this issue May 17, 2021

Error converting between cuda tenstor to cpu #562

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nnf_nll_loss - ignore_index #53

nnf_nll_loss - ignore_index #53

jwijffels commented Jun 16, 2020

dfalbel commented Jun 16, 2020

jwijffels commented Jun 16, 2020

jwijffels commented Jun 16, 2020 •

edited

Loading

dfalbel commented Jun 16, 2020

dfalbel commented Jun 16, 2020 •

edited

Loading

jwijffels commented Jun 17, 2020

nnf_nll_loss - ignore_index #53

nnf_nll_loss - ignore_index #53

Comments

jwijffels commented Jun 16, 2020

dfalbel commented Jun 16, 2020

jwijffels commented Jun 16, 2020

jwijffels commented Jun 16, 2020 • edited Loading

dfalbel commented Jun 16, 2020

dfalbel commented Jun 16, 2020 • edited Loading

jwijffels commented Jun 17, 2020

jwijffels commented Jun 16, 2020 •

edited

Loading

dfalbel commented Jun 16, 2020 •

edited

Loading