
Script to use classifier to predict against test #641

Merged
merged 1 commit into from Jul 27, 2018

Conversation

@nlothian

@sebastianruder

Hi Nick, thanks! This is great! I'm just on the way back from a conference. Will test the code once I'm back.

@Varal7

Varal7 commented Jul 26, 2018

Good idea!
I think you can just use argmax instead of softmax for inference.

@nlothian
Author

@Varal7 this is true.

There's a whole set of questions on the forum along the lines of "what are these numbers that come back as classifier scores" and "why don't they add up to 1". The answer is always softmax, and my thought was to include it to avoid all that confusion.
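A minimal sketch of the point in PyTorch (the logit values below are made up, not output from the real model): argmax of the raw scores picks the same class either way, but softmax makes the scores readable as probabilities.

```python
import torch
import torch.nn.functional as F

logits = torch.tensor([-0.74961, 0.71659])  # raw classifier scores

probs = F.softmax(logits, dim=0)  # ~tensor([0.1875, 0.8125]), sums to 1
pred = probs.argmax()             # tensor(1); softmax is monotonic, so
                                  # logits.argmax() gives the same class
```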

@jph00 jph00 merged commit 75e8ae0 into fastai:master Jul 27, 2018
@linbojin

linbojin commented Sep 13, 2018

I think this code doesn't prepend any padding index, so the result will be different from the one computed by constructing a DataLoader.

For my own test:

encoded text: [3, 4, 5, 6, 303, 10, 11, 2, 73, 24, 72, 27, 15, 32, 46, 22, 16, 2, 25, 7]
result: [-0.74961, 0.71659]

encoded text from the DataLoader (prepended with the padding index 1):
[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 3, 4, 5, 6, 303, 10, 11, 2, 73, 24, 72, 27, 15, 32, 46, 22, 16, 2, 25, 7]
result: [0.31941, -0.15691]

@nlothian @sebastianruder @jph00 Any idea?
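A minimal sketch of what's likely happening, assuming the encoder is an RNN that doesn't mask padding (the module sizes and names here are illustrative, not the actual fastai model). The pad embedding is a zero vector, but the RNN still steps through the pad positions and updates its hidden state, so the amount of front-padding changes the final state and hence the logits:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
emb = nn.Embedding(400, 20, padding_idx=1)  # pad index 1, as in the batch above
rnn = nn.GRU(20, 30, batch_first=True)

ids = torch.tensor([[3, 4, 5, 6, 303]])
padded = torch.cat([torch.ones(1, 35, dtype=torch.long), ids], dim=1)

_, h_plain = rnn(emb(ids))
_, h_padded = rnn(emb(padded))

# The pad embeddings are zeros, but the RNN still consumes them,
# so the two final hidden states differ:
print(torch.allclose(h_plain, h_padded))  # False
```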

@nlothian
Author

You could be right.

I remember looking into the padding issue a bit, but when I was testing I don't think I found any difference. There's no difference in behavior here between the scripts and the notebook, is there?

I don't have the code to retest that, but if you are seeing a difference then you are likely correct. I'm sure a patch would be welcomed!

@linbojin

The problem is that I can't determine how many 1s I should pad with, and different amounts of padding give different results. It's confusing.

@sebastianruder

Yeah, this is quite weird. I didn't look into this before, as I thought we were already ignoring padding using pack_padded_sequence (see here for an explanation).
In the code, the padding is done here, and the number of padding tokens is the difference between the current document's length and the length of the largest document in the batch (see here).
As a short-term measure for padding individual examples at test time, you could thus use something like the difference to the average document length in the training data.
In the mid-term, we should use pack_padded_sequence and pad_packed_sequence as described here.
cc @jph00 @sgugger
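A minimal sketch of that pattern, with the caveat that pack_padded_sequence expects padding at the end of each sequence, whereas the batches discussed above are padded at the front; the sizes and names here are illustrative, not the fastai implementation:

```python
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence

PAD_IDX = 1  # the padding token index used above

emb = nn.Embedding(1000, 50, padding_idx=PAD_IDX)
rnn = nn.LSTM(50, 100, batch_first=True)

# Two documents of different lengths, padded (at the end) to the batch maximum.
batch = torch.tensor([
    [3, 4, 5, 6, 303],
    [3, 4, 5, PAD_IDX, PAD_IDX],
])
lengths = torch.tensor([5, 3])  # true lengths, sorted in descending order

packed = pack_padded_sequence(emb(batch), lengths, batch_first=True)
packed_out, (h_n, c_n) = rnn(packed)

# The RNN skips the padding positions entirely, so each document's final
# hidden state no longer depends on how much padding the batch required.
out, out_lengths = pad_packed_sequence(packed_out, batch_first=True)
```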

@ranihorev

I created a PR to fix this issue.
#882 (comment)

@nlothian
Author

Just bumping this because I think it's pretty important. @ranihorev's proposed changes broke a lot of the language model, so there should be a better way.

@sebastianruder's suggested short-term fix of padding by the difference to the average document length in the training data is difficult too, because that value isn't actually recorded anywhere at the moment.

I follow the pack_padded_sequence post mentioned above up to trick 3, but I need to think about that some more.

What is the approach in FastAI 1.0 here? Is it fixed somehow?
