For both models of confusion, the preferred input is softmax probabilities (obtained by feeding the logits through a softmax). Alternatively, you can try logits for pairwise confusion, but the loss weight would then have to be scaled to a very small value to prevent oscillations, so we recommend operating on the softmax probabilities themselves.
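To make the recommendation concrete, here is a minimal PyTorch sketch of the two losses operating on softmax probabilities. The function names `pairwise_confusion` and `entropic_confusion` are illustrative, and the exact formulas (Euclidean distance between paired probability vectors, and the mean of p·log p) are assumptions based on the paper's description, not a copy of this repository's implementation.

```python
import torch

def pairwise_confusion(probs: torch.Tensor) -> torch.Tensor:
    # probs: (batch_size, num_classes) softmax probabilities.
    # Split the batch into two halves and penalize the Euclidean
    # distance between the paired probability vectors.
    batch_size = probs.size(0)
    if batch_size % 2 != 0:
        raise ValueError("Pairwise confusion requires an even batch size")
    half = batch_size // 2
    left, right = probs[:half], probs[half:]
    return torch.norm(left - right, p=2, dim=1).sum() / batch_size

def entropic_confusion(probs: torch.Tensor) -> torch.Tensor:
    # Mean of sum_c p_c * log(p_c) over the batch (negative entropy);
    # minimizing this pushes the predictions toward higher entropy.
    batch_size = probs.size(0)
    return (probs * probs.clamp_min(1e-12).log()).sum() / batch_size

# Usage: feed the model's logits through a softmax first, then add the
# confusion term to the classification loss with a small weight.
logits = torch.randn(8, 10)
probs = torch.softmax(logits, dim=1)
loss = pairwise_confusion(probs) + 0.1 * entropic_confusion(probs)
```

As the comment above notes, applying these directly to raw logits would typically require a much smaller loss weight to stay stable.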
In the paper,
So for PairwiseConfusion, are we using the logits, i.e., the direct output of a PyTorch model?
But for EntropicConfusion, we should obviously use softmax probabilities, which are obtained by feeding the logits through a softmax function.
Am I right?
Thank you