
How to use temperature scaling for binary classification using sigmoid? #32

Open
evapachetti opened this issue Jun 14, 2022 · 1 comment


@evapachetti

My network is trained for binary classification, so the model outputs a single logit value, which I then convert into a probability by applying the sigmoid function. How can I modify the temperature scaling code to apply it to my network?

Thank you in advance.

@xchani

xchani commented Nov 23, 2022

I think the following approach should work:

  1. Replace the `CrossEntropyLoss` with `BCEWithLogitsLoss`
  2. Change the size of `self.temperature` to the number of classes

Sigmoid can be regarded as a special case of softmax where one of the logits is 0: $\frac{e^x}{e^0 + e^x}$. Then we only need to learn the temperature for each class.
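The two steps above can be sketched as follows. This is a minimal, hypothetical adaptation (class and method names are my own, not from the repo): it keeps a learnable temperature, optimizes it on held-out validation logits with LBFGS as the original code does, but swaps the criterion for `BCEWithLogitsLoss` since there is a single logit per example:

```python
import torch
import torch.nn as nn
import torch.optim as optim

class BinaryTemperatureScaler(nn.Module):
    """Hypothetical wrapper: learns a temperature for a binary
    classifier that outputs one logit per example."""

    def __init__(self):
        super().__init__()
        # With a single logit, one scalar temperature suffices.
        self.temperature = nn.Parameter(torch.ones(1))

    def forward(self, logits):
        # Scale the raw logits; apply sigmoid afterwards to get probabilities.
        return logits / self.temperature

    def fit(self, val_logits, val_labels, lr=0.01, max_iter=50):
        # Optimize the temperature on held-out validation data,
        # using BCEWithLogitsLoss in place of CrossEntropyLoss.
        criterion = nn.BCEWithLogitsLoss()
        optimizer = optim.LBFGS([self.temperature], lr=lr, max_iter=max_iter)

        def closure():
            optimizer.zero_grad()
            loss = criterion(self.forward(val_logits), val_labels)
            loss.backward()
            return loss

        optimizer.step(closure)
        return self.temperature.item()
```

Usage would look something like `t = scaler.fit(val_logits, val_labels)` followed by `probs = torch.sigmoid(scaler(test_logits))`; labels must be floats in {0, 1} for `BCEWithLogitsLoss`.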
