Mask is always zero in Margin-based Contrastive Loss #5

OrigamiDream · 2022-09-26T19:07:38Z

Reference: https://github.com/wzhouad/Contra-OOD/blob/main/model.py#L40

I have a question about the implementation of the Margin-based Contrastive Loss.

mask = (labels.unsqueeze(1) == labels.unsqueeze(0)).float()

If the batch size is 64, the labels variable has a shape of [64].
When the code above runs, ([64, 1] == [1, 64]).float() → [64, 64], which is exactly a 2D diagonal matrix.

mask = mask - torch.diag(torch.diag(mask))

But the problem is in the second line of code.
When torch.diag(mask) runs, the result has shape [64] and is a vector of ones: $[1, 1, 1, ...]$
Therefore, the result of torch.diag(torch.diag(mask)) is exactly the same as mask, which is exactly a 2D diagonal matrix.
Furthermore, if you subtract that result from mask, mask always ends up as a zero-filled matrix.
As a result, the mask variable contributes nothing to gradient descent.
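
The reasoning above can be reproduced on a small batch. This is a minimal sketch, assuming a hypothetical batch of four samples whose labels are all distinct (the thread does not give actual label values); under that assumption the mask is the identity matrix and the subtraction leaves only zeros:

```python
import torch

# Hypothetical batch of four samples; every label is distinct (an assumption).
labels = torch.tensor([0, 1, 2, 3])

# [4, 1] == [1, 4] broadcasts to a [4, 4] pairwise-equality matrix.
mask = (labels.unsqueeze(1) == labels.unsqueeze(0)).float()

# torch.diag on a 2D tensor extracts its diagonal as a 1D tensor;
# torch.diag on a 1D tensor builds a 2D diagonal matrix from it.
diagonal = torch.diag(mask)
diagonal_matrix = torch.diag(diagonal)

# Remove the self-pairs (i == i) from the mask.
mask_without_self = mask - diagonal_matrix

print(mask.shape)                      # shape of the pairwise matrix
print(diagonal)                        # a sample always matches itself
print(mask_without_self.sum().item())  # number of remaining positive pairs
```

Note that the all-zero result here depends on the distinct-labels assumption, not on the two lines of code themselves.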

Is this really intentional?

I thought the mask variable was used to distinguish $P(i)$ from $N(i)$ in the equation.
Is this right? Or am I missing something?

OrigamiDream · 2022-09-30T22:21:45Z

That was my mistake, problem solved.
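
The thread does not record what the mistake was. One plausible reading is that the pairwise-equality matrix is only diagonal when every label in the batch is unique; as soon as two samples share a label, off-diagonal entries are one and survive the diagonal subtraction. A minimal sketch with hypothetical labels (chosen for illustration, not taken from the repository):

```python
import torch

# Hypothetical batch in which some labels repeat (an assumption).
labels = torch.tensor([0, 1, 0, 2, 1])

# Pairwise equality: entry (i, j) is 1.0 when samples i and j share a label.
mask = (labels.unsqueeze(1) == labels.unsqueeze(0)).float()

# Subtracting the diagonal matrix removes only the self-pairs (i == i).
mask = mask - torch.diag(torch.diag(mask))

# Each row now counts the other samples that share that row's label.
positives_per_sample = mask.sum(dim=1)

print(mask)
print(positives_per_sample)
```

Here samples 0 and 2 share a label, as do samples 1 and 4, so the mask keeps those four off-diagonal entries, while sample 3 has no positive partner in the batch.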

OrigamiDream closed this as completed Sep 30, 2022
