3D embedding tensor #565

Closed
celsofranssa opened this issue Jan 3, 2023 · 7 comments
Labels: enhancement (New feature or request)


@celsofranssa

Hello,
I'm getting the ValueError: embeddings must be a 2D tensor of shape (batch_size, embedding_size) message because my embeddings are indeed a 3D tensor. However, I've provided my own distance function, so is there a workaround for this shape check?
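For illustration, a minimal repro of the check being hit with 3D embeddings; the shapes below are arbitrary and the stock NTXentLoss stands in for whatever loss is actually being used:

import torch
from pytorch_metric_learning import losses

loss_fn = losses.NTXentLoss()
embeddings = torch.randn(4, 16, 64)  # 3D: (batch_size, seq_len, hidden_size)
labels = torch.arange(4)
loss = loss_fn(embeddings, labels)   # raises the ValueError quoted above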

@KevinMusgrave
Owner

Interesting. For now I think this is the best workaround:

from pytorch_metric_learning.utils import common_functions as c_f
# Monkey-patch the library's shape check so 3D embeddings are allowed through
c_f.check_shapes = lambda x, y: None

@KevinMusgrave added the enhancement (New feature or request) label on Jan 3, 2023
@celsofranssa
Author

Thank you, @KevinMusgrave. I am using the NTXentLoss as shown in the following code snippet:

import torch.nn as nn
from pytorch_metric_learning import losses

# RelevanceMiner and MaxSimDistance are custom classes defined elsewhere in my project.
class RelevanceLoss(nn.Module):

    def __init__(self, params):
        super(RelevanceLoss, self).__init__()
        self.miner = RelevanceMiner(params.miner)
        # Let Q and D be two batches of matrices (MxN); MaxSimDistance returns a score matrix S,
        # where S[i,j] represents the score between Q[i] and D[j]
        self.criterion = losses.NTXentLoss(temperature=params.criterion.temperature, distance=MaxSimDistance())

    def forward(self, query_idx, query_rpr, doc_idx, doc_rpr):
        # query_rpr and doc_rpr are tensors of shape (batch_size, sequence_length, hidden_size)
        miner_outs = self.miner.mine(text_ids=query_idx, label_ids=doc_idx)
        return self.criterion(query_rpr, None, miner_outs, doc_rpr, None)

Even after looking through the NTXentLoss code, I couldn't figure out the best place to overwrite check_shapes.
Could you help me a little more?
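For context, a minimal sketch of what a MaxSim-style distance built on BaseDistance could look like; MaxSimDistance is the user's own class and is not shown in this thread, so everything below is an assumption rather than the actual implementation:

import torch
from pytorch_metric_learning.distances import BaseDistance

class MaxSimDistance(BaseDistance):
    def __init__(self, **kwargs):
        # is_inverted=True: larger scores mean "more similar".
        # normalize_embeddings=False: the library's built-in normalization assumes
        # 2D inputs, so token embeddings are assumed to be normalized beforehand.
        super().__init__(is_inverted=True, normalize_embeddings=False, **kwargs)

    def compute_mat(self, query_emb, ref_emb):
        # query_emb: (Q, Lq, H), ref_emb: (D, Ld, H)
        sim = torch.einsum("qlh,dmh->qdlm", query_emb, ref_emb)  # token-level scores (Q, D, Lq, Ld)
        # MaxSim: best-matching ref token for each query token, summed over query tokens
        return sim.max(dim=-1).values.sum(dim=-1)

    def pairwise_distance(self, query_emb, ref_emb):
        # Element-wise scores for aligned pairs: (N, Lq, H) x (N, Ld, H) -> (N,)
        sim = torch.einsum("nlh,nmh->nlm", query_emb, ref_emb)
        return sim.max(dim=-1).values.sum(dim=-1)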

@KevinMusgrave
Owner

It should work as long as you overwrite it before you use the loss function. See this notebook for an example: https://colab.research.google.com/drive/1jHw9dlFBCAd46CKSaAL4O7vtKQvzxgeG?usp=sharing

(It crashes, but it gets past check_shapes.)
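Concretely, the ordering described above might look like the sketch below; the shapes and temperature are illustrative, MaxSimDistance stands in for the user's custom distance, and (as noted) the loss itself may still fail further down for 3D inputs:

import torch
from pytorch_metric_learning import losses
from pytorch_metric_learning.utils import common_functions as c_f

# 1. Patch the shape check before the loss function is ever called
c_f.check_shapes = lambda x, y: None

# 2. Then build and call the loss with 3D (batch_size, seq_len, hidden_size) embeddings
loss_fn = losses.NTXentLoss(temperature=0.07, distance=MaxSimDistance())
query_rpr = torch.randn(8, 32, 128)
doc_rpr = torch.randn(8, 32, 128)
labels = torch.arange(8)
loss = loss_fn(query_rpr, labels, ref_emb=doc_rpr, ref_labels=labels)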

@celsofranssa
Author

It seems to be working, thank you.

@celsofranssa
Author

Hello again @KevinMusgrave

To clarify, does the suggested change have any impact on RAM usage? The loss is converging, but RAM consumption grows with each training step, even though I'm not storing anything and every training step runs on the GPU.

@KevinMusgrave
Owner

No, I don't see why it would affect RAM usage.

@KevinMusgrave
Owner

KevinMusgrave commented Jan 21, 2023

In version 1.7.2, I have moved the ndim check into BaseDistance so that it can be overridden like this:

from pytorch_metric_learning.distances import BaseDistance

class CustomDistance(BaseDistance):
    def check_shapes(self, query_emb, ref_emb):
        pass  # skip the default 2D shape check
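As a follow-up, a hedged sketch of how this override could be combined with a MaxSim-style distance like the one assumed earlier in this thread; the class names are illustrative and not part of the library:

from pytorch_metric_learning import losses

class MaxSimDistanceNoCheck(MaxSimDistance):
    def check_shapes(self, query_emb, ref_emb):
        # Allow 3D (batch_size, seq_len, hidden_size) embeddings through
        pass

criterion = losses.NTXentLoss(temperature=0.07, distance=MaxSimDistanceNoCheck())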
