How does smart_batching_collate work? #1592

Closed
InhyeokYoo opened this issue Jun 12, 2022 · 4 comments

Comments

InhyeokYoo commented Jun 12, 2022

Hi.

I've read the SBERT paper, but there is one thing I cannot understand.
In Section 7 (Computational Efficiency), the paper says SBERT uses 'smart batching' to improve computational efficiency by reducing the overhead from padding:

For improved computation of sentence embeddings, we implemented a smart batching strategy: Sentences with similar lengths are grouped together and are only padded to the longest element in a mini-batch. This drastically reduces computational overhead from padding tokens.

However, the code doesn't seem to use this "sentences with similar lengths" information.
It looks to me like the collate function just extracts the texts and labels from the examples, tokenizes the texts, and returns them.

Could somebody please let me know what I am missing?
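
For reference, here is a simplified sketch of what I understand the collate function to do (hypothetical code, not copied from the repository): it pulls the texts and labels out of the InputExamples and tokenizes each mini-batch, padding only to the longest element of that batch.

```python
import torch

def collate(batch, tokenizer):
    # batch is a list of InputExample-like objects with .texts and .label
    labels = torch.tensor([example.label for example in batch])

    # Tokenize each text column (e.g. sentence A, sentence B) separately,
    # padding only up to the longest sentence in this mini-batch.
    features = []
    for column in zip(*[example.texts for example in batch]):
        features.append(tokenizer(list(column), padding="longest",
                                  truncation=True, return_tensors="pt"))
    return features, labels
```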

nreimers (Member)

Sentences are sorted by length in the encode method.
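
For illustration, a minimal sketch of that length-sorting idea (simplified, not the library's exact code): sort the sentences by length, pad each mini-batch only to its longest member, then place the embeddings back in the original input order. Model name and mean pooling here are assumptions for the example.

```python
import torch
from transformers import AutoTokenizer, AutoModel

def encode_sorted(sentences, batch_size=32,
                  model_name="sentence-transformers/all-MiniLM-L6-v2"):
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name).eval()

    # Sort indices by sentence length so each batch contains similar lengths.
    order = sorted(range(len(sentences)), key=lambda i: len(sentences[i]))

    embeddings = [None] * len(sentences)
    with torch.no_grad():
        for start in range(0, len(order), batch_size):
            batch_idx = order[start:start + batch_size]
            batch = [sentences[i] for i in batch_idx]
            # padding="longest" pads only up to the longest sentence in this batch.
            features = tokenizer(batch, padding="longest", truncation=True,
                                 return_tensors="pt")
            token_emb = model(**features).last_hidden_state
            mask = features["attention_mask"].unsqueeze(-1).float()
            pooled = (token_emb * mask).sum(1) / mask.sum(1)  # mean pooling
            for i, emb in zip(batch_idx, pooled):
                embeddings[i] = emb  # put the result back at its original position

    return torch.stack(embeddings)
```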

InhyeokYoo (Author) commented Jun 12, 2022

@nreimers

Does this mean smart_batching_collate is actually not the smart batching described in the paper?

And one more question: why did you use it in the encode method but not also in the fit method?

nreimers (Member)

Only in the encode. In the fit we want a random shuffle.
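
As a usage-level illustration of this distinction (assumed model name and toy data, not from the repository): the training DataLoader is created with shuffle=True, so fit sees mini-batches with randomly mixed lengths, while encode is free to sort internally because it returns the embeddings in the original input order.

```python
from torch.utils.data import DataLoader
from sentence_transformers import SentenceTransformer, InputExample

model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed model name

# Training: the DataLoader shuffles, so mini-batches mix sentence lengths;
# the collate function only tokenizes/pads, it does not sort by length.
train_examples = [InputExample(texts=["A short pair.", "Another sentence."], label=1.0)]
train_dataloader = DataLoader(train_examples, shuffle=True, batch_size=16)

# Inference: encode() can sort by length internally for speed and then
# return the embeddings in the original input order.
embeddings = model.encode(["A short one.", "A noticeably longer sentence to embed."])
```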

InhyeokYoo (Author)

Thank you. Totally understood.
