This repository has been archived by the owner on May 5, 2023. It is now read-only.
Thanks for the suggestion! The thing is, I feel the current SOTA encoders that can handle multi-modal data (i.e., both text and images) are a bit behind text-only encoders, especially when working mainly with text.
One way of combining the stronger text encoders with image support is to store two embeddings per text: one from a text-only encoder and one from an image-capable encoder like CLIP. When working with text, the search would use the cleaner text-only embedding; when working with images, it would use the CLIP one.
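A minimal sketch of that two-embedding scheme, with hypothetical `text_encode`/`clip_encode` stand-ins (deterministic dummy vectors) in place of real models such as a sentence-transformers text encoder and CLIP:

```python
import numpy as np

# Hypothetical stand-ins for the two encoders. In practice these would be
# a text-only model (e.g. sentence-transformers) and a CLIP-style model;
# here we derive deterministic dummy vectors so the sketch is self-contained.
def text_encode(text: str) -> np.ndarray:
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.standard_normal(8)
    return v / np.linalg.norm(v)

def clip_encode(item: str) -> np.ndarray:
    rng = np.random.default_rng(abs(hash(("clip", item))) % (2**32))
    v = rng.standard_normal(8)
    return v / np.linalg.norm(v)

class DualIndex:
    """Stores two embeddings per text: one text-only, one CLIP-style."""

    def __init__(self):
        self.docs, self.text_vecs, self.clip_vecs = [], [], []

    def add(self, doc: str):
        # Each document is embedded twice, once per encoder.
        self.docs.append(doc)
        self.text_vecs.append(text_encode(doc))
        self.clip_vecs.append(clip_encode(doc))

    def search(self, query: str, modality: str = "text", k: int = 3):
        # Text queries use the cleaner text-only space; image queries
        # fall back to the shared CLIP space.
        q = text_encode(query) if modality == "text" else clip_encode(query)
        vecs = self.text_vecs if modality == "text" else self.clip_vecs
        sims = np.array(vecs) @ q  # cosine similarity (vectors are unit-norm)
        top = np.argsort(-sims)[:k]
        return [(self.docs[i], float(sims[i])) for i in top]
```

The cost is the one this comment flags: double the embedding storage and two encoder passes per document at index time, in exchange for better text-to-text retrieval quality.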
In your view, would the slightly better performance with text justify the increase in complexity, @LifeIsStrange ?
The state of the art can be found here:
https://paperswithcode.com/sota/semantic-textual-similarity-on-sts-benchmark
Other benchmarks:
https://paperswithcode.com/task/semantic-textual-similarity