Could this be used to compare audio similarity? #12

youssefabdelm · 2022-11-05T16:48:14Z

❓ Questions

I'm curious how to extract embeddings. Are they the output of the compress function / command-line tool, and could they be used to compare, via cosine similarity, how similar two audio files are?

adefossez · 2022-11-17T16:30:10Z

Good question, we actually haven't tried. We definitely believe that the model performs some "collapse" of similar audio onto the same representation, eliminating some of the variability that might occur between two similar audio clips (e.g. phase differences, white noise components). Note that we have good reasons to believe the representation is mostly at the acoustic level. Thus semantically similar audio (e.g. two pieces of music in the same genre, or two people talking about the same topic) wouldn't be close in the latent space.
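
For readers who want to try the comparison asked about in this thread, here is a minimal sketch, not a tested recipe. It assumes you have already obtained, by whatever means, one continuous embedding per clip as a NumPy array of shape (dim, frames). The arrays below are random stand-ins, and the function name `pooled_cosine_similarity` is hypothetical. Averaging over time is only one possible pooling choice and it discards temporal structure. Given the caveat above, any score would likely reflect acoustic rather than semantic closeness.

```python
import numpy as np


def pooled_cosine_similarity(emb_a: np.ndarray, emb_b: np.ndarray) -> float:
    """Cosine similarity between two time-averaged embeddings.

    Each input is assumed to have shape (dim, frames). The number of
    frames may differ between clips, the embedding dimension may not.
    """
    # Average over the time axis so each clip becomes a single vector.
    a = emb_a.mean(axis=1)
    b = emb_b.mean(axis=1)
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    if denom == 0.0:
        # Cosine similarity is undefined for a zero vector; return 0.0 by convention.
        return 0.0
    return float(np.dot(a, b) / denom)


# Random stand-ins for real embeddings (assumption: 128-dimensional latents).
rng = np.random.default_rng(0)
clip = rng.standard_normal((128, 150))
noisy = clip + 0.01 * rng.standard_normal(clip.shape)  # same clip, slight perturbation
other = rng.standard_normal((128, 90))                 # unrelated clip, different length

print("clip vs. noisy copy:", pooled_cosine_similarity(clip, noisy))
print("clip vs. unrelated: ", pooled_cosine_similarity(clip, other))
```

With real embeddings, the first score should sit close to 1 only if the model does collapse small perturbations as suggested above. That is something to check empirically, not a guarantee.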

youssefabdelm added the "question" label (Further information is requested) on Nov 5, 2022
