d-Vectors for UIS-RNN #29

arthavmane · 2020-05-06T09:43:01Z

I'm working on a project in which I want to use d-vector embeddings to train a model.
Can someone please help how to compute d-vectors for different utterances from different speakers to pass into the UISRNN model?

zhs105 · 2020-09-22T07:11:24Z

Hi @arthavmane ,
Did you find a way to get the d-vectors as I am working on a similar project?

davide-scalzo · 2020-10-07T13:18:10Z

I haven't tried UIS-RNN yet and only found this library yesterday but I can extract the embeds with _, EMBEDS, wav_splits = encoder.embed_utterance(wav, return_partials=True)

saumyaborwankar · 2021-02-11T04:21:26Z

@davodesign84 actually this command doesnt output a single 256 element array, the EMBEDS variable will be (#,256) but it should be (1,256). I think its first splitting the audio into segments and then finding embeddings but it should use the entire audio. Any clue how to do that?

saumyaborwankar · 2021-02-11T04:27:32Z

Actually I found out @davodesign84 and @zhs105 you can just call embed = encoder.embed_utterance(wav) and itll give you a (1,256) array which is your embedding for the specific wav file.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

d-Vectors for UIS-RNN #29

d-Vectors for UIS-RNN #29

arthavmane commented May 6, 2020

zhs105 commented Sep 22, 2020

davide-scalzo commented Oct 7, 2020 •

edited

Loading

saumyaborwankar commented Feb 11, 2021

saumyaborwankar commented Feb 11, 2021

d-Vectors for UIS-RNN #29

d-Vectors for UIS-RNN #29

Comments

arthavmane commented May 6, 2020

zhs105 commented Sep 22, 2020

davide-scalzo commented Oct 7, 2020 • edited Loading

saumyaborwankar commented Feb 11, 2021

saumyaborwankar commented Feb 11, 2021

davide-scalzo commented Oct 7, 2020 •

edited

Loading