Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

d-Vectors for UIS-RNN #29

Open
arthavmane opened this issue May 6, 2020 · 4 comments
Open

d-Vectors for UIS-RNN #29

arthavmane opened this issue May 6, 2020 · 4 comments

Comments

@arthavmane
Copy link

I'm working on a project in which I want to use d-vector embeddings to train a model.
Can someone please help how to compute d-vectors for different utterances from different speakers to pass into the UISRNN model?

@zhs105
Copy link

zhs105 commented Sep 22, 2020

Hi @arthavmane ,
Did you find a way to get the d-vectors as I am working on a similar project?

@davide-scalzo
Copy link

davide-scalzo commented Oct 7, 2020

I haven't tried UIS-RNN yet and only found this library yesterday but I can extract the embeds with _, EMBEDS, wav_splits = encoder.embed_utterance(wav, return_partials=True)

@saumyaborwankar
Copy link

@davodesign84 actually this command doesnt output a single 256 element array, the EMBEDS variable will be (#,256) but it should be (1,256). I think its first splitting the audio into segments and then finding embeddings but it should use the entire audio. Any clue how to do that?

@saumyaborwankar
Copy link

Actually I found out @davodesign84 and @zhs105 you can just call embed = encoder.embed_utterance(wav) and itll give you a (1,256) array which is your embedding for the specific wav file.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants