Getting speaker embeddings #13

Closed
AH289 opened this issue Nov 2, 2021 · 3 comments

Comments

AH289 commented Nov 2, 2021

The UniSpeech-SAT directory in this repo contains an example that takes a .wav file as input and produces a tensor 'f' as output. Can I get the speaker embeddings from 'f'?

MarkWuNLP (Contributor) commented

Hi,
If you would like to get speaker embeddings, you can refer to our speaker verification code: https://github.com/microsoft/UniSpeech/tree/main/UniSpeech-SAT/speaker_verification. @czy97, adding Zhengyang to the thread.
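
For readers wondering why 'f' is not itself a speaker embedding: a pre-trained model of this kind typically yields one feature vector per frame, while a speaker embedding is a single fixed-size vector per utterance. The sketch below is only an illustration of that gap, not the method used in the speaker_verification directory linked above. It assumes 'f' can be viewed as a frames x dim grid (the real layout may differ), uses plain Python lists as stand-ins for tensors, and the helper names `mean_pool` and `cosine_similarity` are hypothetical. Mean pooling is an untrained baseline; the linked code remains the reference for embeddings that are actually trained for verification.

```python
import math


def mean_pool(frames):
    """Average frame-level vectors (frames x dim) into one utterance-level vector.

    Assumption: `frames` is a non-empty list of equal-length lists of floats,
    standing in for a frame-level feature tensor such as 'f'.
    """
    if not frames:
        raise ValueError("need at least one frame")
    dim = len(frames[0])
    return [sum(frame[i] for frame in frames) / len(frames) for i in range(dim)]


def cosine_similarity(a, b):
    """Cosine similarity between two vectors of the same length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy data only: two "utterances" with 2-dimensional frame features.
utt_a = [[1.0, 0.0], [3.0, 2.0]]
utt_b = [[0.0, 1.0], [0.0, 3.0]]

emb_a = mean_pool(utt_a)  # one vector per utterance
emb_b = mean_pool(utt_b)
score = cosine_similarity(emb_a, emb_b)  # higher means more similar
```

Comparing pooled vectors with cosine similarity mirrors how verification scores are commonly computed, but with untrained pooling the scores should not be expected to separate speakers well.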

czy97 (Contributor) commented Nov 3, 2021

Specifically, you can get the speaker embedding from this line:

AH289 (Author) commented Nov 4, 2021

Thank you so much.

AH289 closed this as completed Nov 4, 2021