Speaker embedding

This code is used to extract the speaker's feature from wav and make the speaker's embedding. The algorithm is based on the following papers:

Wan, L., Wang, Q., Papir, A., & Moreno, I. L. (2017). Generalized end-to-end loss for speaker verification. arXiv preprint arXiv:1710.10467.
Jia, Y., Zhang, Y., Weiss, R. J., Wang, Q., Shen, J., Ren, F., ... & Wu, Y. (2018). Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis. arXiv preprint arXiv:1806.04558.

I uploaded the Torch version speaker embedding model. If you want, please refer the following:

https://github.com/CODEJIN/Speaker_Embedding_Torch

Training and test

VCTK, LibriSpeech, VoxCeleb1, and VoxCeleb2 were used for model learning, and some of the test sets of VoxCeleb1 that were not learned were used for testing the learned model. Please refer to the following link for each dataset:

VCTK: https://datashare.is.ed.ac.uk/handle/10283/2651
LibriSpeech: http://www.robots.ox.ac.uk/~vgg/data/voxceleb/
VoxCeleb: http://www.openslr.org/12/

t-SNE Result about the sentences of 10 non-trained talkers

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Audio.py		Audio.py
Customized_Modules.py		Customized_Modules.py
Hyper_Parameters.py		Hyper_Parameters.py
Pattern_Feeders.py		Pattern_Feeders.py
Pattern_Generate.Speaker_Embedding.py		Pattern_Generate.Speaker_Embedding.py
README.md		README.md
Speaker_Embedding.py		Speaker_Embedding.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speaker embedding

Training and test

t-SNE Result about the sentences of 10 non-trained talkers

About

Releases

Packages

Languages

CODEJIN/speaker_embedding

Folders and files

Latest commit

History

Repository files navigation

Speaker embedding

Training and test

t-SNE Result about the sentences of 10 non-trained talkers

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages