Skip to content
Branch: master
Find file History
Permalink
Type Name Latest commit message Commit time
..
Failed to load latest commit information.
conf [egs] Add recipes for Speakers in the Wild (SITW) (#2422) May 24, 2018
README.txt [egs] Add recipes for Speakers in the Wild (SITW) (#2422) May 24, 2018
cmd.sh
local
path.sh
run.sh [egs] minor fixes related to python2 vs python3 differences (#2977) Jan 8, 2019
sid [egs] Add recipes for Speakers in the Wild (SITW) (#2422) May 24, 2018
steps
utils [egs] Add recipes for Speakers in the Wild (SITW) (#2422) May 24, 2018

README.txt

 This recipe replaces i-vectors used in the v1 recipe with embeddings extracted
 from a deep neural network.  In the scripts, we refer to these embeddings as
 "x-vectors."  The recipe in local/nnet3/xvector/tuning/run_xvector_1a.sh is
 closesly based on the following paper:

 @inproceedings{snyder2018xvector,
 title={X-vectors: Robust DNN Embeddings for Speaker Recognition},
 author={Snyder, D. and Garcia-Romero, D. and Sell, G. and Povey, D. and Khudanpur, S.},
 booktitle={2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
 year={2018},
 organization={IEEE},
 url={http://www.danielpovey.com/files/2018_icassp_xvectors.pdf}
 }

 The recipe uses the following datasets:

 Evaluation
     
     Speakers in the Wild    http://www.speech.sri.com/projects/sitw

 System Development
     
     VoxCeleb 1              http://www.robots.ox.ac.uk/~vgg/data/voxceleb
     VoxCeleb 2              http://www.robots.ox.ac.uk/~vgg/data/voxceleb2
     MUSAN                   http://www.openslr.org/17
     RIR_NOISES              http://www.openslr.org/28
You can’t perform that action at this time.