ivector_extract

Overview

Ivector extraction in Kaldi is done through invoking 3 programs running in a pipe: gmm-gselect, fgmm-global-gselect-to-post and ivector-extract. Each program takes the input (features generated from wave files), certain models and the output from previous program to generate output. Models are large files (~400MB), the overhead is negligible when processing thousands of features at once, however in the use case of processing one feature at once the overhead is significant.

The code merges 3 programs into one, load the models at the beginning and then act as a server, processing a feature from a request and return the results.

Setup

Clone the project
Add KALDI_ROOT to your environment variables by editing ~/.bashrc
Run the init script: $ ./init.sh
Run the server script: $ ./server.sh
Open another shell, run the client script: $ ./client.sh
To terminate, use the client shell and run the termination script: $ ./terminate.sh

Structure of the repository

src/
	ivector-extract-server.cc 	# server code
	ivector-extract-client.cc 	# client code
bin/
	ivector-extract-client		# client binary
workflow/
	conf/						# config for compute-mfcc and compute-vad
		mfcc.conf
		vad.conf
	models/
		final.ie
		final.ubm
		[final.dubm] 			# optional
	data/
		utt2spk					# mapping of utterance id to speaker id
		spk2utt 				# the reverse of above (usually has the same content)
		file_name.sph 			# the source sph file
		wav.scp 				# Kaldi script file to convert sph to wav for processing 
	temp/ 						# folder to store temp result files
		mfcc.ark
		mfcc.scp
		vad.ark
		vad.scp
	result/ 					# folder to store results
		ivector.ark
		ivector.scp
		num_utts.ark
		spk_ivector.ark
		spk_ivector.scp
	gold/ 						# gold standard if exists
	log/						# folder to store log files
		mfcc.log
		vad.log
		speaker_mean.log
	utils/ 						# utilities, only run.pl needed
		run.pl
init.sh 						# make client, patch Kaldi with server
path.sh 						# include paths
server.sh 						# start server
client.sh 						# start client
terminate.sh 					# client shortcut to terminate server

To-do

Only need to take sph file, generate the rest of the input files automatically

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src

src

.gitignore

.gitignore

README.md

README.md

client.sh

client.sh

init.sh

init.sh

path.sh

path.sh

server.sh

server.sh

terminate.sh

terminate.sh

Repository files navigation

ivector_extract

Overview

Setup

Structure of the repository

To-do

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
src		src
.gitignore		.gitignore
README.md		README.md
client.sh		client.sh
init.sh		init.sh
path.sh		path.sh
server.sh		server.sh
terminate.sh		terminate.sh

nguyenhuyanhh/ivector_extract

Folders and files

Latest commit

History

Repository files navigation

ivector_extract

Overview

Setup

Structure of the repository

To-do

About

Resources

Stars

Watchers

Forks

Languages