Wall Street Journal with ISS

Data preparation. Execute the following scripts in order.

Set up links to the WSJ database, and also your own temp space (for features):

CreateLinks.sh

This converts the .dot files supplied with WSJ into .mlf label files in the local directory:

CreateLabels.sh

This will generate the file lists used for training and testing:

CreateLists.sh

This script creates the two dictionaries required for training: flat (basic) and main (with sil and sp final phones):

CreateDicts.sh
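
Since the order matters, the four preparation scripts can be chained so that a failure stops the sequence early. The following is a minimal sketch, not part of ISS: `run_in_order` is a hypothetical helper, and it assumes the scripts are executable and live in the current directory.

```shell
#!/bin/sh
# Hypothetical helper (not part of ISS): run each script in turn and
# stop at the first one that exits with a non-zero status.
run_in_order() {
    for script in "$@"; do
        echo "Running $script"
        "./$script" || { echo "$script failed" >&2; return 1; }
    done
}

# Usage, from the directory containing the scripts:
#   run_in_order CreateLinks.sh CreateLabels.sh CreateLists.sh CreateDicts.sh
```

Stopping at the first failure avoids building lists or dictionaries on top of missing links or labels.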

Now a system can be trained.

Extract features to the feats directory. You can choose the output directory; just make sure it remains the same for all training commands:

ExtractTrain.sh train-plp-si-84

Train an HMM-GMM model using HTS:

TrainGMM.sh train-plp-si-84
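
One way to keep the directory name identical across both training steps is to pass it through a single argument. This is only a sketch: `train_baseline` and the `DRY_RUN` variable are names introduced here for illustration and are not part of ISS.

```shell
#!/bin/sh
# Hypothetical wrapper (not part of ISS): hand the same directory name
# to both training steps so the two commands cannot drift apart.
train_baseline() {
    for step in ExtractTrain.sh TrainGMM.sh; do
        if [ -n "$DRY_RUN" ]; then
            echo "./$step $1"           # dry run: only print the command
        else
            "./$step" "$1" || return 1  # real run: stop on the first failure
        fi
    done
}

DRY_RUN=1
train_baseline train-plp-si-84
```

With `DRY_RUN` set, the function only prints the two commands; unset it to run them.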

Prepare for testing.

Convert the WSJ LM and word lists to generic formats in ./local:

CreateLMs.sh

Create a language model suitable for the selected decoder. Check the Config.sh file for the desired one. You can choose the output directory; note that it is independent of the acoustic model, so a name specific to the language model is appropriate:

CreateLM.sh wsj5k

Now the test can be run.

Extract testing data:

ExtractTest.sh test-plp-h2-p0

Run the test:

TestGMM.sh test-plp-h2-p0

Score the test:

Score.sh test-plp-h2-p0

Result should be:

SENT: %Correct=36.74 [H=79, S=136, N=215]
WORD: %Corr=91.58, Acc=90.65 [H=3525, D=86, S=238, I=36, N=3849]
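
The percentages can be recomputed from the counts in brackets. The output has the layout of HTK's HResults, so this sketch assumes its usual definitions: %Corr = 100*H/N and Acc = 100*(H-I)/N, where H is hits, I is insertions and N is the number of reference words. `word_scores` is a helper introduced here for illustration.

```shell
#!/bin/sh
# Recompute the WORD line from its counts, assuming HResults-style
# definitions: %Corr = 100*H/N and Acc = 100*(H-I)/N.
word_scores() {
    awk -v H="$1" -v I="$2" -v N="$3" \
        'BEGIN { printf "Corr=%.2f Acc=%.2f\n", 100*H/N, 100*(H-I)/N }'
}

word_scores 3525 36 3849   # prints: Corr=91.58 Acc=90.65
```

The same check applies to the SENT line, where 79 correct sentences out of 215 gives 36.74%.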

Speaker Adaptive Training (SAT)

Estimate CMLLR transforms for the training set:

AdaptMLLRTrain.sh cmllr-plp-si-84

Train CMLLR speaker normalized acoustic models:

RetrainGMM.sh train-plp-si-84

Estimate CMLLR transforms (supervised) for the test set:

AdaptMLLRTest.sh cmllr-plp-h2-p0

Run the test with speaker adapted models:

TestSATGMM.sh test-plp-h2-p0-sat

Score the test:

Score.sh test-plp-h2-p0-sat

Result should be:

SENT: %Correct=44.19 [H=95, S=120, N=215]
WORD: %Corr=93.45, Acc=92.70 [H=3597, D=71, S=181, I=29, N=3849]

SAT+MLLR Speaker Adaptation

Estimate MLLR transforms for the test set with the speaker adapted models:

AdaptSATMLLRTest.sh mllr-plp-h2-p0

Run the test with speaker adapted models plus MLLR:

TestSATMLLRGMM.sh test-plp-h2-p0-sat-mllr

Score the test:

Score.sh test-plp-h2-p0-sat-mllr

Result should be:

SENT: %Correct=54.88 [H=118, S=97, N=215]
WORD: %Corr=95.53, Acc=95.14 [H=3677, D=55, S=117, I=15, N=3849]
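
To compare the three systems on one scale, the word error rate can be derived from the counts as 100*(D+S+I)/N, which equals 100 minus Acc. The sketch below uses the counts reported in the three results above; `wer` is a helper introduced here for illustration.

```shell
#!/bin/sh
# Word error rate from deletions, substitutions, insertions and the
# number of reference words: WER = 100*(D+S+I)/N.
wer() {
    awk -v D="$1" -v S="$2" -v I="$3" -v N="$4" \
        'BEGIN { printf "%.2f\n", 100*(D+S+I)/N }'
}

echo "baseline  $(wer 86 238 36 3849)"
echo "SAT       $(wer 71 181 29 3849)"
echo "SAT+MLLR  $(wer 55 117 15 3849)"
```

This gives 9.35, 7.30 and 4.86 respectively, matching 100 minus the Acc figures above.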

Tandem features from MLP phone posteriors

This will generate train, dev and test lists. The train and dev lists are used for MLP training:

CreateMLPLists.sh

This will generate phone alignments for the train, dev and test MLP lists. The acoustic models to use must be specified within CreateMLPLabels.sh.

CreateMLPLabels.sh align

Shuffle and prepare QuickNet data for MLP training (see the MLP architecture set-up inside the script):

InitMLP.sh mlptrain-si-84

Train the MLP. This outputs a mat file named after the architecture:

TrainMLP.sh mlptrain-si-84

Run an MLP forward pass on the training data, train the KLT transform, and apply log and KLT to obtain tandem features:

ForwardMLPTrain.sh fwdmlp-train-si-84

Compute tandem features for the test set (this uses the KLT statistics from the previous step):

ForwardMLPTest.sh fwdmlp-test-h2-p0

The train and test tandem features are stored in feats/$featName/$MLP_OUT_HTK_DIR. MLP_OUT_HTK_DIR is defined inside ForwardMLPTrain.sh and ForwardMLPTest.sh respectively. To use tandem features to train and test acoustic models, change the trainList and testList variables in Config.sh to:

trainList=../fwdmlp-train-si-84/file-list-htk.txt
testList=../fwdmlp-test-h2-p0/file-list-htk.txt
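
Before retraining, it can help to confirm that both lists exist and are non-empty. A minimal sketch, assuming only that the lists are plain text files; `check_list` is a hypothetical helper, not part of ISS.

```shell
#!/bin/sh
# Hypothetical check (not part of ISS): confirm that a file list exists
# and is non-empty before pointing Config.sh at it.
check_list() {
    if [ -s "$1" ]; then
        echo "$1: $(wc -l < "$1" | tr -d ' ') lines"
    else
        echo "$1: missing or empty" >&2
        return 1
    fi
}

# Usage, with the paths given above:
#   check_list ../fwdmlp-train-si-84/file-list-htk.txt
#   check_list ../fwdmlp-test-h2-p0/file-list-htk.txt
```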

Train an HMM-GMM model using tandem features:

TrainGMM.sh train-tandem-si-84

Run the test (you need to change the acoustic model setting to acousticModel=../train-tandem-si-84):

TestGMM.sh test-tandem-h2-p0

Score the test:

Score.sh test-tandem-h2-p0

Result should be:

SENT: %Correct=32.56 [H=70, S=145, N=215]
WORD: %Corr=91.48, Acc=89.76 [H=3521, D=88, S=240, I=66, N=3849]

Notes

In principle the local directory could be copied to a static site directory. This would save running many of the CreateXXX.sh scripts. However, they are not difficult to run.

Phil Garner, July 2011

Marc Ferras
