HTK is used to create the MFCC features. To compile it, run ./compile_htk.sh.
Run ./link_timit.sh to create a symbolic link to the TIMIT dataset under /results.
debug_mfcc contains files for comparing the MFCC features generated by HTK with those generated by python_speech_features.
Only one file is used for this test.
It is copied from TIMITcorpus/TIMIT/TRAIN/DR8/FBCG1/SX442.*, where * is PHN, TXT, WAV, or WRD.
The .wav.sox file was generated with sox SX442.WAV SX442.wav,
as in /home/zhihaol/807/scripts/convert_wav.sh on the CNBC cluster, and then renamed to .wav.sox.
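A minimal sketch of such a comparison in Python is shown below; the file names, the HTK feature file extension, and the MFCC parameters are assumptions for illustration, not the exact settings used by the scripts in debug_mfcc.

    # Sketch only: file names, extension, and MFCC parameters are assumptions;
    # htkmfc.py (from sphinxtrain) is assumed to be importable as cmusphinx.htkmfc.
    import numpy as np
    import scipy.io.wavfile as wav
    from python_speech_features import mfcc
    from cmusphinx.htkmfc import HTKFeat_read

    # MFCC computed in Python from the sox-converted wav
    rate, signal = wav.read("debug_mfcc/SX442.wav.sox")
    psf_feat = mfcc(signal, samplerate=rate, winlen=0.025, winstep=0.01, numcep=13)

    # MFCC computed by HTK and stored in HTK feature format (file name assumed)
    htk_feat = HTKFeat_read("debug_mfcc/SX442.mfc").getall()

    # Compare over the frames and coefficients the two versions share
    n = min(len(psf_feat), len(htk_feat))
    d = min(psf_feat.shape[1], htk_feat.shape[1])
    print("shapes:", psf_feat.shape, htk_feat.shape)
    print("max abs diff:", np.abs(psf_feat[:n, :d] - htk_feat[:n, :d]).max())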
Features are generated according to Section 5.1 of Supervised Sequence Labelling with Recurrent Neural Networks (Graves); a preprint is available at http://www.cs.toronto.edu/~graves/preprint.pdf.
The HTK coding follows Section 3.1.5, "Step 5 - Coding the Data", of the official HTK book (htkbook-3.5.alpha-1.pdf).
The HTK-generated feature files are read in Python using https://github.com/cmusphinx/sphinxtrain/blob/master/python/cmusphinx/htkmfc.py.
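For reference, reading a single HTK feature file with that module might look like the sketch below; the file name is hypothetical, and the attribute names follow the htkmfc.py source.

    # Sketch only: the file name is hypothetical.
    from cmusphinx.htkmfc import HTKFeat_read

    reader = HTKFeat_read("SX442.mfc")   # parses the 12-byte HTK header on open
    feats = reader.getall()              # numpy array, (num_frames, num_coefficients)
    print(feats.shape, reader.nSamples, reader.sampPeriod, reader.parmKind)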
Run PYTHONPATH=$(pwd) python feature_generation/generate_mfcc_features.py under the project root to see the results.
It generates /results/features/TIMIT_train.hdf5 and /results/features/TIMIT_test.hdf5,
as well as a folder /results/features/TIMIT_train containing the training data for ctc_example.
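The internal layout of these HDF5 files is not documented here, so the sketch below simply lists whatever datasets they contain; h5py is assumed to be installed.

    # Sketch only: inspects the generated file without assuming its layout.
    import h5py

    def show(name, obj):
        if isinstance(obj, h5py.Dataset):
            print(name, obj.shape, obj.dtype)

    with h5py.File("/results/features/TIMIT_train.hdf5", "r") as f:
        f.visititems(show)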
ctc_example is from https://github.com/dresen/tensorflow_CTC_example.
I modified INPUT_PATH and TARGET_PATH and set the minibatch size to 10 in bdlstm_train.py (sketched below),
and adjusted load_batched_data in utils.py.
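A sketch of the kind of edit made in bdlstm_train.py follows; only the names INPUT_PATH and TARGET_PATH and the batch size of 10 come from this README, while the path values and the batch-size variable name are assumptions and may differ from the upstream code.

    # Sketch only: path values and the batch-size variable name are assumptions.
    INPUT_PATH = '/results/features/TIMIT_train'   # features written by generate_mfcc_features.py (assumed)
    TARGET_PATH = '/results/features/TIMIT_train'  # location of the target labels (assumed)
    batchSize = 10                                 # minibatch size reduced to 10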
Run PYTHONPATH=$(pwd) python ctc_example/bdlstm_train.py under the project root to see the results.
After 10 epochs, it should reach an error rate of about 0.4.
It's from http://file.ppwwyyxx.com/