GitHub - idiap/CNN_QbE_STD: Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"

Description

Implementation of the work presented in "CNN based Query by Example Spoken Term Detection".

We have included some example groundtruth files for training as well as development set.

The posteriors features are extracted using the setup presented in the following paper: "High-performance query-by-example spoken term detection on the SWS 2013 evaluation".

The input feature files for training/evaluation are in pytorch readable format which are saved as python dictionaries. The keys are the names of the files in 'groundtruth files' and values are the features in matrix format.

Training

python query_detection_dtw_cnn.py -optim adam -learning_rate 0.0001 -input_size 152 -batch_size 50 -layers 9 -depth 30 -dropout 0.2 -loss_threshold 0.1 -n_valid 50 -max_batch_dev 250 -max_batch_train 1000

Evaluation

python query_detection_dtw_cnn_evaluation.py -input_size 152 -depth 30 -load_model -modelpath cnn_qbe_std_model.pt -outdir outpath -query_list dev_queries_sample_list.txt -search_list search_utterances_sample_list.txt

Reference

@inproceedings{ram2018cnn,
  title={CNN based Query by Example Spoken Term Detection},
  author={Ram, Dhananjay and Miculicich, Lesly and Bourlard, Herv{\'e}},
  booktitle={Nineteenth Annual Conference of the International Speech Communication Association (INTERSPEECH)},
  year={2018}
}

Contact:

dhananjay.ram@idiap.ch

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commits
COPYING		COPYING
Dataset_DTW_CNN.py		Dataset_DTW_CNN.py
GroundTruth_label_dev_sample.txt		GroundTruth_label_dev_sample.txt
GroundTruth_label_train_complete_neg_sample.txt		GroundTruth_label_train_complete_neg_sample.txt
GroundTruth_label_train_complete_pos_sample.txt		GroundTruth_label_train_complete_pos_sample.txt
Model_Query_Detection_DTW_CNN.py		Model_Query_Detection_DTW_CNN.py
README.md		README.md
cnn_qbe_std_model.pt		cnn_qbe_std_model.pt
dev_queries_sample_list.txt		dev_queries_sample_list.txt
query_detection_dtw_cnn.py		query_detection_dtw_cnn.py
query_detection_dtw_cnn_evaluation.py		query_detection_dtw_cnn_evaluation.py
search_utterances_sample_list.txt		search_utterances_sample_list.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

COPYING

COPYING

Dataset_DTW_CNN.py

Dataset_DTW_CNN.py

GroundTruth_label_dev_sample.txt

GroundTruth_label_dev_sample.txt

GroundTruth_label_train_complete_neg_sample.txt

GroundTruth_label_train_complete_neg_sample.txt

GroundTruth_label_train_complete_pos_sample.txt

GroundTruth_label_train_complete_pos_sample.txt

Model_Query_Detection_DTW_CNN.py

Model_Query_Detection_DTW_CNN.py

README.md

README.md

cnn_qbe_std_model.pt

cnn_qbe_std_model.pt

dev_queries_sample_list.txt

dev_queries_sample_list.txt

query_detection_dtw_cnn.py

query_detection_dtw_cnn.py

query_detection_dtw_cnn_evaluation.py

query_detection_dtw_cnn_evaluation.py

search_utterances_sample_list.txt

search_utterances_sample_list.txt

Repository files navigation

Description

Training

Evaluation

Reference

Contact:

About

Releases

Packages

Languages

License

idiap/CNN_QbE_STD

Folders and files

Latest commit

History

Repository files navigation

Description

Training

Evaluation

Reference

Contact:

About

Resources

License

Stars

Watchers

Forks

Languages