speechlmscore_tool

Implementation of "SpeechLMScore: Evaluating speech generation using speech langauge model" https://arxiv.org/abs/2212.04559

Installation

You can install required python packages as:

python setup.py install

Usage

Download pretrained models

Download these pretrained models and update their path in run.sh.
Note: tokens.txt is located with speech ulm model.

Run the following command to download all the above models:

./download_pretrained_models.sh

Compute SpeechLMScore using pretrained models

Generates speechlmscore for each file in audio_dir in file ppl.
Audio files with sampling rate of 16kHZ are supported.
Note: for using audio files other than .wav set ext variable is run.sh.

audio_dir=<folder containing audio>
layer=<Hubert layer to extract features>

./run.sh ${audio_dir} ${layer}

Train speech language models

Additionally speech language model can be trained and used for evaluation as well. More Details

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
utils		utils
.gitignore		.gitignore
01a_gen_list.py		01a_gen_list.py
01b_gen_tsv.py		01b_gen_tsv.py
02a_dump_feature.py		02a_dump_feature.py
02b_dump_km_label.py		02b_dump_km_label.py
03_calc_perplexity.py		03_calc_perplexity.py
LICENSE		LICENSE
README.md		README.md
Training.md		Training.md
create_token_list.py		create_token_list.py
download_pretrained_models.sh		download_pretrained_models.sh
requirements.txt		requirements.txt
run.sh		run.sh
setup.py		setup.py

License

soumimaiti/speechlmscore_tool

Folders and files

Latest commit

History

Repository files navigation

speechlmscore_tool

Installation

Usage

Download pretrained models

Compute SpeechLMScore using pretrained models

Train speech language models

About

Resources

License

Stars

Watchers

Forks

Languages