WFST Decoding Based on k2

This repository aims to add WFST decoding based on k2 for speechbrain with python. You can know more details about how k2 implements WFST from k2 and icefall. Here, we use the transformer trained by speechbrain as our acoustic model to compute the output probabilities. Based on the output probabilities, we implement WFST decoding with k2.

Environment

Suggest run this script with a conda environment (Linux). You can also config your environment according to this url based on your reality.

1. conda create -n k2-python python=3.8
2. source activate k2-python
3. pip install torch==1.8.1+cu102 torchaudio===0.8.1 -f https://download.pytorch.org/whl/torch_stable.html
4. conda install -c k2-fsa -c pytorch -c conda-forge k2 python=3.8 cudatoolkit=10.2 pytorch=1.8.1
5. pip install speechbrain

If there are some other packages missing, you can install them as follows:

pip install xxx or conda install xxx

Dataset

You can change the data_dir to your own librispeech data directory in test_ctc.py and test_hlg.py .

Testing

You just need run the following command:

bash run.sh

The run.sh can unify all python files and complete them with just one command.

Results

CUDA_VISIBLE_DEVICES='0' python3 test_ctc.py

        | test-clean | test-other
--------------------------------
 WER(%) |    5.88    |   13.82

CUDA_VISIBLE_DEVICES='0' python3 test_hlg.py

           |          WER(%)
-------------------------------------
  lm_scale | test-clean | test-other
-------------------------------------
    0.3    |    4.76    |   10.93
-------------------------------------
    0.4    |    4.75    |   10.83
-------------------------------------
    0.5    |    4.83    |   10.89
-------------------------------------

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
local		local
results		results
utils		utils
README.md		README.md
run.sh		run.sh
test_ctc.py		test_ctc.py
test_hlg.py		test_hlg.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WFST Decoding Based on k2

Environment

Dataset

Testing

Results

About

Releases

Packages

Languages

luomingshuang/k2-speechbrain

Folders and files

Latest commit

History

Repository files navigation

WFST Decoding Based on k2

Environment

Dataset

Testing

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages