viet-aligner

Aligner vietnamese text and audio clip

sudo apt install -y sox libsox-fmt-mp3

Install MFA

conda create -n kaldi python=3.9.12
conda activate kaldi
conda install -c conda-forge kaldi montreal-forced-aligner

Download datasets

python download_infore.py --output-dir data
python download_vivos.py --output-dir data
python download_common_voice.py --output-dir data
python download_fpt_open_speech.py --output-dir data

Align using pretrained acoustic model

mfa align /path/to/corpus/dir assets/lexicon.txt assets/mfa_vi_model.zip /path/to/output/dir

Train new acoustic model

python create_lexicon.py --data-dir data
mfa train --clean -o mfa_vi_model -t ./mfa_tmp ./data ./data/lexicon.txt mfa_output

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
assets		assets
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
colab_quick_start.ipynb		colab_quick_start.ipynb
create_lexicon.py		create_lexicon.py
download_common_voice.py		download_common_voice.py
download_fpt_open_speech.py		download_fpt_open_speech.py
download_infore.py		download_infore.py
download_vivos.py		download_vivos.py
normalize_text.py		normalize_text.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

assets

assets

.gitattributes

.gitattributes

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

colab_quick_start.ipynb

colab_quick_start.ipynb

create_lexicon.py

create_lexicon.py

download_common_voice.py

download_common_voice.py

download_fpt_open_speech.py

download_fpt_open_speech.py

download_infore.py

download_infore.py

download_vivos.py

download_vivos.py

normalize_text.py

normalize_text.py

Repository files navigation

viet-aligner

Install MFA

Download datasets

Align using pretrained acoustic model

Train new acoustic model

About

Releases

Packages

Languages

License

NTT123/viet-aligner

Folders and files

Latest commit

History

Repository files navigation

viet-aligner

Install MFA

Download datasets

Align using pretrained acoustic model

Train new acoustic model

About

Resources

License

Stars

Watchers

Forks

Languages