StyleFormant

KeonLee님이 구현하신 StyleSpeech와 FastPitchFormant의 구성 요소를 합쳤습니다. 특별히 StyleSpeech의 few shot을 target으로 한 task1과, 텍스트와 운율을 별도로 모델링하는 task2를 복합적으로 모델링 하였습니다.

ksc2021에서 학부생 포스터 발표를 진행하였고 이에 해당하는 paper 입니다.

Usage

Pretrained Model

Quick Inference

pretrained model을 따로 폴더의 경로를 만들어, ./output/ckpt/LibriTTS_meta_learner 에 위치하도록 해주세요.

python synthesize.py --text "TEXT" --ref_audio RefAudio/lj_02_gt.wav --restore_step 60000 --mode single -p config/LibriTTS/preprocess.yaml -m config/LibriTTS/model.yaml -t config/LibriTTS/train.yaml

를 실행하면 ./output/result/LibriTTS_meta_learner 에 실행 결과가 나타납니다.

Preprocess

Quick Start

keonlee님이 제공해주신 StyleSpeech의 TextGrid들 중 LibriTTS.zip을 다운로드하고 unzip한 폴더를 preprocessed_data/LibriTTS/TextGrid/ 하단에 위치시킵니다.

Start with scratch

Montreal Forced Aligner를 다운로드합니다.
LibriTTS 데이터셋을 다운받아 raw_data의 하위 폴더로 넣어줍니다.
둘 중 하나의 명령어로 alignment 작업을 진행합니다.

./montreal-forced-aligner/bin/mfa_align raw_data/LibriTTS/ lexicon/librispeech-lexicon.txt english preprocessed_data/LibriTTS

혹은

./montreal-forced-aligner/bin/mfa_train_and_align raw_data/LibriTTS/ lexicon/librispeech-lexicon.txt preprocessed_data/LibriTTS

Train

python train.py -p config/LibriTTS/preprocess.yaml -m config/LibriTTS/model.yaml -t config/LibriTTS/train.yaml

Inference

python3 synthesize.py --text "Hello world." --ref_audio RefAudio/lj_02_gt.wav --restore_step 60000 --mode single -p config/LibriTTS/preprocess.yaml -m config/LibriTTS/model.yaml -t config/LibriTTS/train.yaml

Reference

https://github.com/keonlee9420/StyleSpeech https://github.com/KevinMIN95/StyleSpeech (official) https://github.com/keonlee9420/FastPitchFormant

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
RefAudio		RefAudio
__pycache__		__pycache__
audio		audio
config/LibriTTS		config/LibriTTS
hifigan		hifigan
lexicon		lexicon
model		model
preprocessor		preprocessor
text		text
utils		utils
.gitignore		.gitignore
README.md		README.md
dataset.py		dataset.py
evaluate.py		evaluate.py
filelist_filtering.py		filelist_filtering.py
prepare_align.py		prepare_align.py
preprocess.py		preprocess.py
run.sh		run.sh
run1.sh		run1.sh
synthesize.py		synthesize.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

StyleFormant

Usage

Pretrained Model

Quick Inference

Preprocess

Quick Start

Start with scratch

Train

Inference

Reference

About

Releases

Packages

Languages

KeyboarderSon/StyleFormant

Folders and files

Latest commit

History

Repository files navigation

StyleFormant

Usage

Pretrained Model

Quick Inference

Preprocess

Quick Start

Start with scratch

Train

Inference

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages