FastSpeech 2 - Multilingual

运行方法

python synthesize.py --restore_step 900000 --mode mixed --text "Numbers如何寻找最优特征？是有放回还是无放回的呢？"  -p config/hcsi_10speakers/preprocess.yaml -m config/hcsi_10speakers/model.yaml -t config/hcsi_10speakers/train.yaml --speaker_id 5 --duration_control 1 --pitch_control 1 --energy_control 1

说话人

speaker 0-9：

0 Spk4.CN.03FR00
1 Spk9.EN.F.DB2
2 Spk8.EN.F.DB1
3 Spk2.CN.Deng
4 Spk1.CN.DataBaker
5 Spk6.CN.Pachira
6 Spk7.EN.M.DB1
7 Spk3.EN.XuYue
8 Spk5.CN.03MR00
9 Spk0.EN.LJSpeech

mfa提取

mfa align raw_data/hcsi_10speakers/Spk0.EN.LJSpeech/ lexicon/librispeech-lexicon.txt english preprocessed_data/hcsi_10speakers/Spk0.EN.LJSpeech/ --clean --disable_textgrid_cleanup --optional_silence_phone sp --other_noise_phone onp


# new_acoustic_model 放在 /ceph/home/huangqc18/Documents/MFA/pretrained_models/acoustic中
mfa adapt raw_data/hcsi_10speakers/Spk1.CN.DataBaker/ lexicon/pinyin-lexicon-r.txt new _acoustic_model preprocessed_data/hcsi_10speakers/Spk1.CN.DataBaker/ --clean --disable_textgrid_cleanup --optional_silence_phone sp --other_noise_phone onp

References

FastSpeech 2: Fast and High-Quality End-to-End Text to Speech, Y. Ren, et al.
ming024's FastSpeech implementation

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.idea		.idea
audio		audio
config		config
hifigan		hifigan
lexicon		lexicon
model		model
preprocessed_data		preprocessed_data
preprocessor		preprocessor
text		text
transformer		transformer
utils		utils
.gitignore		.gitignore
README.md		README.md
add_lexicon.py		add_lexicon.py
code_switch.py		code_switch.py
control.sh		control.sh
dataset.py		dataset.py
en.sh		en.sh
evaluate.py		evaluate.py
mfa_config.yaml		mfa_config.yaml
new_acoustic_model.zip		new_acoustic_model.zip
phoneme embedding.png		phoneme embedding.png
prepare_align.py		prepare_align.py
prepare_align0.py		prepare_align0.py
prepare_align1.py		prepare_align1.py
prepare_align2.py		prepare_align2.py
prepare_align3.py		prepare_align3.py
prepare_align4.py		prepare_align4.py
prepare_align5.py		prepare_align5.py
prepare_align6.py		prepare_align6.py
prepare_align7.py		prepare_align7.py
prepare_align8.py		prepare_align8.py
prepare_align9.py		prepare_align9.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
run.sh		run.sh
speaker embedding.png		speaker embedding.png
statistics.py		statistics.py
statistics12.json		statistics12.json
synthesize.py		synthesize.py
test.py		test.py
test.txt		test.txt
test_energy.py		test_energy.py
train.py		train.py
tsne.py		tsne.py
zh.sh		zh.sh

AugustRush/FastSpeech2-Multilingual

Folders and files

Latest commit

History

Repository files navigation

FastSpeech 2 - Multilingual

References

About

Resources

Stars

Watchers

Forks

Languages