visinger-speech

基于fs2、vits、visinger的tts模型（暂时还在开发调试中）（效果暂时依旧不太满意）

模型结构

总的来说基本就是将fastspeech2的VarianceAdapter结构添加进了vits

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
assets		assets
configs		configs
data		data
dataset		dataset
filelists		filelists
mfa_temp		mfa_temp
text		text
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
attentions.py		attentions.py
commons.py		commons.py
data_utils.py		data_utils.py
f0energy.py		f0energy.py
frame_prior_network.py		frame_prior_network.py
gui.py		gui.py
inference.ipynb		inference.ipynb
inference.py		inference.py
inference_api.py		inference_api.py
losses.py		losses.py
mel_processing.py		mel_processing.py
merge_dataset.py		merge_dataset.py
models.py		models.py
modules.py		modules.py
post_mfa.py		post_mfa.py
prepare_mfa.py		prepare_mfa.py
preprocess_config.py		preprocess_config.py
requirements.txt		requirements.txt
train.py		train.py
transforms.py		transforms.py
utils.py		utils.py