VITS Korean version

Installation

Python >= 3.6
Clone this repository
Install the Python requirements. Please refer to requirements.txt
1. You may need to install espeak first: apt-get install espeak
Download datasets
1. Download and extract the LJ Speech dataset, then rename or create a link to the dataset folder: ln -s /path/to/LJSpeech-1.1/wavs DUMMY1
2. For the multi-speaker setting, download and extract the VCTK dataset, and downsample the wav files to 22050 Hz. Then rename or create a link to the dataset folder: ln -s /path/to/VCTK-Corpus/downsampled_wavs DUMMY2
Build Monotonic Alignment Search and run preprocessing if you use your own datasets.

# Cython-version Monotonic Alignment Search
cd monotonic_align
python setup.py build_ext --inplace

# Preprocessing (g2p) for your own datasets. Preprocessed phonemes for LJ Speech and VCTK have already been provided.
# python preprocess.py --type ljs --filelists filelists/ljs_audio_text_train_filelist.txt filelists/ljs_audio_text_val_filelist.txt filelists/ljs_audio_text_test_filelist.txt 
# python preprocess.py --type vctk --filelists filelists/vctk_audio_sid_text_train_filelist.txt filelists/vctk_audio_sid_text_val_filelist.txt filelists/vctk_audio_sid_text_test_filelist.txt
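
If you prepare filelists for your own dataset, the sketch below shows one way to write them. It assumes each filelist holds one pipe-separated line per utterance (`wav_path|text`, or `wav_path|speaker_id|text` for multi-speaker data), which is what the `audio_text` and `audio_sid_text` file names suggest. Check an existing file under filelists/ before relying on this. The function names and example paths are hypothetical and are not part of this repository.

```python
def format_filelist_line(wav_path, text, speaker_id=None):
    """Build one filelist line, assuming the 'wav_path|text' layout
    (or 'wav_path|speaker_id|text' when a speaker id is given)."""
    # '|' is the assumed field separator, so it must not appear in the text.
    if "|" in text or "\n" in text:
        raise ValueError(f"text contains a separator or newline: {text!r}")
    if speaker_id is None:
        fields = [wav_path, text]
    else:
        fields = [wav_path, str(speaker_id), text]
    return "|".join(fields)


def write_filelist(path, rows):
    """Write rows of (wav_path, text) or (wav_path, text, speaker_id) to a file."""
    with open(path, "w", encoding="utf-8") as f:
        for row in rows:
            f.write(format_filelist_line(*row) + "\n")
```

For example, `write_filelist("filelists/my_train_filelist.txt", [("DUMMY1/utt0001.wav", "some transcript")])` would produce a single-speaker filelist that can then be passed to preprocess.py through --filelists.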

Training Example

# LJ Speech format
python train.py -c configs/ljs_base.json -m ljs_base

# Start training from a pretrained model - LJ Speech format
python train.py -c configs/ljs_base.json -m ljs_base -w pre_trained

# VCTK
python train_ms.py -c configs/vctk_base.json -m vctk_base
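
The setup steps ask for the VCTK audio to be downsampled to 22050 Hz. Before starting a multi-speaker run, it may help to confirm that the linked folder really contains audio at that rate. The following is a minimal sketch using only the standard library. `find_wrong_rate` is a hypothetical helper, not part of this repository, and the `wave` module reads only uncompressed PCM WAV files.

```python
import wave
from pathlib import Path

TARGET_SR = 22050  # sampling rate named in the dataset step


def find_wrong_rate(wav_dir, target_sr=TARGET_SR):
    """Return (path, rate) pairs for .wav files whose sampling rate
    differs from target_sr, searching wav_dir recursively."""
    mismatched = []
    for path in sorted(Path(wav_dir).rglob("*.wav")):
        # wave.open takes a filename string; read only the header info.
        with wave.open(str(path), "rb") as wav:
            rate = wav.getframerate()
        if rate != target_sr:
            mismatched.append((path, rate))
    return mismatched
```

Calling `find_wrong_rate("DUMMY2")` returns an empty list when every file found is at 22050 Hz. Any entries it returns are files that still need resampling.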

Inference Example

See inference_cpu.ipynb
