Torch implementation of Whisper-guided DDPM-based Voice Conversion
- DiffWave: A Versatile Diffusion Model for Audio Synthesis, Zhifeng Kong et al., 2020. [arXiv:2009.09761]
- Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data, Sungwon Kim et al., 2022. [arXiv:2205.15370]
- Variational Diffusion Models, Kingma et al., 2021. [arXiv:2107.00630]
- Whisper: Robust Speech Recognition via Large-Scale Weak Supervision, Radford et al., 2022. [openai:whisper]
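To make the guidance idea in the references above concrete, here is a minimal sketch of one classifier-guided DDPM reverse step in PyTorch. This is an illustration of the general technique (noise-estimate shifting as in classifier guidance, with the classifier gradient supplied by a Whisper-style recognizer), not the repo's actual implementation; `eps_model`, `classifier_grad`, and the schedule tensors are hypothetical placeholders.

```python
import torch

def guided_reverse_step(xt, t, eps_model, classifier_grad,
                        alpha, alpha_bar, guidance_scale=1.0):
    """One DDPM reverse step with classifier guidance (illustrative sketch).

    eps_model:       predicts the noise eps(x_t, t)
    classifier_grad: returns grad_x log p(y | x_t), e.g. from a frozen
                     Whisper-style recognizer (hypothetical here)
    alpha, alpha_bar: per-step and cumulative noise-schedule scalars
    """
    eps = eps_model(xt, t)
    # Shift the noise estimate along the classifier gradient so sampling
    # is steered toward the conditioning signal.
    eps = eps - guidance_scale * torch.sqrt(1 - alpha_bar) * classifier_grad(xt)
    # Posterior mean of x_{t-1} given the guided noise estimate.
    mean = (xt - (1 - alpha) / torch.sqrt(1 - alpha_bar) * eps) / torch.sqrt(alpha)
    if t > 0:
        # Add noise on all but the final step.
        mean = mean + torch.sqrt(1 - alpha) * torch.randn_like(xt)
    return mean
```

With `guidance_scale=0` this reduces to an unconditional DDPM step, which is a convenient sanity check.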
Tested in a Python 3.7.9 conda environment.
Download the LibriTTS dataset from OpenSLR.
To train the model, run train.py:
python train.py \
--data-dir /datasets/LibriTTS/train-clean-360
To resume training from a previous checkpoint, pass --load-epoch:
python train.py \
--data-dir /datasets/LibriTTS/train-clean-360 \
--load-epoch 20 \
--config ./ckpt/t1.json
Checkpoints are written to TrainConfig.ckpt and TensorBoard summaries to TrainConfig.log.
tensorboard --logdir ./log
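The resume logic behind --load-epoch can be sketched with standard PyTorch checkpointing. This is a generic illustration, not the repo's train.py; the checkpoint filename and dictionary keys are assumptions.

```python
import torch
import torch.nn as nn

# Stand-in model and optimizer (hypothetical; train.py defines its own).
model = nn.Linear(4, 4)
optim = torch.optim.Adam(model.parameters())

# Save a checkpoint at the end of epoch 20.
torch.save({'epoch': 20,
            'model': model.state_dict(),
            'optim': optim.state_dict()}, 't1_20.ckpt')

# Later: restore model and optimizer state, then continue training.
ckpt = torch.load('t1_20.ckpt', map_location='cpu')
model.load_state_dict(ckpt['model'])
optim.load_state_dict(ckpt['optim'])
start_epoch = ckpt['epoch'] + 1  # resume from the next epoch
```

Restoring the optimizer state alongside the model weights keeps Adam's moment estimates intact, so the resumed run behaves like an uninterrupted one.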
To run inference, use inference.py.
[TODO] Pretrained checkpoints are released on the releases page.
To use a pretrained model, download the files and unzip them. The following is a sample script:
import torch

from wgvc import WhisperGuidedVC

# load the checkpoint and restore the model
ckpt = torch.load('t1_200.ckpt', map_location='cpu')
wgvc = WhisperGuidedVC.load(ckpt)
wgvc.eval()