StyleVC - PyTorch official implementation

This is a PyTorch implementation of StyleVC: Non-Parallel Voice Conversion with Adversarial Style Generalization. Feel free to use and modify the code, and please reference our repository when you do.

Updates

2022/02/04 Released the official StyleVC code.

Demo Samples

Audio samples generated by this implementation can be found here.

Quick Start

You can quickly run the model using Google Colab.

Run the 'inference.ipynb' file in Colab here.

Install Dependencies

(Optional) You can create an environment using Anaconda:

conda create -n py37torch17 python=3.7.9

(Optional) Then activate your conda environment and install PyTorch and TensorFlow:

conda activate py37torch17
conda install pytorch=1.7 torchvision torchaudio cudatoolkit=10.1 -c pytorch
pip install --upgrade tensorflow-gpu==1.15

You can install the Python dependencies with:

pip install -r requirements.txt
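
After installation, a quick sanity check can confirm that the main packages are importable. The sketch below uses only the standard library. The package list (`torch`, `torchaudio`, `tensorflow`) is taken from the install commands above, and the helper name `check_packages` is hypothetical and not part of this repo.

```python
import importlib.util


def check_packages(names):
    """Return a dict mapping each top-level package name to True if it can be found."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Packages named in the install commands above.
    for name, found in check_packages(["torch", "torchaudio", "tensorflow"]).items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

This only checks that a package can be located. It does not verify versions or GPU support.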

Train Your Model

Datasets

Preprocessing is supported for the VCTK dataset.

VCTK: CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit (version 0.92) https://datashare.ed.ac.uk/handle/10283/3443

Preprocessing

You can refer to the sample file and the file structure below on GitHub. To preprocess the data, run the following command.

python prepare_dataset.py --in_dir data/VCTK/original/ --out_dir_name VCTK_16K --dataset VCTK

The file structure after preprocessing is as follows:

├── data
│   ├── VCTK
│   │   ├── original    
│   │   │   ├── wav48
│   │   │   │   ├── wavs
│   │   │   ├── metadata.csv
│   │   ├── VCTK22K   
│   │   │   ├── train
│   │   │   │   ├── p225
│   │   │   │   │   ├── p225_021.npz
│   │   │   │   │   ├── ...
│   │   │   │   │   ├── p225_423.npz
│   │   │   │   ├── ...
│   │   │   │   ├── p376
│   │   │   ├── val
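
To confirm that preprocessing produced the layout shown above, you can count the `.npz` files per speaker folder. This is a minimal sketch using only the standard library. The function name `count_utterances` and the default path in the usage section are assumptions for illustration, so adjust the path to your own output folder.

```python
from collections import Counter
from pathlib import Path


def count_utterances(split_dir):
    """Count .npz files per speaker folder (e.g. p225) under a split directory."""
    counts = Counter()
    for npz_path in Path(split_dir).glob("*/*.npz"):
        counts[npz_path.parent.name] += 1
    return dict(counts)


if __name__ == "__main__":
    # Assumed location, taken from the tree above; change it if yours differs.
    split = Path("data/VCTK/VCTK22K/train")
    if split.is_dir():
        for speaker, n in sorted(count_utterances(split).items()):
            print(speaker, n)
    else:
        print(f"{split} not found")
```

A speaker folder that is missing from the output, or has far fewer files than the others, usually points to a problem in the input directory.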

Train

To train, set the hyperparameters in model/hparams.py and run the following command.

python trainer.py --dataset VCTK --dataset_name VCTK_16K --log_dir StyleVC_VCTK_test01

Vocoder

We used a fine-tuned HiFi-GAN vocoder. Download the checkpoint and config file below and save them in 'vocoder/checkpoint'.

Model	Checkpoint file	Config file
VCTK	Download	Download

Inference

python inference.py

Checkpoint

We provide a pretrained checkpoint. Download the checkpoint file below and put it in 'outputs/StyleVC_VCTK'.

Model	Checkpoint file
VCTK	Download
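
Before running inference, it can help to check that the downloaded files are where the scripts expect them. The sketch below only tests that the two directories named in this README exist and contain at least one file. The helper name `empty_or_missing` is hypothetical, and no specific checkpoint file names are assumed.

```python
from pathlib import Path

# Directories named in this README; the file names inside them are not assumed.
REQUIRED_DIRS = ["vocoder/checkpoint", "outputs/StyleVC_VCTK"]


def empty_or_missing(dirs, root="."):
    """Return the directories that do not exist or contain no files."""
    problems = []
    for d in dirs:
        path = Path(root) / d
        if not path.is_dir() or not any(p.is_file() for p in path.iterdir()):
            problems.append(d)
    return problems


if __name__ == "__main__":
    problems = empty_or_missing(REQUIRED_DIRS)
    if problems:
        print("Missing or empty:", ", ".join(problems))
    else:
        print("Checkpoint directories look populated.")
```

Run it from the repository root so the relative paths resolve correctly.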

Citation

Please cite the paper if you find StyleVC useful.

@inproceedings{hwang2022stylevc,
  title={StyleVC: Non-Parallel Voice Conversion with Adversarial Style Generalization},
  author={Hwang, In-Sun and Lee, Sang-Hoon and Lee, Seong-Whan},
  booktitle={2022 26th International Conference on Pattern Recognition (ICPR)},
  pages={23--30},
  year={2022},
  organization={IEEE}
}
