AutoVC-WavRNN

A voice conversion system.

This repository provides a PyTorch implementation of AutoVC-WavRNN.

Audio Demo

Audio demos for AutoVC-WavRNN can be found in the results folder.

Data Preprocess

To get the audio:

1. Load the wav and rescale it by its maximum absolute value.

2. Normalize the volume of the wav (target level: -30 dBFS).

3. Skip utterances that are too short (less than 1.5 s).
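
The three audio steps above can be sketched roughly as follows. This is a minimal NumPy illustration, not the repository's actual code: the function names are hypothetical, the waveform is assumed to be already loaded as a float array, and the volume is assumed to be measured as RMS level in dBFS. Only the -30 dBFS target and the 1.5 s threshold come from the steps above.

```python
import numpy as np

TARGET_DBFS = -30.0    # target volume from step 2
MIN_DURATION_S = 1.5   # minimum utterance length from step 3


def rescale_peak(wav):
    """Step 1: divide by the max absolute value so the peak becomes 1.0."""
    peak = np.max(np.abs(wav))
    return wav / peak if peak > 0 else wav


def normalize_volume(wav, target_dbfs=TARGET_DBFS):
    """Step 2: apply a gain so the RMS level equals target_dbfs.

    Assumption: dBFS is computed from the RMS of the whole utterance.
    """
    rms = np.sqrt(np.mean(wav ** 2))
    if rms == 0:
        return wav  # silent input, nothing to scale
    current_dbfs = 20.0 * np.log10(rms)
    gain = 10.0 ** ((target_dbfs - current_dbfs) / 20.0)
    return wav * gain


def preprocess_wav(wav, sample_rate):
    """Run steps 1-3; return None for utterances that should be skipped."""
    wav = np.asarray(wav, dtype=np.float32)
    if wav.size == 0:
        return None
    wav = normalize_volume(rescale_peak(wav))
    # Step 3: skip utterances shorter than MIN_DURATION_S seconds.
    if len(wav) / sample_rate < MIN_DURATION_S:
        return None
    return wav
```

A caller would pass each loaded waveform through preprocess_wav and drop the ones that come back as None.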

To get the mel-spectrogram:

4. Apply pre-emphasis (filter coefficient 0.97).

5. Compute the STFT (n_fft=1024, hop_size=256, win_size=1024).

6. Build the mel filter bank.

7. Take the inner product of the STFT result and the mel filter bank to get an 80-channel mel-spectrogram.

8. Convert amplitude to dB (ref_level_db=16).

9. Normalize the mel-spectrogram to [0, 1].
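
As a rough illustration of steps 4-9, the sketch below computes a normalized mel-spectrogram with plain NumPy. It is not the repository's implementation. The parameters 0.97, n_fft=1024, hop_size=256, win_size=1024, 80 channels, and ref_level_db=16 come from the steps above; the 16 kHz sample rate, the Hann window, the unpadded framing, the HTK-style mel formula, the -100 dB floor, and all function names are assumptions made for the example.

```python
import numpy as np

N_FFT, HOP_SIZE, WIN_SIZE = 1024, 256, 1024  # step 5
N_MELS = 80                                  # step 7
PREEMPHASIS = 0.97                           # step 4
REF_LEVEL_DB = 16.0                          # step 8
SAMPLE_RATE = 16000                          # assumption, not stated in the README
MIN_LEVEL_DB = -100.0                        # assumption, floor used for normalization


def preemphasis(wav, coef=PREEMPHASIS):
    """Step 4: y[n] = x[n] - coef * x[n-1]."""
    return np.append(wav[0], wav[1:] - coef * wav[:-1])


def stft_magnitude(wav):
    """Step 5: magnitude STFT with a Hann window and no padding (assumptions)."""
    window = np.hanning(WIN_SIZE)
    n_frames = 1 + (len(wav) - N_FFT) // HOP_SIZE
    frames = np.stack([wav[i * HOP_SIZE:i * HOP_SIZE + N_FFT]
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames * window, n=N_FFT, axis=1)).T


def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filter_bank(sr=SAMPLE_RATE, n_fft=N_FFT, n_mels=N_MELS):
    """Step 6: triangular filters spaced evenly on the mel scale."""
    hz_pts = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2), n_mels + 2))
    fft_freqs = np.linspace(0.0, sr / 2, n_fft // 2 + 1)
    bank = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(n_mels):
        left, center, right = hz_pts[m:m + 3]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        bank[m] = np.maximum(0.0, np.minimum(rising, falling))
    return bank


def mel_spectrogram(wav):
    """Steps 4-9: returns an array of shape (N_MELS, n_frames) in [0, 1]."""
    mag = stft_magnitude(preemphasis(np.asarray(wav, dtype=np.float64)))
    mel = mel_filter_bank() @ mag                               # step 7
    floor = 10.0 ** (MIN_LEVEL_DB / 20.0)
    db = 20.0 * np.log10(np.maximum(floor, mel)) - REF_LEVEL_DB  # step 8
    return np.clip((db - MIN_LEVEL_DB) / -MIN_LEVEL_DB, 0.0, 1.0)  # step 9
```

The input must be at least n_fft samples long. Values at or below the assumed floor map to 0, and anything at or above 0 dB (after subtracting ref_level_db) maps to 1.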

To preprocess the audio, run: python synthesizer_preprocess_audio.py /data

To preprocess the speaker embeddings, run: python synthesizer_preprocess_embeds.py /data/SV2TTS_autovc-ttsdb/synthesizer_vad -e pretrained.pt

Pretrained model

Speaker encoder pretrained model: https://drive.google.com/file/d/1n1sPXvT34yXFLT47QZA6FIRGrwMeSsZc/view

The AutoVC-WavRNN model, trained on a mix of the VCTK and VCC2020 datasets, is in the /model folder. Select all of the rar files and extract them together.

AutoVC Training step

The AutoVC training step is implemented in autovc_train.py, and the AutoVC model is defined in model_vc.py.

To train AutoVC, run: python autovc_train.py autovc-vcc2020 /data -g

WavRNN Training step

To train the vocoder, run: python vocoder_train.py my_vocoder /data/ -g

Inference

To convert audio, run: python convert.py

Relevant Repositories

CorentinJ/Real-Time-Voice-Cloning: https://github.com/CorentinJ/Real-Time-Voice-Cloning

auspicious3000/autovc: https://github.com/auspicious3000/autovc

Relevant Paper

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss: https://arxiv.org/abs/1905.05879
