Implementation of Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder.
We modify the variational auto-encoder (VAE) for our voice conversion (VC) task so that it can be trained on non-parallel corpora. The overall architecture has three modules:
- feature extraction
- conversion
- speech synthesis
The first and third modules rely on the STRAIGHT vocoder, so they are not included in this repository.
*Speaker-dependent global variance was applied to the resulting spectra.
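
The conversion module can be pictured roughly as follows. This is a minimal NumPy sketch, not the code in this repository: it assumes the decoder is conditioned on a one-hot speaker code, and the dimensions, the random linear layers, and every function name are hypothetical placeholders for trained networks. Because each frame is encoded on its own and decoded under a speaker label, no frame-aligned source/target pairs are needed.

```python
import numpy as np

rng = np.random.default_rng(0)

FRAME_DIM = 513   # assumed spectral frame size (hypothetical)
LATENT_DIM = 64   # assumed latent size (hypothetical)
N_SPEAKERS = 2    # assumed number of speakers (hypothetical)

# Randomly initialized linear layers stand in for trained networks.
W_mu = rng.normal(scale=0.01, size=(FRAME_DIM, LATENT_DIM))
W_logvar = rng.normal(scale=0.01, size=(FRAME_DIM, LATENT_DIM))
W_dec = rng.normal(scale=0.01, size=(LATENT_DIM + N_SPEAKERS, FRAME_DIM))


def encode(x):
    """Map frames (n, FRAME_DIM) to the mean and log-variance of q(z|x)."""
    return x @ W_mu, x @ W_logvar


def sample(mu, logvar):
    """Reparameterization trick used in training: z = mu + sigma * eps."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * logvar) * eps


def decode(z, speaker_id):
    """Reconstruct frames from z concatenated with a one-hot speaker code."""
    onehot = np.zeros((z.shape[0], N_SPEAKERS))
    onehot[:, speaker_id] = 1.0
    return np.concatenate([z, onehot], axis=1) @ W_dec


def convert(x, target_speaker):
    """Encode source frames, then decode with the target speaker's code."""
    mu, _ = encode(x)  # use the posterior mean at conversion time
    return decode(mu, target_speaker)


frames = rng.standard_normal((10, FRAME_DIM))  # stand-in for extracted features
converted = convert(frames, target_speaker=1)
```

In this sketch, training would reconstruct each frame with its own speaker's code, while conversion simply swaps in the target speaker's code at decoding time.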
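
One common way to apply global variance is to rescale each feature dimension of the converted spectra so that its variance matches a statistic measured on the target speaker. The sketch below shows that form only as an illustration; the function name `apply_global_variance`, the per-utterance scope, and the `eps` guard are assumptions, and the exact procedure used for the reported results is not given here.

```python
import numpy as np


def apply_global_variance(converted, target_gv, eps=1e-8):
    """Rescale converted frames so each dimension has the target variance.

    converted: array of shape (n_frames, dim), e.g. log-spectral frames.
    target_gv: array of shape (dim,), variance measured on the target speaker.
    """
    mean = converted.mean(axis=0, keepdims=True)
    current_gv = converted.var(axis=0, keepdims=True)
    # eps avoids division by zero for dimensions that are constant.
    scale = np.sqrt(target_gv / (current_gv + eps))
    # Keep the per-dimension mean and stretch the deviations around it.
    return mean + scale * (converted - mean)
```

The per-dimension mean is left unchanged, so only the dynamic range of the converted trajectory is adjusted.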
Training:

    python train.py --data_dir [path] --file_filter [regexp]
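
How `train.py` interprets `--file_filter` is not documented here. As a rough sketch, assuming the regular expression is searched against each file path under `--data_dir`, the selection could look like the following; `list_training_files` is a hypothetical helper, not a function from this repository.

```python
import os
import re


def list_training_files(data_dir, file_filter):
    """Return sorted paths under data_dir whose path matches file_filter."""
    pattern = re.compile(file_filter)
    matches = []
    for root, _dirs, files in os.walk(data_dir):
        for name in files:
            path = os.path.join(root, name)
            # re.search matches anywhere in the path, not only at the start.
            if pattern.search(path):
                matches.append(path)
    return sorted(matches)
```

Under this assumption, a filter such as `spk_a.*\.bin$` (hypothetical directory and extension) would keep only the `.bin` files of one speaker.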