This repository features an adaptation of the AutoVC framework presented in Zero-Shot Voice Style Transfer with Only Autoencoder Loss. In this repository, we demonstrate the AutoVC framework's ability to disentangle singing attributes - namely singing techniques and singing content. There are multiple aspects to consider when doing this that we hope to highlight in the forthcoming publication
Audio demos will be uploaded in time.
TBC
The singing technique classifier and the wavenet vocoder must be downloaded separately.
AUTO-STC | WaveNet Vocoder |
---|---|
link | link |
Run the make_spect.py
, adapting the paths as necessary to point towards your audio files directory
Run main.py
to start training a model
--FURTHER INSTRUCTIONS WILL FOLLOW UPON COMPLETING THIS REPOSITORY IN FULL--