This repository contains implementation of a Wasserstein-GAN for singing voice synthesis in Tensorflow 2.5. The neural networks were trained using publicly available NUS-48E sung and spoken lyrics corpus.
The code is based on an original implementation described in WGANSing: A Multi-Voice Singing Voice Synthesizer Based on the Wasserstein-GAN, however with improvements regarding the training process and code quality. The implementation of WGAN can be reused for other similar tasks.
This project was completed as a Master's thesis in Computer Science, Adam Mickiewicz University, Poznań, Poland.