Skip to content

Voice Conversion using Feature Specific Loss Function Based Self-Attentive Generative Adversarial Network

License

Notifications You must be signed in to change notification settings

BlueBlaze6335/FLSGAN-VC

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

VOICE CONVERSION USING FEATURE SPECIFIC LOSS FUNCTION BASED SELF-ATTENTIVE GENERATIVE ADVERSARIAL NETWORK FLSGAN-VC

Authors

Sandipan DharPadmanabha BanerjeeNanda Dulal JanaSwagatam Das

Dataset links and speech samples are provided below:

Datasets

Generated Samples

CMU Arctic

VCC2018

More Generated samples of CMU Arctic and VCC2018 Dataset

Samples of CMU Arctic and VCC 2018

Code

Prerequisites

  • Linux, macOS or Windows
  • Python 3
  • CPU or NVIDIA GPU + CUDA CuDNN

Getting Started

Downloading Datasets

Download and save in "Data" folder

Installation

  • Clone this repo:
git clone https://github.com/BlueBlaze6335/FLSGAN-VC.git
cd FLSGAN-VC
  • Install all the dependencies by
pip install -r requirements.txt
ipython==8.8.0
librosa==0.9.1
matplotlib==3.4.2
numpy==1.19.5
SoundFile==0.10.3.post1
tensorflow==2.6.0
tensorflow_addons==0.17.0
torch==1.12.0
torchaudio==0.12.0
tqdm==4.42.1

Run

  • Training
python train.py
  • Testing
python inference.py

Acknowledgments

Our code is heavily inspired by MelGAN VC.

About

Voice Conversion using Feature Specific Loss Function Based Self-Attentive Generative Adversarial Network

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages