Articulation GAN: Unsupervised Modeling of Articulatory Learning

A PyTorch Implementation for Articulation GAN

Setup

git clone https://github.com/gbegus/articulationGAN.git
cd articulationGAN
pip install -r requirements.txt

Here is a link to a folder containing the weights of several ema2wav models.

Training

cd articulationGAN
python train.py --datadir data_dir/ --logdir log_dir/ --emadir articulatory_weights/ --ciw

Here is a list of the possible command line options for training:

Argument	Description
datadir	Path to a folder containing .wav files for training the model
logdir	Path to the folder where checkpoints and training logs will be stored
emadir	Path to a folder containing the weights of the ema2wav model
slice_len	Slice length of training samples. Shorter samples will be zero-padded and longer samples will be cropped to the specified length. The provided ema2wav models only support the default slice_len of 20480.
kernel_len	Kernel length of the ArticulationGAN generator. Must be an odd integer; the suggested range is from 3 to 25
num_channels	Possible values: 12 or 13 The number of EMA channels that the model will generate. The provided folder contains ema2wav models supporting 12 and 13 channels, which will be automatically loaded based on the value of this argument.
log_audio	If used, this flag will allow the trainer to log sample audio files and EMA plots periodically. Otherwise, only the losses will be saved in the training log. This may increase the filesize for longer runs.
num_categ	The number of categories used for Q-network training. This should be equivalent to the number of classes in the training dataset.
ciw or fiw	Mutually exclusive arguments that determine whether categorical (ciw) or featural (fiw) z-vectors will be used in the generator. One of the two is required to enable learning using the Q network. ciw is generally recommended for most training runs. More information can be found in this paper.
save_int	Save interval in epochs
batch_size	Batch size
cont	Provide the epoch number to resume training from a specific checkpoint, or set to "last" to continue from the last available checkpoint.

Citation

  author={Beguš, Gašper and Zhou, Alan and Wu, Peter and Anumanchipalli, Gopala K.},
  booktitle={ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, 
  title={Articulation GAN: Unsupervised Modeling of Articulatory Learning}, 
  year={2023},
  volume={},
  number={},
  pages={1-5},
  doi={10.1109/ICASSP49357.2023.10096800}}

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
articulatory		articulatory
README.md		README.md
infowavegan.py		infowavegan.py
requirements.txt		requirements.txt
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Articulation GAN: Unsupervised Modeling of Articulatory Learning

Setup

Training

Citation

About

Releases

Packages

Contributors 3

Languages

gbegus/articulationGAN

Folders and files

Latest commit

History

Repository files navigation

Articulation GAN: Unsupervised Modeling of Articulatory Learning

Setup

Training

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages