FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures

This repository provides the official PyTorch implementation of the multichannel variational autoencoder (MVAE) and its fast algorithms proposed in the papers listed below. The MVAE algorithm in this repo is the same as the one previously provided at https://github.com/lili-0805/MVAE.

We also provide pretrained models for the speaker-closed and speaker-open settings, trained on the VCC and WSJ0 datasets, respectively.

1. Hirokazu Kameoka, Li Li, Shota Inoue, and Shoji Makino, "Supervised Determined Source Separation with Multichannel Variational Autoencoder," Neural Computation, vol. 31, no. 9, pp. 1891-1914, Sep. 2019.
2. Li Li, Hirokazu Kameoka, Shota Inoue, and Shoji Makino, "FastMVAE: A fast optimization algorithm for the multichannel variational autoencoder method," IEEE Access, vol. 8, pp. 228740-228753, Dec. 2020.
3. Li Li, Hirokazu Kameoka, and Shoji Makino, "FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures," IEEE/ACM Transactions on Audio, Speech, and Language Processing, Oct. 2022.

Dependencies

The code was tested with the following packages. The full package list is in requirements.txt.

Python 3.6.6
PyTorch 1.10.2
SciPy 1.5.4
NumPy 1.19.5
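
To see whether a local environment matches the tested versions above, a small check along the following lines may help. This is only a sketch and not part of the repository: the helper names `installed_version` and `compare_versions` are hypothetical, and it assumes each package exposes a `__version__` attribute.

```python
import importlib

# Versions listed in this README as the tested configuration.
TESTED = {"torch": "1.10.2", "scipy": "1.5.4", "numpy": "1.19.5"}


def installed_version(module_name):
    """Return the module's version string, or None if it cannot be imported.

    Hypothetical helper: a local build suffix such as "+cu113" is dropped so
    that "1.10.2+cu113" compares equal to "1.10.2".
    """
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    version = getattr(module, "__version__", None)
    return version.split("+")[0] if version else None


def compare_versions(tested, installed):
    """Return {name: (tested, installed)} for every package whose version differs."""
    return {
        name: (want, installed.get(name))
        for name, want in tested.items()
        if installed.get(name) != want
    }


if __name__ == "__main__":
    found = {name: installed_version(name) for name in TESTED}
    for name, (want, have) in compare_versions(TESTED, found).items():
        print(f"{name}: tested with {want}, found {have}")
```

A reported difference does not necessarily mean the code will fail; it only indicates that the environment differs from the one the code was tested with.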

Download

Get the code:

$ git clone https://github.com/lili-0805/mvae-ss.git

Use the download script to download the training dataset, test dataset, and pretrained models.

The test samples were generated using the VCC dataset. They are therefore speaker-closed for models trained on the VCC dataset and speaker-open for models trained on the WSJ0 dataset.

Because of the license of the WSJ0 database, we do not provide the WSJ0 training dataset. Please obtain the WSJ0 database and prepare the training dataset yourself, as described in our paper #2.

$ cd mvae-ss/exe
$ bash download.sh dataset-VCC
$ bash download.sh test-samples
$ bash download.sh models
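
If you prefer to drive the downloads from Python, the three commands above can be wrapped as in the sketch below. The function names and the `exe_dir` default are assumptions made for illustration; only the target names and the `bash download.sh <target>` form come from this README.

```python
import subprocess

# Target names taken from the download commands in this README.
KNOWN_TARGETS = ("dataset-VCC", "test-samples", "models")


def download_commands(targets=KNOWN_TARGETS):
    """Return one `bash download.sh <target>` argument list per requested target."""
    unknown = [t for t in targets if t not in KNOWN_TARGETS]
    if unknown:
        raise ValueError(f"unknown download target(s): {unknown}")
    return [["bash", "download.sh", target] for target in targets]


def download(targets=KNOWN_TARGETS, exe_dir="mvae-ss/exe"):
    """Run the download script for each target, stopping at the first failure.

    exe_dir is assumed to be the exe folder of a clone made in the current
    working directory.
    """
    for command in download_commands(targets):
        subprocess.run(command, cwd=exe_dir, check=True)
```

For example, `download(["models"])` would fetch only the pretrained models, which may be enough if you do not plan to retrain.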

Usage

Use the stage option to choose between model training and testing: stage=0 runs training and stage=1 runs testing.

The following command uses the default settings: it trains the ChimeraACVAE source model on the VCC dataset and then tests the trained model with the FastMVAE2 algorithm on the downloaded test dataset.

$ ./run.sh --stage 0 --stop_stage 1 --algorithm FastMVAE2 --dataset vcc --test_mode trained --test_dataset test_input

More details are available in the run.sh bash file.
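
When running several configurations, it can be convenient to assemble the command programmatically. The sketch below is not part of the repository; `build_run_command` and `run` are hypothetical helpers, the option names and default values are copied from the command above, and it assumes run.sh lives in the exe folder next to download.sh. Check run.sh itself for the values each option accepts.

```python
import shlex
import subprocess


def build_run_command(stage=0, stop_stage=1, algorithm="FastMVAE2",
                      dataset="vcc", test_mode="trained",
                      test_dataset="test_input"):
    """Assemble the run.sh argument list using the option names shown in this README."""
    return [
        "./run.sh",
        "--stage", str(stage),
        "--stop_stage", str(stop_stage),
        "--algorithm", algorithm,
        "--dataset", dataset,
        "--test_mode", test_mode,
        "--test_dataset", test_dataset,
    ]


def run(command, cwd="mvae-ss/exe", dry_run=True):
    """Print the command; execute it only when dry_run is False.

    cwd is an assumption about where run.sh is located.
    """
    print(" ".join(shlex.quote(part) for part in command))
    if not dry_run:
        subprocess.run(command, cwd=cwd, check=True)


if __name__ == "__main__":
    # Dry run: only prints the default command from this README.
    run(build_run_command())
```

The dry-run default is a deliberate choice: training can take a long time, so the command is only printed unless `dry_run=False` is passed explicitly.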

License and citations

License: Creative Commons Attribution-NonCommercial-NoDerivs (CC-BY-NC-ND)

If you find this work useful for your research or project, please cite our papers:

@article{kameoka2019supervised,
  title={Supervised determined source separation with multichannel variational autoencoder},
  author={Kameoka, Hirokazu and Li, Li and Inoue, Shota and Makino, Shoji},
  journal={Neural Computation},
  volume={31},
  number={9},
  pages={1891--1914},
  year={2019},
  publisher={MIT Press One Rogers Street, Cambridge, MA 02142-1209, USA journals-info~…}
}
@article{li2020fastmvae,
  title={FastMVAE: A fast optimization algorithm for the multichannel variational autoencoder method},
  author={Li, Li and Kameoka, Hirokazu and Inoue, Shota and Makino, Shoji},
  journal={IEEE Access},
  volume={8},
  pages={228740--228753},
  year={2020},
  publisher={IEEE}
}
@article{li2022fastmvae2,
  title={FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures},
  author={Li, Li and Kameoka, Hirokazu and Makino, Shoji},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing},
  year={2022},
  publisher={IEEE}
}
