msaddler/auditory-model-denoising

Speech Denoising with Auditory Models (arXiv, audio examples)

This is a TensorFlow implementation of the denoising models from our paper, Speech Denoising with Auditory Models.

Contact: Mark Saddler or Andrew Francl

Citation

If you use our code for research, please cite our paper: Mark R. Saddler*, Andrew Francl*, Jenelle Feather, Kaizhi Qian, Yang Zhang, Josh H. McDermott (2021). Speech Denoising with Auditory Models. Proc. Interspeech 2021, 2681-2685. arXiv:2011.10706.

License

The source code is published under the MIT license; see LICENSE for details. In general, you can use the code for any purpose with proper attribution. If you do something interesting with the code, we'd be happy to hear about it. Feel free to contact us.

Requirements

To speed up setup and aid reproducibility, we provide a Singularity container. The container includes all libraries and dependencies needed to run the code, letting you work in the same environment originally used. Please see the Singularity Documentation for more details. Download the Singularity image: tensorflow-1.13.0-denoising.simg.
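Once the image is downloaded, code can be run inside the container with `singularity exec`. A typical invocation might look like the following; the script name is a placeholder for whichever entry point you use, not a file shipped with the repo:

```shell
# Run a script inside the container.
# --nv exposes the host's NVIDIA GPU drivers to the container (omit for CPU-only use).
# "my_script.py" is a placeholder entry point.
singularity exec --nv tensorflow-1.13.0-denoising.simg python my_script.py
```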

Trained Models

We provide model checkpoints for all of our trained audio transforms and deep feature recognition networks. Users must download the audio transform checkpoints to evaluate our denoising algorithms on their own audio. Both sets of checkpoints must be downloaded to run our DEMO Jupyter notebook. Download the entire auditory-model-denoising/models directory here:

Quick Start

We provide a Jupyter notebook that (1) demonstrates how to run our trained denoising models and (2) provides examples of how to compute the auditory model losses used to train them. A second notebook demonstrates how to train a new audio transform using the auditory model losses.
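The auditory model losses compare a denoised signal to its clean reference in the feature space of an auditory model rather than in the waveform domain. The notebook shows the actual computation; as a rough illustration only, a deep-feature loss of this kind can be sketched in plain NumPy. The placeholder activation arrays and the choice of L1 distance here are assumptions for the sketch, not necessarily the exact quantities used in the paper:

```python
import numpy as np

def deep_feature_loss(feats_clean, feats_denoised):
    """Average L1 distance between auditory-model activations.

    feats_clean / feats_denoised: lists of per-layer activation arrays
    obtained by running the clean and denoised audio through an auditory
    model. In the real pipeline these come from the trained networks;
    here, any pair of matching arrays will do.
    """
    assert len(feats_clean) == len(feats_denoised)
    per_layer = [
        np.mean(np.abs(c - d))  # mean absolute difference within one layer
        for c, d in zip(feats_clean, feats_denoised)
    ]
    return float(np.mean(per_layer))  # average the per-layer distances
```

Minimizing a loss of this form pushes the audio transform to match the clean signal's internal representation in the auditory model, which is the central idea the notebooks walk through.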
