InpaintingNMF

This is the accompanying repository for the article Algorithms for audio inpainting based on probabilistic nonnegative matrix factorization authored by Ondřej Mokrý, Paul Magron, Thomas Oberlin and Cédric Févotte, published in Elsevier Signal Processing.

Audio inpainting, i.e., the task of restoring missing or occluded audio signal samples, usually relies on sparse representations or autoregressive modeling. In this paper, we propose to structure the spectrogram with nonnegative matrix factorization (NMF) in a probabilistic framework. First, we treat the missing samples as latent variables, and derive two expectation–maximization algorithms for estimating the parameters of the model, depending on whether we formulate the problem in the time- or time-frequency domain. Then, we treat the missing samples as parameters, and we address this novel problem by deriving an alternating minimization scheme. We assess the potential of these algorithms for the task of restoring short- to middle-length gaps in music signals. Experiments reveal great convergence properties of the proposed methods, as well as competitive performance when compared to state-of-the-art audio inpainting techniques.

The preprint is available at arXiv and HAL, the paper is OA and published at ScienceDirect.

figures – Exported plots as generated by the codes in results.
references – Implementation of several candidates for the state of the art in audio inpainting, with the help of InpaintingRevisited and Dictionary learning for sparse audio inpainting.
results – Data and scripts to plot them.
signals – Signals used for testing.
utils – All the functions needed to run the main files.

Psychoacoustically motivated evaluation

Note that the psychoacoustically motivated metrics (PEMO-Q, PEAQ) are not computed inside the main testing file inpainting_comparison.m. This evaluation could be performed by

uncommenting the lines related to signal saving in inpainting_comparison.m and running the whole experiment,
running the file inpainting_comparison_add_odgs.m.

The PEAQ package is included in the utils subfolder (acquired from TSP Lab of McGill University). However, the PEMO-Q software is no longer publicly available, thus it is not shared as part of this repository. For that reason, and because the processing is very time- and spacedemanding, the provided .mat files in the results subfolder include all the data precomputed.

Further dependencies

The functions related to short-time Fourier transform rely on the Large Time-Frequency Analysis Toolbox.
The dictionary learning method used as reference uses the CVX toolbox.
These codes were written in Matlab R2021a. Some functions may not be available in older versions of Matlab, e.g. tiledlayout, used commonly for plotting, was introduced in version R2019b.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

InpaintingNMF

Contents

Psychoacoustically motivated evaluation

Further dependencies

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
figures		figures
references		references
results		results
signals		signals
utils		utils
README.md		README.md
inpainting_comparison.m		inpainting_comparison.m
inpainting_comparison_add_odgs.m		inpainting_comparison_add_odgs.m
inpainting_demo.m		inpainting_demo.m

ondrejmokry/InpaintingNMF

Folders and files

Latest commit

History

Repository files navigation

InpaintingNMF

Contents

Psychoacoustically motivated evaluation

Further dependencies

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages