Is there any built-in data augmentation function? #49

LeeYongHyeok · 2020-09-24T04:42:09Z

Hi, I'm so impressed by your wonderful project.
But, I want to know how can i augment the training data (ex. SNR control, time-stretch, speed perturbation, volume or pitch control, specaugment ...)
In the torchaudio, there are built-in parameter for the data transformations. https://pytorch.org/tutorials/beginner/audio_preprocessing_tutorial.html#transformations
Is there any built-in function or parameter for the data augmentation?

KinWaiCheuk · 2020-09-28T03:40:46Z

Hi YongHyeok, thanks for you question. Unfortunately, there is no build in data augmentation in nnAudio at the moment. If you have the augmentation function, however, you can apply it to the PyTorch tensor returned by nnAudio.

LeeYongHyeok · 2020-09-29T07:47:24Z

@KinWaiCheuk
Thanks for your detailed answer. Do you have the to-do plan about these function?

KinWaiCheuk · 2020-09-29T08:04:52Z

Yes, I do wish to expand nnAudio by adding more features to it (data augmentation is one of the items on my mind). But we are lack of manpower at the moment (me and my current collaborator are occupied with improving the GriffinLim reconstruction accuracy).

If you know anyone who are interested in contributing, please let me know, it would be a great help for us!

hbredin · 2020-10-21T06:49:22Z

Regarding audio augmentation, you might want to join forces with https://github.com/asteroid-team/torch-audiomentations.

KinWaiCheuk · 2020-10-21T07:50:49Z

Regarding audio augmentation, you might want to join forces with https://github.com/asteroid-team/torch-audiomentations.

Wow, it is a really awesome project! If it is the case, I think a build-in data augmentation is not necessary for nnAudio? Since you can always keep one line of code as audio-domain augmentation, another line of code for nnAudio, e.g.

import torch
from torch_audiomentations import Gain
from nnAudio import Spectrogram
class MyModel():
    def __init___(args):
        self.augmentation_layer = Gain(min_gain_in_db=-15.0, max_gain_in_db=5.0, p=0.5)
        self.nnaudio_layer = Spectrogram.STFT()
        self.neuralnet = SomeNeuralNet()

    def forward(x):
        x = self.augmentation_layer(x)
        x = self.nnaudio_layer(x)
        x = self.neuralnet(x)
        return x

Is there any augmentation that cannot be done in this way?

hbredin · 2020-10-21T10:36:21Z

I am sure that someone will eventualy come up with a new augmentation technique that must be done in a different way but I don't know of any for now.

fdroessler · 2021-01-10T19:04:41Z

There is also Kornia, I guess that could be used as long as they provide the necessary Augmentations:
https://kornia.readthedocs.io/en/latest/tutorials/data_augmentation.html
and: https://arxiv.org/pdf/2011.09832.pdf

KinWaiCheuk added the enhancement New feature or request label Oct 20, 2020

LeeYongHyeok closed this as completed Sep 13, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is there any built-in data augmentation function? #49

Is there any built-in data augmentation function? #49

LeeYongHyeok commented Sep 24, 2020

KinWaiCheuk commented Sep 28, 2020

LeeYongHyeok commented Sep 29, 2020

KinWaiCheuk commented Sep 29, 2020

hbredin commented Oct 21, 2020

KinWaiCheuk commented Oct 21, 2020

hbredin commented Oct 21, 2020

fdroessler commented Jan 10, 2021

Is there any built-in data augmentation function? #49

Is there any built-in data augmentation function? #49

Comments

LeeYongHyeok commented Sep 24, 2020

KinWaiCheuk commented Sep 28, 2020

LeeYongHyeok commented Sep 29, 2020

KinWaiCheuk commented Sep 29, 2020

hbredin commented Oct 21, 2020

KinWaiCheuk commented Oct 21, 2020

hbredin commented Oct 21, 2020

fdroessler commented Jan 10, 2021