Speed sampling in speed augmentation is counterintuitive #207

mikolajpabiszczak · 2021-02-12T12:23:56Z

Currently speed augmentation (nlpaug/augmenter/audio/speed.py) behaves counterintuitively to what user may expect. Namely, the method for get_random_factor as defined there:

    def get_random_factor(self):
        speeds = [round(i, 1) for i in np.arange(self.factor[0], self.factor[1], 0.1)]
        speeds = [s for s in speeds if s != 1.0]
        return speeds[np.random.randint(len(speeds))]

results in very small number of possible speeds used, e.g. for factor=(0.9, 1.1) there will be 3 values of speeds used (0.9, 1.0, 1.1). In particular, even lower amount of audio (than user would expect) will be augmented as there 1:3 probability that we sample the value 1.0!

User expects uniform sampling from the interval given by factor, so the get_random_factor should look like:

    def get_random_factor(self):
        return  (self.factor[1] - self.factor[0]) * np.random.random_sample + self.factor[0]

The text was updated successfully, but these errors were encountered:

makcedward added the enhancement New feature or request label Mar 7, 2021

makcedward added a commit that referenced this issue Nov 24, 2021

[#207] Solve SpeedAug random factor bug

5677d62

makcedward closed this as completed Dec 22, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed sampling in speed augmentation is counterintuitive #207

Speed sampling in speed augmentation is counterintuitive #207

mikolajpabiszczak commented Feb 12, 2021

Speed sampling in speed augmentation is counterintuitive #207

Speed sampling in speed augmentation is counterintuitive #207

Comments

mikolajpabiszczak commented Feb 12, 2021