ResizeSignal Always takes Random Chunk #96

kevinbird15 · 2021-05-17T04:52:14Z

I was looking at using ResizeSignal to cut an input and output down to the same chunk of audio and realized that's not currently an easy thing to do. I was thinking that adding an option that would allow for crop_start to be set during initialization would be helpful. I haven't written anything yet, but wanted to see if that's something that would be helpful for anybody else.

Code in question:

class ResizeSignal(Transform):
    """Crops signal to be length specified in ms by duration, padding if needed"""

    def __init__(self, duration, pad_mode=AudioPadType.Zeros):
        self.duration = duration
        self.pad_mode = pad_mode
        if pad_mode not in [
            AudioPadType.Zeros,
            AudioPadType.Zeros_After,
            AudioPadType.Repeat,
        ]:
            raise ValueError(
                f"""pad_mode {pad_mode} not currently supported,
                only AudioPadType.Zeros, AudioPadType.Zeros_After,
                or AudioPadType.Repeat"""
            )

    def encodes(self, ai: AudioTensor) -> AudioTensor:
        sig = ai.data
        orig_samples = ai.nsamples
        crop_samples = int((self.duration / 1000) * ai.sr)
        if orig_samples == crop_samples:
            return ai
        elif orig_samples < crop_samples:
            ai.data = _tfm_pad_signal(sig, crop_samples, pad_mode=self.pad_mode)
        else:
            crop_start = random.randint(0, int(orig_samples - crop_samples)) ########################Always Random at the moment
            ai.data = sig[:, crop_start : crop_start + crop_samples]
        return ai

The other thing I though is that you could potentially set a random.seed but I wasn't able to make that solution to work in my code.

The text was updated successfully, but these errors were encountered:

scart97 · 2021-05-22T21:43:03Z

It makes sense, and would be a good addition to the library. Also I just realized that this code is not using pytorch to create random numbers, that is a major silent bug:
https://tanelp.github.io/posts/a-bug-that-plagues-thousands-of-open-source-ml-projects/

kevinbird15 · 2021-05-23T00:57:48Z

That bug is super interesting, thanks for sharing!

See this comment for the issue: fastaudio#96 (comment)

See this comment for the issue: #96 (comment)

scart97 added the enhancement New feature or request label May 22, 2021

filipmu added a commit to filipmu/fastaudio that referenced this issue Jun 18, 2021

Use pytorch randint for ResizeSignalpt

99bbc47

See this comment for the issue: fastaudio#96 (comment)

This was referenced Jun 18, 2021

Use pytorch randint for ResizeSignalpt filipmu/fastaudio#1

Merged

Use pytorch randint for ResizeSignal #103

Merged

mogwai pushed a commit that referenced this issue Sep 29, 2021

Use pytorch randint for ResizeSignalpt (#103)

3d6c0a0

See this comment for the issue: #96 (comment)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ResizeSignal Always takes Random Chunk #96

ResizeSignal Always takes Random Chunk #96

kevinbird15 commented May 17, 2021

scart97 commented May 22, 2021

kevinbird15 commented May 23, 2021

ResizeSignal Always takes Random Chunk #96

ResizeSignal Always takes Random Chunk #96

Comments

kevinbird15 commented May 17, 2021

scart97 commented May 22, 2021

kevinbird15 commented May 23, 2021