MelSpectrogram: Use given n_fft to initialize fb immediately #246

jamarshon · 2019-08-19T19:53:37Z

In the issue #245, it was determined that for MelSpectrogram the transform already knows how many bins the stft will have and thus can initialize the fb matrix without needing to infer the dimensions of its first input.
The benefit of initializing the fb matrix immediately as opposed to lazy load when given a first input is that loading and saving from state dict (load_state_dict) will not throw an error as the tensor sizes match

vincentqb · 2019-08-19T21:36:10Z

torchaudio/transforms.py

                                       pad=self.pad, window_fn=window_fn, power=2,
                                       normalized=False, wkwargs=wkwargs)
-        self.mel_scale = MelScale(self.n_mels, self.sample_rate, self.f_min, self.f_max)
+        self.mel_scale = MelScale(self.n_mels, self.sample_rate, self.f_min, self.f_max, self.n_fft // 2 + 1)


Is there a way to read off the size of the spectrogram instead of recomputing it?

Not unless the spectrogram computes a specgram and we look at the tensor's dimension

If that constant n_fft // 2 + 1 is used in a few places within Spectrogram, we could attach it to Spectrogram. If that's our inferred default dimension, we could also just attach a default_dimension to Spectrogram, in case that ever changes, but that would add clutter to the Spectrogram interface.

Alright, I'm ok with having the value recomputed here. If there's ever a change in this, the test will fail :)

more

299c39c

jamarshon requested a review from vincentqb August 19, 2019 20:16

vincentqb reviewed Aug 19, 2019

View reviewed changes

vincentqb approved these changes Aug 19, 2019

View reviewed changes

jamarshon merged commit 42a705d into pytorch:master Aug 20, 2019

jamarshon deleted the lazy branch August 20, 2019 20:03

jamarshon mentioned this pull request Aug 21, 2019

Lazy initialization of MelScale.fb throws when loading module #245

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

MelSpectrogram: Use given n_fft to initialize fb immediately #246

MelSpectrogram: Use given n_fft to initialize fb immediately #246

Uh oh!

jamarshon commented Aug 19, 2019 •

edited

Loading

Uh oh!

vincentqb Aug 19, 2019

Uh oh!

jamarshon Aug 19, 2019

Uh oh!

vincentqb Aug 19, 2019 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

MelSpectrogram: Use given n_fft to initialize fb immediately #246

MelSpectrogram: Use given n_fft to initialize fb immediately #246

Uh oh!

Conversation

jamarshon commented Aug 19, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vincentqb Aug 19, 2019

Choose a reason for hiding this comment

Uh oh!

jamarshon Aug 19, 2019

Choose a reason for hiding this comment

Uh oh!

vincentqb Aug 19, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jamarshon commented Aug 19, 2019 •

edited

Loading

vincentqb Aug 19, 2019 •

edited

Loading