
add slaney normalization #589

Merged: 5 commits merged into pytorch:master on May 14, 2020

Conversation

vincentqb
Contributor

Fixes #287

@@ -430,6 +431,8 @@ def create_fb_matrix(
         f_max (float): Maximum frequency (Hz)
         n_mels (int): Number of mel filterbanks
         sample_rate (int): Sample rate of the audio waveform
+        norm (Optional[str]): If 'slaney', divide the triangular mel weights by the width of the mel band
Contributor

Do we expect further normalization schemes? Does it make sense to split these normalizations out into their own layer? Are they useful in other contexts (maybe for volume normalization and such)?

Contributor Author

Do we expect further normalization schemes?

Yes, we could; see librosa/librosa#1050. Librosa itself is in the process of adding other normalizations, as mentioned in that pull request.

Does it make sense to split these normalizations out into their own layer?

Not according to this comment.

Are they useful in other contexts (maybe for volume normalization and such)?

The normalization is done against f_pts, which is computed within create_fb_matrix. I'm not aware of other use cases.
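For reference, the 'slaney' scheme this PR adds can be sketched as follows. This is a simplified illustration with hypothetical names, not torchaudio's exact implementation: each triangular mel filter is scaled by 2 divided by the width (in Hz) of its mel band, as derived from the band edge frequencies f_pts.

```python
import torch


def slaney_normalize(fb: torch.Tensor, f_pts: torch.Tensor) -> torch.Tensor:
    # fb: (n_freqs, n_mels) triangular mel filterbank
    # f_pts: (n_mels + 2,) band edge frequencies in Hz
    # Scale each filter by 2 / (width of its band), so filters integrate
    # to roughly constant energy per band.
    enorm = 2.0 / (f_pts[2:] - f_pts[:-2])
    return fb * enorm.unsqueeze(0)
```

With uniformly spaced edges every band has the same width, so all filters get the same scale factor; with mel-spaced edges the higher, wider bands are attenuated relative to the narrow low-frequency bands.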

@vincentqb vincentqb marked this pull request as ready for review May 4, 2020 22:36
@vincentqb vincentqb requested review from cpuhrsch and mthrok May 5, 2020 18:25
@@ -99,7 +99,8 @@ def func(_):
         f_max = 20.0
         n_mels = 10
         sample_rate = 16000
-        return F.create_fb_matrix(n_stft, f_min, f_max, n_mels, sample_rate)
+        norm = None
+        return F.create_fb_matrix(n_stft, f_min, f_max, n_mels, sample_rate, norm)
Contributor

If you're exercising TorchScript, I'd pass the less trivial type, which is a string, instead of None.

Contributor Author

You mean: default to empty string? We don't use that elsewhere, but sounds good to me.

Contributor Author

done
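The empty-string-default pattern settled on above can be sketched like this. The function name and body are hypothetical stand-ins, not torchaudio's code; the point is only that a plain str default scripts cleanly without Optional handling.

```python
import torch


def make_fb(norm: str = "") -> torch.Tensor:
    # Hypothetical sketch: an empty string stands in for "no normalization",
    # avoiding Optional[str] (and the None checks it requires) in TorchScript.
    fb = torch.ones(4, 3)
    if norm == "slaney":
        fb = fb * 0.5  # stand-in for the real per-band scaling
    return fb


# Scripts without any Optional type refinement being needed.
scripted = torch.jit.script(make_fb)
```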

torchaudio/functional.py (outdated review thread, resolved)
@codecov

codecov bot commented May 5, 2020

Codecov Report

Merging #589 into master will increase coverage by 0.01%.
The diff coverage is 100.00%.


@@            Coverage Diff             @@
##           master     #589      +/-   ##
==========================================
+ Coverage   88.99%   89.01%   +0.01%     
==========================================
  Files          21       21              
  Lines        2254     2257       +3     
==========================================
+ Hits         2006     2009       +3     
  Misses        248      248              
Impacted Files Coverage Δ
torchaudio/functional.py 95.53% <100.00%> (+0.01%) ⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@vincentqb
Contributor Author

@mthrok -- do you have any feedback?

Collaborator

@mthrok left a comment

Sorry for the late reply. Looks good to me. One nit.

@@ -424,7 +424,8 @@ def create_fb_matrix(
         f_min: float,
         f_max: float,
         n_mels: int,
-        sample_rate: int
+        sample_rate: int,
+        norm: str = "",
Collaborator

As a public API signature, I think Optional[str] looks cleaner.

Contributor Author

This was made due to the comment above. I'll leave it as it is for now. We can always extend the str to Optional[str] later without breaking BC :)

Collaborator

It looks to me that that comment was about the type of variable to pass when running the TorchScript test, not about the function signature.

Contributor Author

If we allow None in the signature, then the code should work with/without jit when passing None. It wasn't though. Is that what you meant?

Collaborator

@mthrok May 14, 2020

If we allow None in the signature, then the code should work with/without jit when passing None.

Yes, it works.

from typing import Optional

import torch
from torch import Tensor


def bar(foo: Optional[str]=None) -> Tensor:
    if foo is None:
        return torch.zeros(1, 2)
    if foo == "a":
        return torch.ones(1, 1)

    return torch.empty(1, 1)


ts_bar = torch.jit.script(bar)

for v in [None, "a", "b"]:
    print(v)
    print(bar(v))
    print(ts_bar(v))

produces

None
tensor([[0., 0.]])
tensor([[0., 0.]])
a
tensor([[1.]])
tensor([[1.]])
b
tensor([[-2.8910e+12]])
tensor([[0.]])

Also, dcshift uses Optional[float], and it works fine with TorchScript for both None and float inputs.
#558

Collaborator

It's just that when the type is Optional, it first needs to be compared against None, using if var is None or if var is not None.

Collaborator

@mthrok May 14, 2020

https://pytorch.org/docs/master/jit_language_reference.html#optional-type-refinement

TorchScript will refine the type of a variable of type Optional[T] when a comparison to None is made inside the conditional of an if-statement or checked in an assert. The compiler can reason about multiple None checks that are combined with and, or, and not. Refinement will also occur for else blocks of if-statements that are not explicitly written.
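The refinement rule quoted above can be shown in a small self-contained example (hypothetical function, not torchaudio code): comparing the Optional[str] argument against None inside an if-statement lets the compiler treat it as str in the other branch, so string comparisons are allowed there.

```python
from typing import Optional

import torch


def pick(norm: Optional[str] = None) -> torch.Tensor:
    # Inside this branch TorchScript knows norm is None.
    if norm is None:
        return torch.zeros(1)
    # Here norm has been refined from Optional[str] to str,
    # so comparing it to a string literal compiles fine.
    if norm == "slaney":
        return torch.ones(1)
    return torch.full((1,), 2.0)


ts_pick = torch.jit.script(pick)
```

Both the eager and scripted versions then accept None as well as strings.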

Contributor Author

Alrighty, #641 :)

@vincentqb vincentqb merged commit 995b75f into pytorch:master May 14, 2020
bhargavkathivarapu pushed a commit to bhargavkathivarapu/audio that referenced this pull request May 19, 2020
* add slaney normalization.

* add torchscript.

* convert to string for torchscript compatibility.

* flake8.

* use string as default.
mthrok pushed a commit to mthrok/audio that referenced this pull request Feb 26, 2021
Update seq2seq_translation_tutorial.py
mpc001 pushed a commit to mpc001/audio that referenced this pull request Aug 4, 2023

Successfully merging this pull request may close these issues.

amplitude normalization in create_fb_matrix
3 participants