Time domain RMS #407

carlthome · 2016-09-06T13:00:54Z

If a precomputed spectrogram is not given, calculate rmse() in the time domain (to avoid doing a costly STFT).

(side note: could rmse() be renamed to rms()? RMSE implies root-mean-square error and it's confusing.)

This change is

stefan-balke · 2016-09-06T13:41:52Z

You may consider a git rebase first. :)

carlthome · 2016-09-06T14:09:48Z

@stefan-balke Haha, yeah... 🐹 I'll squash soon. Sorry for the notifications.

bmcfee · 2016-09-06T15:08:39Z

calculate rmse() in the time domain

Do these give the same (numerical) results?

RMSE implies root-mean-square error and it's confusing

RMSE also means "root mean square energy".

carlthome · 2016-09-06T15:27:43Z

They give similar results. Example: https://gist.github.com/carlthome/048942b1369c374508f56b0d567abe2f

bmcfee · 2016-09-06T15:37:42Z

They give similar results.

Of course, but not numerically equivalent: they'll differ due to windowing. (They'd be the same if stft is called with window=np.ones).

This means that we're breaking backwards compatibility here, and more generally, breaking the usual librosa convention for the feature module that equivalent results are produced using time-domain or spectrogram input (for appropriately parameterized spectrograms).

I agree that using an stft is overkill for rmse calculation, so having an option for calculation in the time domain makes sense.

So, the question for us: how much do we care about preserving backwards compatibility and time/frequency equivalence for rmse? If we do adopt the proposed modification, it will need to be documented.

carlthome · 2016-09-06T15:44:30Z

The default could be the time-frequency method even when passed an audio series, and a bool would have to be set for the time domain method to be used. Then backwards compatibility is not affected.

bmcfee · 2016-09-08T13:51:40Z

The default could be the time-frequency method even when passed an audio series, and a bool would have to be set for the time domain method to be used. Then backwards compatibility is not affected.

Sure. We could also just tell people to use their own TF representation if that's what they want, and make no promises about it being equivalent to the un-windowed time domain implementation. There's a kind of precedent for this with melspectrogram, which uses squared energy rather than energy, so you only get equivalent results if called with melspectrogram(D=np.abs(S**2), ...).

I guess this is all to say: I'm okay with changing the default behavior to break backwards compatibility, and keeping the api simple. If we document it properly, there shouldn't be any major headaches.

carlthome · 2016-09-13T17:44:46Z

@bmcfee, how do you like this version with mode? It's backwards compatible.

bmcfee · 2016-09-13T18:46:51Z

@carlthome I think I prefer breaking backwards compatibility in this case. The additional parameter is redundant with whether the user supplied S or not; I think we can assume that a user would not supply S and expect a time-domain calculation.

As long as we include the example with window=np.ones in the docstring to demonstrate how to recover the old behavior, that's good enough.

carlthome · 2016-09-20T12:29:45Z

Hmm, even with a constant window function of 1.0, the RMS output from magnitudes or raw samples are not precisely equal. Any insight, @bmcfee?

bmcfee · 2016-09-20T12:40:21Z

Hmm, even with a constant window function of 1.0, the RMS output from magnitudes or raw samples are not precisely equal. Any insight, @bmcfee?

Are they off by sqrt(2pi) from FFT normalization?

carlthome · 2016-09-20T15:37:38Z

It's a problem with the framing. Given x, stft(..., n_fft=x) and util.frame(..., frame_length=x) should segment the signal exactly the same, no? However, the resulting RMS envelopes are slightly shifted in time:

carlthome · 2016-09-20T15:40:23Z

And for reference, I'm trimming the input signal to be divisible by the frame length before doing anything else, e.g.:

frame_length = 2048
y = util.fix_length(y, y.size - y.size % frame_length)
assert y.size % frame_length == 0

bmcfee · 2016-09-20T15:41:18Z

It's a problem with the framing. Given x, stft(..., n_fft=x) and util.frame(..., frame_length=x) should segment the signal exactly the same, no?

Aha! They won't be the same. stft first pads the signal and then center-aligns the frames. If you frame the signal directly, you lose padding and centering. I'm okay with the difference here.

If you call stft(..., center=False) then it ought to line up.

carlthome · 2016-09-20T16:02:45Z

@bmcfee, cool, stuff looks similar enough: https://gist.github.com/carlthome/048942b1369c374508f56b0d567abe2f

bmcfee · 2016-09-20T20:00:55Z

Looks great! One more nitpick in the tests (style), but otherwise I think this can merge. Thanks for doing this!

Reviewed 1 of 2 files at r5.
Review status: 1 of 2 files reviewed at latest revision, 1 unresolved discussion.

tests/test_features.py, line 342 at r5 (raw file):

        rms1, rms2 = map(lambda x: librosa.util.normalize(x, axis=1), [rms1, 
                                                                       rms2])

Please simplify this test to not use maps or lambdas. In this case, since the test signals are known to be strictly positive, I see no harm in just doing rms1 /= rms1.max() and likewise for 2.

Comments from Reviewable

carlthome · 2016-09-21T10:45:58Z

Review status: 1 of 2 files reviewed at latest revision, 1 unresolved discussion.

tests/test_features.py, line 342 at r5 (raw file):

Previously, bmcfee (Brian McFee) wrote…

Please simplify this test to not use maps or lambdas. In this case, since the test signals are known to be strictly positive, I see no harm in just doing rms1 /= rms1.max() and likewise for 2.

Done.

Comments from Reviewable

bmcfee · 2016-09-21T12:43:00Z

Reviewed 1 of 1 files at r6.
Review status: all files reviewed at latest revision, all discussions resolved.

Comments from Reviewable

bmcfee · 2016-09-21T12:44:34Z

Merged. Thanks again!

carlthome added 2 commits September 6, 2016 16:31

Calculate framewise RMS in time domain if spectrogram not precomputed

86f5c11

Prefer time domain

1131ff9

bmcfee added enhancement Does this improve existing functionality? discussion Open-ended discussion for developers and users API change Does this change the behavior of existing API? labels Sep 6, 2016

bmcfee added this to the 0.5 milestone Sep 6, 2016

bmcfee self-assigned this Sep 6, 2016

carlthome added 4 commits September 9, 2016 14:13

Added RMS energy mode setting

e4c3136

Prefer time-frequency domain RMS energy

7b41be8

Removed unused import

41e6fe6

Formatting

9cef76e

carlthome added 2 commits September 14, 2016 10:56

Remove rmse mode

00095bb

Test rmse consistency

d1075c8

Test for approximately equal RMS outputs instead

5457213

carlthome added 2 commits September 20, 2016 18:02

Add center=False to rmse docstring

b20ae16

Update rms test

03fa4db

No functional style

5122a0e

bmcfee merged commit 9ca2da3 into librosa:master Sep 21, 2016

carlthome deleted the rms branch September 21, 2016 12:54

bmcfee mentioned this pull request Nov 2, 2016

fixed a bug in feature.rmse time-domain #429

Merged

bmcfee mentioned this pull request Mar 14, 2017

different length of feature between rmse and others #528

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Time domain RMS #407

Time domain RMS #407

carlthome commented Sep 6, 2016 •

edited by bmcfee

Loading

stefan-balke commented Sep 6, 2016 •

edited

Loading

carlthome commented Sep 6, 2016 •

edited

Loading

bmcfee commented Sep 6, 2016

carlthome commented Sep 6, 2016

bmcfee commented Sep 6, 2016

carlthome commented Sep 6, 2016 •

edited

Loading

bmcfee commented Sep 8, 2016

carlthome commented Sep 13, 2016

bmcfee commented Sep 13, 2016

carlthome commented Sep 20, 2016 •

edited

Loading

bmcfee commented Sep 20, 2016

carlthome commented Sep 20, 2016

carlthome commented Sep 20, 2016

bmcfee commented Sep 20, 2016

carlthome commented Sep 20, 2016

bmcfee commented Sep 20, 2016

carlthome commented Sep 21, 2016

bmcfee commented Sep 21, 2016

bmcfee commented Sep 21, 2016

Time domain RMS #407

Time domain RMS #407

Conversation

carlthome commented Sep 6, 2016 • edited by bmcfee Loading

stefan-balke commented Sep 6, 2016 • edited Loading

carlthome commented Sep 6, 2016 • edited Loading

bmcfee commented Sep 6, 2016

carlthome commented Sep 6, 2016

bmcfee commented Sep 6, 2016

carlthome commented Sep 6, 2016 • edited Loading

bmcfee commented Sep 8, 2016

carlthome commented Sep 13, 2016

bmcfee commented Sep 13, 2016

carlthome commented Sep 20, 2016 • edited Loading

bmcfee commented Sep 20, 2016

carlthome commented Sep 20, 2016

carlthome commented Sep 20, 2016

bmcfee commented Sep 20, 2016

carlthome commented Sep 20, 2016

bmcfee commented Sep 20, 2016

carlthome commented Sep 21, 2016

bmcfee commented Sep 21, 2016

bmcfee commented Sep 21, 2016

carlthome commented Sep 6, 2016 •

edited by bmcfee

Loading

stefan-balke commented Sep 6, 2016 •

edited

Loading

carlthome commented Sep 6, 2016 •

edited

Loading

carlthome commented Sep 6, 2016 •

edited

Loading

carlthome commented Sep 20, 2016 •

edited

Loading