Adding test for T.SlidingWindowCmn #1482

pavithranrao · 2021-04-30T20:59:26Z

Autograd tests for Transforms #1414

pavithranrao · 2021-04-30T21:41:27Z

@carolineechen Can you please review this?

carolineechen · 2021-04-30T21:58:14Z

test/torchaudio_unittest/transforms/autograd_test_impl.py

+    @parameterized.expand([
+        ({'cmn_window': 600, 'min_cmn_window': 100, 'center': False, 'norm_vars': False}, ),
+        ({'cmn_window': 600, 'min_cmn_window': 100, 'center': True, 'norm_vars': False}, ),
+        ({'cmn_window': 600, 'min_cmn_window': 100, 'center': False, 'norm_vars': False}, ),


this set of params is a duplicate of the first one -- could you change or remove it?

Thanks for catching that, will ~~change~~ remove it.

Removed the duplicate test case. Thanks!
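One way to keep hand-written parameter lists free of duplicates like the one flagged above is to generate the combinations, or to scan an existing list for repeats. The following is a minimal sketch under stated assumptions: `make_cases` and `find_duplicates` are hypothetical helpers and are not part of torchaudio or of this PR. The fixed values are copied from the diff.

```python
from itertools import product

# Fixed values copied from the diff above; the helper names below are hypothetical.
BASE = {'cmn_window': 600, 'min_cmn_window': 100}


def make_cases(base, centers=(False, True), norm_vars=(False, True)):
    """Return one single-element tuple per unique (center, norm_vars) pair."""
    return [
        ({**base, 'center': c, 'norm_vars': n},)
        for c, n in product(centers, norm_vars)
    ]


def find_duplicates(cases):
    """Return the parameter dicts that repeat an earlier entry in the list."""
    seen, dupes = set(), []
    for (kwargs,) in cases:
        key = tuple(sorted(kwargs.items()))
        if key in seen:
            dupes.append(kwargs)
        seen.add(key)
    return dupes


cases = make_cases(BASE)
```

The resulting list has the same shape as the argument to `parameterized.expand` in the diff. Passing `norm_vars=(False,)` would restrict it to the cases without variance normalization.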

carolineechen · 2021-04-30T22:57:12Z

test/torchaudio_unittest/transforms/autograd_test_impl.py

@@ -157,7 +157,6 @@ def test_vol(self, gain, gain_type):
    @parameterized.expand([
        ({'cmn_window': 600, 'min_cmn_window': 100, 'center': False, 'norm_vars': False}, ),
        ({'cmn_window': 600, 'min_cmn_window': 100, 'center': True, 'norm_vars': False}, ),
-        ({'cmn_window': 600, 'min_cmn_window': 100, 'center': False, 'norm_vars': True}, ),


@mthrok this set of params causes the test to fail with RuntimeError: Jacobian mismatch for output 0 with respect to input 0. Any idea why?

This means that when norm_vars=True, some operation is not second-order differentiable.

It's somewhere in the code below, but exactly where is not immediately clear to me.

audio/torchaudio/functional/functional.py

Lines 1073 to 1100 in 0c263a9

    if norm_vars:
        cur_sumsq += torch.cumsum(input_part ** 2, 1)[:, -1, :]
else:
    if window_start > last_window_start:
        frame_to_remove = specgram[:, last_window_start, :]
        cur_sum -= frame_to_remove
        if norm_vars:
            cur_sumsq -= (frame_to_remove ** 2)
    if window_end > last_window_end:
        frame_to_add = specgram[:, last_window_end, :]
        cur_sum += frame_to_add
        if norm_vars:
            cur_sumsq += (frame_to_add ** 2)
window_frames = window_end - window_start
last_window_start = window_start
last_window_end = window_end
cmn_specgram[:, t, :] = specgram[:, t, :] - cur_sum / window_frames
if norm_vars:
    if window_frames == 1:
        cmn_specgram[:, t, :] = torch.zeros(
            num_channels, num_feats, dtype=dtype, device=device)
    else:
        variance = cur_sumsq
        variance = variance / window_frames
        variance -= ((cur_sum ** 2) / (window_frames ** 2))
        variance = torch.pow(variance, -0.5)
        cmn_specgram[:, t, :] *= variance
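For readers following the excerpt, the norm_vars branch computes the window variance from a running sum and a running sum of squares, then scales by its inverse square root. The sketch below restates that arithmetic on plain Python floats. It is an illustration only: `window_mean_and_inv_std` is a hypothetical name, scalars stand in for tensors, and this is not the torchaudio implementation.

```python
def window_mean_and_inv_std(frames, window_start, window_end):
    """Mean and inverse standard deviation of frames[window_start:window_end].

    Mirrors the arithmetic of the excerpt: variance = sumsq / n - sum**2 / n**2.
    """
    window = frames[window_start:window_end]
    window_frames = len(window)
    cur_sum = sum(window)
    cur_sumsq = sum(x * x for x in window)
    mean = cur_sum / window_frames
    variance = cur_sumsq / window_frames - (cur_sum ** 2) / (window_frames ** 2)
    inv_std = variance ** -0.5
    return mean, inv_std


frames = [1.0, 2.0, 3.0, 4.0]
mean, inv_std = window_mean_and_inv_std(frames, 0, 4)
# A frame is normalized as (x - mean) * inv_std.
normalized_last = (frames[3] - mean) * inv_std
```

Here the sum is 10 and the sum of squares is 30, so the mean is 2.5 and the variance is 30/4 - 100/16 = 1.25.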

What we want to do is:

1. Identify which part is causing this.
2. Change the code, if that is possible without performance degradation.

However, both are beyond the scope of this PR. For now, we can set nondet_tol and add a docstring note saying it's not 2nd-order differentiable when norm_vars=True, as is done for Spectrogram.

audio/test/torchaudio_unittest/transforms/autograd_test_impl.py

Lines 71 to 77 in 0c263a9

# replication_pad1d_backward_cuda is not deterministic and
# gives very small (~2.7756e-17) difference.
#
# See https://github.com/pytorch/pytorch/issues/54093
transform = T.Spectrogram(**kwargs)
waveform = get_whitenoise(sample_rate=8000, duration=0.05, n_channels=2)
self.assert_grad(transform, [waveform], nondet_tol=1e-10)
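The "Jacobian mismatch" error discussed in this thread is reported when an analytically computed gradient disagrees with a numerical estimate. The sketch below illustrates that comparison on a scalar function. It is a simplified illustration with hypothetical names, not PyTorch's gradient-checking code. Comparing `df` against `f` corresponds to a first-order check. Comparing `d2f` against `df` corresponds to a second-order check.

```python
def numerical_derivative(fn, x, eps=1e-6):
    """Central finite-difference estimate of the derivative of fn at scalar x."""
    return (fn(x + eps) - fn(x - eps)) / (2 * eps)


def derivatives_match(fn, analytic, x, atol=1e-5):
    """Return True if analytic(x) agrees with the numerical derivative of fn at x."""
    return abs(numerical_derivative(fn, x) - analytic(x)) <= atol


def f(v):
    """Inverse square root, the operation applied to the variance."""
    return v ** -0.5


def df(v):
    """Analytic first derivative of f."""
    return -0.5 * v ** -1.5


def d2f(v):
    """Analytic second derivative of f."""
    return 0.75 * v ** -2.5


first_order_ok = derivatives_match(f, df, 2.0)
second_order_ok = derivatives_match(df, d2f, 2.0)
# A deliberately wrong derivative produces a mismatch.
wrong_ok = derivatives_match(f, lambda v: 0.5 * v ** -1.5, 2.0)
```

The inverse square root is used here only because it appears in the excerpt. The sketch makes no claim about which operation causes the failure in this PR.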

mthrok · 2021-05-03T14:09:51Z

test/torchaudio_unittest/transforms/autograd_test_impl.py

+    def test_sliding_window_cmn(self, kwargs):
+        sample_rate = 8000
+        transform = T.SlidingWindowCmn(**kwargs)
+        waveform = get_whitenoise(sample_rate=sample_rate, duration=0.05, n_channels=2)


The input to SlidingWindowCmn is supposed to be a spectrogram.
This has been fixed in the master documentation: https://pytorch.org/audio/master/functional.html#torchaudio.functional.sliding_window_cmn

Can you use get_spectrogram, then swap the last two axes so that the Tensor dimensions are [..., time, freq]?

From the torch.stft docs, I found that it returns a tensor of shape (* × N × T), so do you suggest calling transpose(-2, -1) on the output?

mthrok · 2021-05-03T14:24:40Z

test/torchaudio_unittest/transforms/autograd_test_impl.py

+        ({'cmn_window': 600, 'min_cmn_window': 100, 'center': True, 'norm_vars': False}, ),
+        ({'cmn_window': 600, 'min_cmn_window': 100, 'center': False, 'norm_vars': True}, ),
+        ({'cmn_window': 600, 'min_cmn_window': 100, 'center': True, 'norm_vars': True}, ),
+        ({'cmn_window': 500, 'min_cmn_window': 50, 'center': False, 'norm_vars': False}, ),


cmn_window=600 and min_cmn_window=100 look too big for an input with 8000 * 0.05 == 400 samples (before the FFT is applied). Can you make them somewhat smaller than the number of frames along the time axis of the input tensor?

Thanks @mthrok, I will try to implement your suggestions.
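To see why the window sizes are too large, it helps to estimate how many time frames the spectrogram has. The sketch below uses the frame-count formulas given in the torch.stft documentation. The `n_fft=400` and `hop_length=100` values are assumptions chosen for illustration; the thread does not state the settings the test helper uses, and `num_stft_frames` is a hypothetical name.

```python
def num_stft_frames(n_samples, n_fft, hop_length, center=True):
    """Number of time frames an STFT produces for n_samples input samples."""
    if center:
        # With centered (padded) frames, the count depends only on the hop.
        return 1 + n_samples // hop_length
    return 1 + (n_samples - n_fft) // hop_length


# Input used in the test: 0.05 s of audio at 8000 Hz.
n_samples = round(8000 * 0.05)

# Assumed STFT settings, for illustration only.
frames = num_stft_frames(n_samples, n_fft=400, hop_length=100)
```

Under these assumptions the time axis has only 5 frames, far fewer than cmn_window=600 or min_cmn_window=100, so the sliding window would always cover the whole input.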

pavithranrao · 2021-05-03T18:29:37Z

The unit tests seem to pass with @mthrok's suggestions:

torchaudio_unittest\functional\batch_consistency_test.py::TestFunctional_0::test_sliding_window_cmn_False_False PASSED [ 48%]
torchaudio_unittest\functional\batch_consistency_test.py::TestFunctional_0::test_sliding_window_cmn_False_True PASSED [ 48%]
torchaudio_unittest\functional\batch_consistency_test.py::TestFunctional_0::test_sliding_window_cmn_True_False PASSED [ 48%]
torchaudio_unittest\functional\batch_consistency_test.py::TestFunctional_0::test_sliding_window_cmn_True_True PASSED [ 48%]

pavithranrao added 2 commits April 30, 2021 10:37

Adding test for T.SlidingWindowCmn

d10930e

Removing one negative case

6ec8994

facebook-github-bot added the CLA Signed label Apr 30, 2021

carolineechen requested review from mthrok and carolineechen April 30, 2021 21:48

carolineechen suggested changes Apr 30, 2021

Removing duplicate test case

96e4606

pavithranrao requested a review from carolineechen April 30, 2021 22:29

carolineechen reviewed Apr 30, 2021

Adding failing test cases for

a279859

mthrok reviewed May 3, 2021

mthrok mentioned this pull request May 3, 2021

Autograd tests for Transforms #1414

Closed

15 tasks

Using spectrogram instead of waveform for T.SlidingWindowCmn

c5256e5

pavithranrao requested review from carolineechen and mthrok May 3, 2021 18:30

mthrok merged commit b540e5d into pytorch:master May 3, 2021

mthrok pushed a commit to mthrok/audio that referenced this pull request Dec 13, 2022

Fix rendering bug in basics/data_tutorial.py (pytorch#1482)

effbcf5
