
Ensure axis masking operations are not in-place #1481

Merged
merged 4 commits into from May 3, 2021

Conversation

carolineechen
Contributor

As reported in #1478, TimeMasking performs in-place operations that modify the input tensor. The same is true of FrequencyMasking.

This PR modifies the algorithm to not perform in-place operations on the input tensor and adds checks in the unit test to ensure that the input tensor is not being changed.

(Resolves #1478)
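The general pattern of the fix can be sketched as follows. This is not the exact torchaudio diff; the function name and the fixed start/end parameters are illustrative. The key point is using the out-of-place `masked_fill` (no trailing underscore), which returns a new tensor instead of mutating its input:

```python
import torch

def mask_along_axis_out_of_place(specgram, mask_start, mask_end, mask_value, axis):
    # Build a boolean mask over the chosen axis and apply it with the
    # out-of-place masked_fill, which returns a new tensor and leaves
    # `specgram` untouched (masked_fill_ would modify it in place).
    idx = torch.arange(specgram.size(axis))
    mask = (idx >= mask_start) & (idx < mask_end)
    shape = [1] * specgram.dim()
    shape[axis] = -1  # broadcast the 1-D mask along the target axis
    return specgram.masked_fill(mask.view(shape), mask_value)

specgram = torch.randn(2, 8, 8)
backup = specgram.clone()
masked = mask_along_axis_out_of_place(specgram, 2, 5, mask_value=0.0, axis=1)
```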

@vincentqb
Contributor

vincentqb commented Apr 30, 2021

I agree the functional should not modify in place. However, it is possible someone in the wild was relying on that, and this PR changes the behavior of the function silently. I do not see a way of detecting users who relied on that behavior, though. Should we raise a warning for one release to flag the change, or would calling this a bug fix in the release notes be enough?

On first thought, I'm leaning toward just calling this a bug fix in the release notes. Thoughts?
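Had the transitional-warning route been chosen instead, a minimal sketch could look like the following (the helper name and message text are hypothetical, not from this PR):

```python
import warnings

def _warn_masking_no_longer_inplace():
    # Hypothetical one-release notice that masking is no longer in-place.
    warnings.warn(
        "TimeMasking/FrequencyMasking no longer modify the input tensor "
        "in-place; use the returned tensor instead.",
        UserWarning,
        stacklevel=2,
    )

# Demonstrate that the warning fires exactly once per call.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    _warn_masking_no_longer_inplace()
```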

@carolineechen
Contributor Author

@vincentqb I agree that this should be considered a bug fix, since it is not expected behavior for input parameters to be changed in functional/transforms, and we have never documented this as an in-place operation.

@@ -211,11 +212,13 @@ def test_mask_along_axis(self, shape, mask_param, mask_value, axis):

assert mask_specgram.size() == specgram.size()
assert num_masked_columns < mask_param
self.assertEqual(specgram, specgram_copy)
Contributor

The intention here is to specifically test that the original is not changed. At a minimum, I'd expect a comment justifying why this check is here. However, I would usually expect a separate test for this, so that we know this was the intention of the test. Having a separate test would also make it easier to extend to other functionals if we want to.

Thoughts?
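A hypothetical standalone version of such a dedicated test might look like this. The `mask_along_axis` here is a local stand-in, not torchaudio's actual implementation, and its signature is an assumption for illustration:

```python
import torch
import unittest

def mask_along_axis(specgram, mask_param, mask_value, axis):
    # Local stand-in for the functional under test (assumption: the real
    # code lives in torchaudio.functional). Out-of-place by construction.
    value = int(torch.rand(1).item() * mask_param)
    start = int(torch.rand(1).item() * (specgram.size(axis) - value))
    idx = torch.arange(specgram.size(axis))
    mask = (idx >= start) & (idx < start + value)
    shape = [1] * specgram.dim()
    shape[axis] = -1
    return specgram.masked_fill(mask.view(shape), mask_value)

class TestMaskPreserve(unittest.TestCase):
    def test_mask_along_axis_preserve(self):
        """mask_along_axis should not modify the input Tensor
        https://github.com/pytorch/audio/issues/1478
        """
        torch.random.manual_seed(42)
        specgram = torch.randn(2, 100, 100)
        specgram_copy = specgram.clone()
        # Loop a few times to bound the chance that every draw yields a
        # zero-width mask and the in-place bug goes undetected.
        for _ in range(5):
            mask_along_axis(specgram, mask_param=100, mask_value=0.0, axis=1)
        self.assertTrue(torch.equal(specgram, specgram_copy))
```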

Collaborator

I agree with this point. Each unit test should test only one thing, so that when a unit test fails we know which aspect of the functionality is not met.

Can you make new test methods with a short description, and reference the issue number in the docstring? See examples of test descriptions like:

def test_save_tensor_preserve(self, dtype):
"""save function should not alter Tensor"""

def test_mp3(self):
"""Providing format allows to read mp3 without extension
libsox does not check header for mp3
https://github.com/pytorch/audio/issues/1040
The file was generated with the following command
ffmpeg -f lavfi -i "sine=frequency=1000:duration=5" -ar 16000 -f mp3 test_noext
"""


https://github.com/pytorch/audio/issues/1478
"""
torch.random.manual_seed(42)
Collaborator

This is a tricky one. The goal of this test is to ensure that "when a mask is randomly applied, the original Tensor is not altered", and in #1478 we learned that the bug happens stochastically. So we need to control the randomness in a way that always hits the condition that caused the issue in #1478.

Simply setting the random seed has both positive and negative effects here. The positive aspect is that the test is repeatable: assuming the environment (hardware and software, including everything from the OS to PyTorch and CUDA) is the same, we can repeat the test and expect it to produce the same result. The negative aspect is that we cannot be sure this specific seed value will keep hitting the condition on every future configuration (HW/SW). If something about the random generator changes in the future and this seed value stops hitting the condition, the test becomes moot.

There are a couple of approaches to overcome this.

  1. Set the test configuration to always hit the condition of the reported bug.
    For example, if we make the test mask all the elements of the input tensor, we can be sure the test meets the requirement. However, this diverges from the expected usage (masking part of the input, not all of it), and looking at the signature/docstring of the function under test, it is not straightforward to do so. (It could be, but the docstring is hard to understand.)
  2. Patch the random generator for the sake of testing.
    If there is no other solution, we can patch the random generator to behave in our favor. This kind of technique is often used for functions that rely on external resources (HTTP access, for example; moto makes it possible to test an AWS app without an internet connection). But it complicates the test logic, which increases the chance of writing a wrong test.
  3. Bound the probability of the test missing the condition.
    Another approach is to bound the probability that the test hits the correct condition. Say a single attempt hits the bug condition with probability p. If we repeat the procedure n times, and assume the pseudo-random draws are iid (a reasonable assumption in this context, even if not in the strict sense), then the probability that the test hits the condition at least once becomes 1 - (1 - p)^n. For p = 0.6 and n = 10, that is about 99.989 %. The good thing about this approach is that we can still set the seed value once (and only once, at the very beginning) and keep reproducibility.

Contributor Author

I agree with everything here, and option 3 makes the most sense to me. The success case in #1478 (when the input tensor is not changed) occurs when value = torch.rand(1) * mask_param is equal to 0, which results in mask_start == mask_end, so no masking takes place. A mask_param value of 100 in all these tests means a 1 % probability of value == 0, i.e. of not hitting the condition, so I think looping 5 times should be sufficient (> 99.9999 %). What do you think? Also, should this reasoning be explained in the documentation, or is linking the issue sufficient?

Additionally, I ran this test against the previous implementation and saw all of these tests fail, indicating that we do hit the condition (although I agree that the test as-is does not guarantee this will always hold if we change parameters or if the state of the random seed changes).
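Plugging the numbers from this comment into the 1 - (1 - p)^n bound discussed above (taking the 1 % per-attempt miss estimate as given):

```python
p_miss = 0.01        # per-attempt chance that value == 0 with mask_param = 100
n = 5                # proposed number of loop iterations
p_hit = 1 - p_miss ** n
# p_hit = 1 - 1e-10, i.e. greater than 99.9999 %
```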

Collaborator

Additionally, I have run this test on the previous implementation and see that all these tests fail, indicating that we do hit the condition

Good to know. Thanks for taking the proper care.

A mask_param value of 100 in all these tests indicates a 1% probability of value=0 and not hitting the condition. I think therefore looping it over 5 times should be sufficient (> 99.9999%), what do you think?

Yes, 5 sounds good enough.

Also, should this reasoning be explained in the documentation or is linking the issue sufficient?

Yes, explaining it in the docstring is more helpful for future maintainers. (which is how I learned this kind of technique in the past)

Collaborator

@mthrok mthrok left a comment

Looks good. Thanks!

@carolineechen carolineechen merged commit 7fd5fce into pytorch:master May 3, 2021
Successfully merging this pull request may close these issues.

torchaudio.transforms.TimeMasking is inplace
4 participants