Allow extra dimensions with extent 1 in Spectrogram operator & AudioDecoder changes #1679

jantonguirao · 2020-01-21T12:03:08Z

Signed-off-by: Joaquin Anton janton@nvidia.com

Why we need this PR?

Pick one, remove the rest

It adds new feature needed to handle empty dimensions in Spectrogram operator

What happened in this PR?

Fill relevant points, put NA otherwise. Replace anything inside []

What solution was applied:
- In Spectrogram operator, check that the input shape is 1-D or only one dimension of the shape has an extent greater than 1. If so, we can squeeze the shape into a 1D tensor
Affected modules and functionalities:
- Spectrogram operator
Key points relevant for the review:
- Changes in Spectrogram
Validation and testing:
- Python tests
Documentation (including examples):
- Updated docstr

JIRA TASK: [Use DALI-XXXX or NA]

jantonguirao · 2020-01-21T12:04:04Z

!build

dali-automaton · 2020-01-21T12:05:35Z

CI MESSAGE: [1083375]: BUILD STARTED

dali-automaton · 2020-01-21T12:10:17Z

CI MESSAGE: [1083381]: BUILD STARTED

dali-automaton · 2020-01-21T12:49:44Z

CI MESSAGE: [1083375]: BUILD PASSED

dali-automaton · 2020-01-21T12:56:03Z

CI MESSAGE: [1083381]: BUILD PASSED

szalpal · 2020-01-21T14:59:46Z

I'm gonna improve this PR with AudioDecoder stuff, please don't merge yet

szalpal · 2020-01-21T17:15:06Z

!build

dali-automaton · 2020-01-21T17:20:59Z

CI MESSAGE: [1083750]: BUILD STARTED

JanuszL · 2020-01-21T17:21:35Z

Can we add some test: FileReader->AudioDecoder->Spectrogram test to see if that works?

szalpal · 2020-01-21T17:23:28Z

Can we add some test: FileReader->AudioDecoder->Scectogram test to see if that works?

Upssss, forgot about it, adding!

JanuszL · 2020-01-21T17:24:47Z

dali/test/python/test_audiodecoder_spectogram.py

@@ -0,0 +1,115 @@
+# Copyright (c) 2020, NVIDIA CORPORATION. All rights reserved.


You need to run this test as well.

Why not just make it as another case in the existing test_operator_spectrogram.py ?

JanuszL · 2020-01-21T17:25:09Z

dali/test/python/test_audiodecoder_spectogram.py

+    out = np.abs(
+        librosa.stft(y=input_data, n_fft=nfft, hop_length=win_step, window=hann_win)) ** 2
+
+    # Alternative way to calculate the spectrogram:


It could be removed

szalpal · 2020-01-21T22:46:03Z

!build

dali-automaton · 2020-01-21T22:50:20Z

CI MESSAGE: [1084339]: BUILD STARTED

dali-automaton · 2020-01-22T00:17:14Z

CI MESSAGE: [1084339]: BUILD FAILED

szalpal · 2020-01-22T11:04:33Z

!build

dali-automaton · 2020-01-22T11:10:25Z

CI MESSAGE: [1085295]: BUILD STARTED

dali-automaton · 2020-01-22T12:02:30Z

CI MESSAGE: [1085295]: BUILD PASSED

dali/operators/signal/fft/spectrogram.cc

klecki · 2020-01-22T16:13:39Z

dali/test/python/test_audiodecoder_spectogram.py

+        read, _ = self.input()
+        audio, rate = self.decode(read)
+        spec = self.fft(audio)
+        return spec


Can you also return the data from decode and validate their shapes?

That's already tested, here

klecki

Minor nitpicks written above, also please adjust the test file name as Janusz suggests.

szalpal · 2020-01-23T10:54:11Z

test_operator_spectogram uses random artificial data, I use real data with FileReader here. I'd need to add another 2 pipelines there. Ofc that test there could be changed, but it's out of scope of this PR. IMHO it's better to just add another file

JanuszL · 2020-01-23T13:33:39Z

!build

dali-automaton · 2020-01-23T13:35:16Z

CI MESSAGE: [1087534]: BUILD STARTED

dali-automaton · 2020-01-23T14:08:51Z

CI MESSAGE: [1087534]: BUILD FAILED

jantonguirao · 2020-01-23T15:20:05Z

test_operator_spectogram uses random artificial data, I use real data with FileReader here. I'd need to add another 2 pipelines there. Ofc that test there could be changed, but it's out of scope of this PR. IMHO it's better to just add another file

I'd simply add those as another set of test cases in test_operator_spectrogram (as Janusz suggested)

Signed-off-by: Joaquin Anton <janton@nvidia.com>

dali-automaton · 2020-01-23T15:52:09Z

CI MESSAGE: [1087534]: BUILD PASSED

Signed-off-by: Michał Szołucha <mszolucha@nvidia.com>

szalpal · 2020-01-23T18:46:57Z

!build

dali-automaton · 2020-01-23T18:51:19Z

CI MESSAGE: [1088023]: BUILD STARTED

dali-automaton · 2020-01-23T19:39:03Z

CI MESSAGE: [1088023]: BUILD PASSED

jantonguirao requested a review from a team January 21, 2020 12:04

JanuszL approved these changes Jan 21, 2020

View reviewed changes

szalpal changed the title ~~Allow extra dimensions with extent 1 in Spectrogram operator~~ Allow extra dimensions with extent 1 in Spectrogram operator & AudioDecoder changes Jan 21, 2020

JanuszL reviewed Jan 21, 2020

View reviewed changes

klecki reviewed Jan 22, 2020

View reviewed changes

dali/operators/signal/fft/spectrogram.cc Show resolved Hide resolved

klecki reviewed Jan 22, 2020

View reviewed changes

klecki requested changes Jan 22, 2020

View reviewed changes

JanuszL self-requested a review January 23, 2020 01:02

jantonguirao force-pushed the spectrogram_1d_squeeze branch from b620ef0 to 560f907 Compare January 23, 2020 15:18

Allow extra dimensions with extent 1 in Spectrogram operator

1bd6dc7

Signed-off-by: Joaquin Anton <janton@nvidia.com>

jantonguirao force-pushed the spectrogram_1d_squeeze branch from 560f907 to 1bd6dc7 Compare January 23, 2020 15:51

szalpal added 2 commits January 23, 2020 19:43

Changes in Decoder and decoder's test

0835b12

Signed-off-by: Michał Szołucha <mszolucha@nvidia.com>

updating spectogram test to cover decoder also

2a476ae

Signed-off-by: Michał Szołucha <mszolucha@nvidia.com>

klecki approved these changes Jan 24, 2020

View reviewed changes

JanuszL approved these changes Jan 24, 2020

View reviewed changes

szalpal merged commit c85420c into NVIDIA:master Jan 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow extra dimensions with extent 1 in Spectrogram operator & AudioDecoder changes #1679

Allow extra dimensions with extent 1 in Spectrogram operator & AudioDecoder changes #1679

jantonguirao commented Jan 21, 2020

jantonguirao commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

szalpal commented Jan 21, 2020

szalpal commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

JanuszL commented Jan 21, 2020 •

edited

Loading

szalpal commented Jan 21, 2020

JanuszL Jan 21, 2020

JanuszL Jan 21, 2020

JanuszL Jan 21, 2020

szalpal Jan 21, 2020

szalpal commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

dali-automaton commented Jan 22, 2020

szalpal commented Jan 22, 2020

dali-automaton commented Jan 22, 2020

dali-automaton commented Jan 22, 2020

klecki Jan 22, 2020

szalpal Jan 23, 2020

klecki left a comment

szalpal commented Jan 23, 2020

JanuszL commented Jan 23, 2020

dali-automaton commented Jan 23, 2020

dali-automaton commented Jan 23, 2020

jantonguirao commented Jan 23, 2020

dali-automaton commented Jan 23, 2020

szalpal commented Jan 23, 2020

dali-automaton commented Jan 23, 2020

dali-automaton commented Jan 23, 2020

		@@ -0,0 +1,115 @@
		# Copyright (c) 2020, NVIDIA CORPORATION. All rights reserved.

Allow extra dimensions with extent 1 in Spectrogram operator & AudioDecoder changes #1679

Allow extra dimensions with extent 1 in Spectrogram operator & AudioDecoder changes #1679

Conversation

jantonguirao commented Jan 21, 2020

Why we need this PR?

What happened in this PR?

jantonguirao commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

szalpal commented Jan 21, 2020

szalpal commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

JanuszL commented Jan 21, 2020 • edited Loading

szalpal commented Jan 21, 2020

JanuszL Jan 21, 2020

Choose a reason for hiding this comment

JanuszL Jan 21, 2020

Choose a reason for hiding this comment

JanuszL Jan 21, 2020

Choose a reason for hiding this comment

szalpal Jan 21, 2020

Choose a reason for hiding this comment

szalpal commented Jan 21, 2020

dali-automaton commented Jan 21, 2020

dali-automaton commented Jan 22, 2020

szalpal commented Jan 22, 2020

dali-automaton commented Jan 22, 2020

dali-automaton commented Jan 22, 2020

klecki Jan 22, 2020

Choose a reason for hiding this comment

szalpal Jan 23, 2020

Choose a reason for hiding this comment

klecki left a comment

Choose a reason for hiding this comment

szalpal commented Jan 23, 2020

JanuszL commented Jan 23, 2020

dali-automaton commented Jan 23, 2020

dali-automaton commented Jan 23, 2020

jantonguirao commented Jan 23, 2020

dali-automaton commented Jan 23, 2020

szalpal commented Jan 23, 2020

dali-automaton commented Jan 23, 2020

dali-automaton commented Jan 23, 2020

JanuszL commented Jan 21, 2020 •

edited

Loading