Issue with Windowing and Spectrum methods #240

ghost · 2015-03-25T16:37:53Z

I'm playing with Essentia and I get some weird results with Windowing and Spectrum algorithms when using AudioLoader.

First Windowing:

In [122]: monoloader = essentia.standard.MonoLoader(filename = 'myfile.wav')

In [123]: audioloader = essentia.standard.AudioLoader(filename = 'myfile.wav')

In [124]: monoloader_audio = monoloader()

In [125]: audioloader_audio, sr, c = audioloader()

In [126]: monoloader_frame = monoloader_audio[0:1024]

In [127]: audioloader_frame = audioloader_audio[0:1024,0]

In [128]: monoloader_frame
Out[128]: 
array([ 0.00143437,  0.02502518,  0.04733421, ...,  0.02630696,
        0.01281777, -0.00366222], dtype=float32)

In [129]: audioloader_frame
Out[129]: 
array([ 0.00143437,  0.02502518,  0.04733421, ...,  0.02630696,
        0.01281777, -0.00366222], dtype=float32)

In [130]: w = Windowing(type = 'hann', size=1024)

In [131]: w(monoloader_frame)
Out[131]: 
array([-0.00038114, -0.00030989, -0.00022922, ..., -0.0004094 ,
       -0.00044366, -0.00042744], dtype=float32)

In [132]: w(audioloader_frame)
Out[132]: 
array([-0.00022529,  0.        , -0.00018149, ...,  0.        ,
       -0.00028865,  0.        ], dtype=float32)

Both frame are the same, but I get weird windowed data with AudioLoader (one every two value is 0).

Same thing with Spectrum: I get the expected result when using MonoLoader and weird result with AudioLoader: I set the size parameter to 1024, which from the doc is the audio input size.
From a spectrum function I would expect either the whole symmetric magnitude spectrum on 1024 points, or half the spectrum on 513 points. With the Spectrum method I get the whole (symmetric) spectrum on 513 points. What am I missing?

In [4]: import essentia.standard

In [5]: import essentia

In [6]: loader = essentia.standard.AudioLoader(filename = 'myfile.wav')

In [7]: audio, sr, c = loader()

In [8]: spectrum = essentia.standard.Spectrum(size=1024)

In [9]: spectrum.paramValue('size')
Out[9]: 1024

In [10]: s = spectrum(audio[100000:101024, 0])

In [11]: ion()

In [12]: plot(s)
Out[12]: [<matplotlib.lines.Line2D at 0x7f64edf76f10>]

The text was updated successfully, but these errors were encountered:

dbogdanov · 2015-05-21T16:24:53Z

Is the input file mono and are you sure that monoloader_frame and audioloader_frame are equal value by value? AudioLoader return stereo signal, so that the 0-th index will correspond to the left channel.

dbogdanov · 2017-01-10T17:10:29Z

@pabloEntropia

palonso · 2017-01-11T11:45:26Z

I've reproduced the experiment getting similar results to @ghost.
I can't exactly tell what is the difference between monoloader_frame and audioloader_frame as they are equal for the numpy array_equal and array_equiv methods. However the problem is solved simply by casting to essentia.array. For instance:

import essentia.standard as ess

plt.plot(w(ess.essentia.array(monoloader_frame)))
plt.show()

plt.plot(w(ess.essentia.array(audioloader_frame)))
plt.show()

The same applies for the Spectrogram

dbogdanov · 2017-01-12T17:15:58Z

Ok, this is a problem interleaved representation of vectors of StereoSamples.

audioloader_frame = audioloader_audio[0:1024,0]

a new array object is created, but it does not allocate additional memory for the array's data. Instead, it creates a "view" that shares the original array's data buffer. Therefore, while printing monoloader_frame looks fine in python, passing that back to C++ algorithm produces an error. Addressing the first 1024 floats results in interleaved left/right channel values for 512 samples.

Casting to essentia.array or numpy.array solves the issue. This would not be obvious, however, for a user to do that.

dbogdanov · 2017-01-12T17:42:08Z

The same issue seems to appear for any slicing. Giving any numpy array object created by slicing to Essentia algorithm would result in incorrect memory access.

For example, monoloader_frame[::2] should have 512 value with every second value from monoloader_frame, but the resulting vector input will have the first 512 values instead.

To sum up, we should implement a check for if the input is a copy or a view when passing input to Essentia algorithms in the wrapper. In the case it is a view, we should create a new copy and pass that.

dbogdanov · 2017-01-12T17:45:44Z

We should implement a base python test for that too.

If it is a view it creates a copy

…w arrays MTG#240

dbogdanov · 2017-01-18T12:10:16Z

Fixed in #555

ghost changed the title ~~Issue with Spectrum method~~ Issue with Windowing and Spectrum method Mar 25, 2015

ghost changed the title ~~Issue with Windowing and Spectrum method~~ Issue with Windowing and Spectrum methods Mar 26, 2015

dbogdanov added this to the 2.1 milestone May 21, 2015

dbogdanov mentioned this issue Jun 16, 2016

what is the frequency grid of pfft? #425

Closed

dbogdanov added the algorithms QA label Oct 3, 2016

dbogdanov assigned palonso Dec 21, 2016

palonso pushed a commit to palonso/essentia that referenced this issue Jan 13, 2017

Python Wrapper: checks if input is a view or a copy MTG#240

4f61a4b

If it is a view it creates a copy

palonso pushed a commit to palonso/essentia that referenced this issue Jan 17, 2017

base test to check if the python parser is creating new copies of vie…

40fefc9

…w arrays MTG#240

palonso pushed a commit to palonso/essentia that referenced this issue Jan 17, 2017

Changed flag from ´OWNDATA´ to ´C_CONTIGUOUS´ MTG#240

716c0d5

dbogdanov closed this as completed Jan 18, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue with Windowing and Spectrum methods #240

Issue with Windowing and Spectrum methods #240

ghost commented Mar 25, 2015

dbogdanov commented May 21, 2015

dbogdanov commented Jan 10, 2017

palonso commented Jan 11, 2017 •

edited

dbogdanov commented Jan 12, 2017

dbogdanov commented Jan 12, 2017 •

edited

dbogdanov commented Jan 12, 2017

dbogdanov commented Jan 18, 2017

Issue with Windowing and Spectrum methods #240

Issue with Windowing and Spectrum methods #240

Comments

ghost commented Mar 25, 2015

dbogdanov commented May 21, 2015

dbogdanov commented Jan 10, 2017

palonso commented Jan 11, 2017 • edited

dbogdanov commented Jan 12, 2017

dbogdanov commented Jan 12, 2017 • edited

dbogdanov commented Jan 12, 2017

dbogdanov commented Jan 18, 2017

palonso commented Jan 11, 2017 •

edited

dbogdanov commented Jan 12, 2017 •

edited