Query in stChromFeatures #22

MerinS · 2016-07-17T11:52:02Z

In here -->
https://github.com/tyiannak/pyAudioAnalysis/blob/master/audioFeatureExtraction.py#L267
The audio spectral values are divided by a scalar after they are pushed into the chroma audio bins.Could you let me know why that is done?
If it is for some kind of normalization, could you elucidate what normalization it is, as it is not clear to me.

tyiannak · 2016-07-17T21:00:01Z

The extracted chroma values are normalized by the number of freq bins corresponding to each chroma bin. This is done so that the chroma values are not biased the spectral window size

MerinS · 2016-07-18T07:57:03Z

<First up, the second line isn't clear to me>
Also, shouldn't such a normalization be done at the end after the 2-D chroma array(12,X dimensions)is compiled to 1D(12)
Taking an instance when the window size - 512. There are 10 frequency elements in the Ab chroma bin, but the Chroma[8] value is divided by 3 though it is in Ab. Also chroma[20] is divided by 4 though it is in Ab as well. Please clarify why these differences occur.

tyiannak closed this as completed Jul 17, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Query in stChromFeatures #22

Query in stChromFeatures #22

MerinS commented Jul 17, 2016

tyiannak commented Jul 17, 2016

MerinS commented Jul 18, 2016

Query in stChromFeatures #22

Query in stChromFeatures #22

Comments

MerinS commented Jul 17, 2016

tyiannak commented Jul 17, 2016

MerinS commented Jul 18, 2016