Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Query in stChromFeatures #22

Closed
MerinS opened this issue Jul 17, 2016 · 2 comments
Closed

Query in stChromFeatures #22

MerinS opened this issue Jul 17, 2016 · 2 comments

Comments

@MerinS
Copy link

MerinS commented Jul 17, 2016

In here -->
https://github.com/tyiannak/pyAudioAnalysis/blob/master/audioFeatureExtraction.py#L267
The audio spectral values are divided by a scalar after they are pushed into the chroma audio bins.Could you let me know why that is done?
If it is for some kind of normalization, could you elucidate what normalization it is, as it is not clear to me.

@tyiannak
Copy link
Owner

The extracted chroma values are normalized by the number of freq bins corresponding to each chroma bin. This is done so that the chroma values are not biased the spectral window size

@MerinS
Copy link
Author

MerinS commented Jul 18, 2016

<First up, the second line isn't clear to me>
Also, shouldn't such a normalization be done at the end after the 2-D chroma array(12,X dimensions)is compiled to 1D(12)
Taking an instance when the window size - 512. There are 10 frequency elements in the Ab chroma bin, but the Chroma[8] value is divided by 3 though it is in Ab. Also chroma[20] is divided by 4 though it is in Ab as well. Please clarify why these differences occur.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants