Skip to content

Chroma features

lbenbaccar edited this page Feb 6, 2021 · 2 revisions

These features are related to musical notes in a song. It is the estimation of the intensity with which each note is present in an audio file and how the changes between two notes occurs in time. These information are useful because each musical genre tends towards different key signature that is the most frequently played note in a song. A chromagram has shape (12, n_frames). 12 for each of the 12 semitones in an octave C, C#, D..., B. Each bin in the chroma spectrogram represents the average energy of that semitone (across all octaves). n_frames is the number of time-frames in the spectrogram. Each frame is hop_length/sr seconds long, with sr is the sample rate of the loaded audio file. To go to a given time in seconds in this spectrogram, compute frame_no = int(time/hop_length/sr).

Here is the chromagram of four songs from four different genres. chroma

Clone this wiki locally