A Multi-Layer Modeling Approach to Music Genre Classification

Machine Learning project of MC886 at Unicamp

Listening to music and perceiving its structure is a fairly easy task for humans beings, even for listeners without formal musical training. However, building computational models to mimic those processes is a hard problem. The research is motivated by a large class of challenging applications in the media business that require efficient and robust audio segmentation and classification. Application scenarios include audio streaming services and Web stream analysis, automatic media monitoring, content- and descriptor-based search in large multimedia databases, improving recommendation algorithms and playlists suggestions, and artistic explorations.

Goals and Objectives

detect segment boundaries and transitions between musical sections on classical piano pieces.
test the performance of the algorithm for automatic segmentation of audio streams into coherent or otherwise meaningful units corresponding to musical sections of more generic pieces, such as Concerti and Symphonies.

Project Plan

The International Music Score Library Project (IMSPL) currently comprises more than 288,000 scores and more than 31,000 public domain recordings. Collections of piano sonatas by different composers (e.g. Haydn, Mozart, Beethoven and Schubert) and/or different interpreters are going to be selected and labeled accordingly to their corresponding musical sections. Audio descriptors algorithms from Essentia (and possibly other available libraries) will populate the feature vectors. Essentia is an open-source C++ library for audio analysis and audio-based music information retrieval. This combination will give life to our training dataset.

We are interested in:

Tonal descriptors notes, chords, keys, inharmonicity and dissonance
Rhythm descriptors beat detection, BPM, rhythm transform, beat loudness, rhythm similarity or homogeneity.
Time-domain descriptors duration, loudness, zero-crossing-rate log attack time and other signal envelope descriptors.
Statistical descriptors median, mean, variance, power means, flatness.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
report		report
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Multi-Layer Modeling Approach to Music Genre Classification

Goals and Objectives

Project Plan

Reports

About

Releases

Packages

Languages

License

g4brielvs/music-genre-classification

Folders and files

Latest commit

History

Repository files navigation

A Multi-Layer Modeling Approach to Music Genre Classification

Goals and Objectives

Project Plan

Reports

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages