DIgital Musicology Corpus Analysis Toolkit
A Python library for processing and analyzing notated music on a very large scale. It is under heavy development and has just seen its v1.0.0 alpha release. The library is developed by the Digital and Cognitive Musicology Lab at the École Polytechnique Fédérale de Lausanne (EPFL) in Switzerland and a white paper has been published as
Hentschel, J., McLeod, A., Rammos, Y., & Rohrmeier, M. (2023). Introducing DiMCAT for processing and analyzing notated music on a very large scale. Proceedings of the 24th International Society for Music Information Retrieval Conference, 516–523. https://ismir2023program.ismir.net/poster_52.html
DiMCAT is available on PyPI and can be installed via pip:
pip install dimcat
DiMCAT compiles frictionless datapackages. To play around with the alpha release, we recommend downloading the package which corresponds to the DCML corpora that are currently public. The package consists of two files:
- dcml_corpora.zip (data)
- dcml_corpora.json (metadata)
The data package can be loaded into DiMCAT as follows:
from dimcat import Dataset
D = Dataset.from_package("dcml_corpora.datapackage.json")
Development of this software tool was supported by the Swiss National Science Foundation within the project “Distant Listening – The Development of Harmony over Three Centuries (1700–2000)” (Grant no. 182811). This project is being conducted at the Latour Chair in Digital and Cognitive Musicology, generously funded by Mr. Claude Latour.
The software project has been set up using PyScaffold 4.2.1.