Grow your team on GitHub
GitHub is home to over 28 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.Sign up
Extracted data from the intersection of the Million Song Dataset's 10,000-song subset, the Tagtraum genre "ground truth" dataset, and the musiXmatch lyrics dataset. This combined dataset is designed to help researchers develop genre-classification algorithms and statistical analysis methods for the portion of the Million Song Dataset for which t…
Software, musical data, and phonetic transcriptions for analyzing music-text relationships in a corpus of 19th-c. German art songs by Franz Schubert.
Markov analysis of harmonic progressions and cluster analysis of song-level harmonic profiles in the McGill Billboard dataset.
Place to put stuff we are working on for parsing and analyzing the Billboard corpus
Python module for statistical analysis of transitional probabilities in a musical corpus (designed for harmony, but usable for other structures)