This repository contains the analysis of musical patterns gathered with the PAF and the ANOMIC annotation tools, as well as patterns discovered with various SIA-based pattern discovery algorithms. The analysis is split into two sections, the inter-annotator agreement analysis and the feature-based analysis. The inter-agreement analysis is based on the below agreement matrices between annotators of all datasets. Meanwhile the feature-based analysis results consist of t-test and KS-test results between the feature distributions of the aforementioned datasets, and boxplot visualisations of these distributions.
F1 score matrices, representing pairwise agreement between annotators, with the time threshold set to 5 crotchets. Each matrix showcases results for a different piece of music. Matrix columns and rows denote different annotators, grouped based on the annotation tool used (PAF or ANOMIC) and their musical background. An exception are the last few rows, labeled ALG, where each row represents a separate pattern discovery algorithm. Matrix cell colours correspond to the obtained pairwise agreement values, where yellow denotes high agreement and blue low. Some annotators did not provide annotations for all pieces, which can be seen on the diagonals as low agreement.
Boxplots showcasing the distributions of the analysed pattern features of the PAF, ANOMIC and algorithmic datasets. For each feature the three distributions were normalised in order to fit on the same plot. Separate plots for each feature can be seen lower on this page.
Tables showcasing t and p values of t-tests with alpha of 0.05, between the PAF, the ANOMIC (musicians and non-musicians) and the algorithmic datasets on the features of the first column. Orange cells denote tests with p values higher than the predefined threshold alpha, which means that the difference between the two tested distributions is not statistically significant.
Tables showcasing the D statistics and p values of Kolmogorov-Smirnov tests on the pairs of PAF, ANOMIC (musicians and non-musicians) and algorithmic feature distributions. The tests were performed with a threshold alpha of 0.05 and a critical value of 1.36. Yellow cells denote tests whose D statistic is lower than the critical value. Orange cells denote tests whose p value is higher than the threshold alpha, meaning that the difference between distributions cannot be considered as statistically significant.
Cardinality is defined as the number of notes or events contained in one occurrence of a pattern
Occurrences refers to the number of times that a pattern
The coverage of a pattern
Compression ratio is the amount of compression that can be achieved by representing the set of points covered by all occurrences of the pattern by specifying simply one occurrence of the pattern and all the vectors by which the pattern can be translated \cite{meredith2003algorithms}.
Meredith et al. \cite{meredith2003algorithms} use the following multiplication of coverage, compactness and compression ratio for ordering discovered patterns.
[ coverage^a \cdot compactness^b \cdot compression_ratio^c ]
Here the
Rhythmic density represents the mean number of events per tactus beat, as defined by Pearce and Wiggins \cite{pearce2007evaluating}. Research suggests that this variable and compactness are highly positively correlated \cite{collins2011improved}, but since rhythmic density is a specific musical property, unlike compactness, it might prove to be a better feature for comparison.
Pattern duration is similar to cardinality (\secref{sub:cardinality}), but it measures the duration of pattern
Last note duration is again a rather self explanatory feature. It describes the duration of the last note of a pattern.
Kranenburg, Volk and Wiering \cite{kranenburg2013comparison} compared several global and local features of melodies in their research. One of these was the duration of melodic arcs which is defined as the average number of notes that separate melodic peaks and troughs in melodies. We decided to include this feature in our analysis due to the similarities between melodies and patterns. However, we adjusted the feature slightly. The value of this feature was set to
Pearce and Wiggins \cite{pearce2007evaluating} define rhythmic variability as the degree of change in note duration (i.e. the standard deviation of the log of the event durations). Research of Eerola and North \cite{eerola2000expectancy} suggests that patterns with high rhythmic variation might be more difficult to identify. On the other hand Collins \cite{collins2011improved} raises the idea that their high variation might actually increase their distinctiveness.
This features combines the cardinality and the occurrences feature by using their mean-centered values. The idea behind this is that a pattern containing many notes and/or occurring many times is likely to be relatively noticeable/important \cite{collins2011improved}. The mean-centered cardinality$\times$occurrences for pattern
[mc_C \times O (P, D)= (|P|-\overline{|P|})(|m|-\overline{|m|}) ] %mc_cardinality \times occurrences = (|P|-\overline{|P|})(|m|-\overline{|m|})
Here
This feature is based on the formula, presented by Conklin and Bergeron \cite{conklin_bergeron_2008}, for calculating the expected number of pattern occurrences in a dataset. The idea behind it is that patterns that involve less common pitches or rhythms are less likely to occur by chance and should thus be more noticeable.
This feature uses the relative frequency of occurrences of musical events, i.e. the empirical distribution, from which the likelihood that a given pattern occurs can be computed. By taking into account the number of locations where such a pattern can occur we obtain the expected occurrences feature.
The geometric mean likelihood represents a slight variation of the expected occurrences feature. It uses the individual probabilities
[geom_mean_likelihood(P,D)=(\prod_{j=1}^{l}{ \pi_{i_j} })^\frac{1}{l} ]
Conklin and Anagnostopoulou \cite{conklin_anagnostopoulou_2001} define the score of a pattern in order to indicate potentially interesting patterns. This feature is based on the observed and expected occurrence counts of a pattern and is computed as follows:
[ score(P,D) = \frac{( |TEC(P,D)|-\norm{E(Z)} )^2}{\norm{\mathbb{E}(Z)}} ]
Conklin and Bergeron \cite{conklin_bergeron_2008} also propose the interest of a pattern, which is a simpler version of score. The feature defines potentially interesting patterns as patterns with large differences between the observed and expected occurrence counts of a pattern. The interest itself is computed as the ratio of the two counts:
[ interest(P,D) = \frac{|TEC(P,D)|}{\norm{\mathbb{E}(Z)}} ]
The prominence feature is used to identify significant or important patterns. It was defined by Cambouropoulos \cite{cambouropoulos2006musical} and it is based on a combination of cardinality, occurrences and coverage features. [ prominence(P,D) = |P| \cdot m \cdot 10^{3\frac{coverage(P,D)-m\cdot|P|}{coverage(P,D)}} ]
As the name suggest this feature presents an alternative to the prominence feature. In prominence the
Note range describes the number of semi-tones between the lowest and highest note of a pattern. The range is measured in MIDI note numbers.
The intuition behind this feature is that patterns with a larger range are more noticeable. We denote the range of a pattern by
The following two features are rather similar to note range, however they also take into account the average range of other patterns from the same music piece. The signed pitch range represents the distance, in semitones, of the pitch range of a pattern from the mean pitch range of other discovered patterns from the same dataset, as defined by Pearce and Wiggins \cite{pearce2007evaluating}. [sp_range(P,D) = r(P) - \frac{1}{M} \sum_{i=1}^{M} r(P_i) ]
The unsigned pitch range simply defines the absolute value of the signed pitch range. In comparison to the previous two features, the unsigned pitch range also takes into consideration unusually small pitch ranges and not just unusually large ones \cite{collins2011improved}.
Pearce and Wiggins \cite{pearce2007evaluating} define pitch centre as the absolute distance, in semitones, of the mean pitch of a pattern from the mean pitch of the music piece. Based on this Collins \cite{collins2011improved} defines the maximum pitch centre feature, which is used to isolate either unusually high, or unusually low occurrences.
[max_pitch_centre(P,D) = max{|\overline{y}_Q- \overline{y}_D|}]
Here
The chromatic feature is defined as the maximum number of nonkey notes present, taken over all occurrences of a pattern \cite{collins2011improved}. Collins suggests that a highly chromatic pattern is likely to be more noticeable than one that remains entirely in key.
This feature defines the ratio between the number of times that the pitch direction of a pattern changes and its overall number of events. It could also be defined as the number of melodic arcs in a pattern, compared to its cardinality. This feature was inspired by various melodic arc features used by Kranenburg et al. \cite{kranenburg2013comparison}.
This feature defines the fraction of all melodic intervals of a pattern that are small (less than two semitones). The intuition behind it is that static, scalic or stepwise melodies might be seen as more important \cite{collins2011improved}.
This feature is the opposite of the small intervals feature, as it represents the fraction of all melodic intervals of a pattern that are intervallic leaps (more than two semitones). The idea is that patterns with leaping melodies might be more noticeable and thus more important \cite{collins2011improved}.
This feature is defined as the average melodic interval of all consecutive note pairs in a pattern.
[\overline{melodic_interval} = \frac{1}{l} \sum_{i=2}^{l} y_i - y_{i-1}]
Here
This feature describes the fraction of notes in a pattern that are root notes or octaves.
This feature describes the fraction of all notes in a pattern that are major or minor thirds.
This feature describes the fraction of all notes in a pattern that are perfect fifths.
Kranenburg et al. \cite{kranenburg2013comparison} define the steepness of a pattern as the deviation in pitch between two turning points (peaks and troughs) divided by the duration. The mean steepness of a pattern is thus a mean across all steepnesses.
This feature describes the fraction of notes (events) that occur at least twice in a pattern. The intuition behind the feature is that patterns with more repeated notes are more prominent or significant, and will thus be more likely identified by annotators.
This feature represents the ratio between the number of occurrences of the most occurring note of a pattern and the pattern's cardinality. The idea behind this feature is that patterns with numerous repetitions of one specific note are more likely to be potentially interesting.
This feature represents the total ratio of unique notes in a pattern without counting repeated notes. If we consider a pattern