Transcription [wip] #170
Conversation
-------
intervals : np.ndarray, shape=(n_events, 2)
    array of event start and end times
values : list of float
We should return `values` as an `np.ndarray` (just involves `values = np.array(values)` somewhere below).
I was returning a list because that's what `load_labeled_intervals` does, but I'm fine with changing it to an `np.ndarray`.
Right, that's because it's a list of strings, not a list of numeric data.
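As a sketch of the suggested change, a loader for numeric values could convert to an array just before returning (the loader below is hypothetical and simplified, not mir_eval's actual `load_valued_intervals`):

```python
import numpy as np

def load_valued_intervals_sketch(lines):
    """Hypothetical minimal loader: parse 'start end value' rows."""
    intervals, values = [], []
    for line in lines:
        start, end, value = line.split()
        intervals.append([float(start), float(end)])
        values.append(float(value))
    # Convert to ndarray since the values are numeric (unlike
    # load_labeled_intervals, whose labels are strings).
    return np.array(intervals), np.array(values)
```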
changed in 7802919
Is this ready for re-review?
Note: I re-ran the PR test build to trigger coverage reporting after merging #179.
It's missing the regression tests still, but otherwise I think I've addressed every comment from the previous CR, so yes I think it's ready for re-review (or you could wait till I have the regression tests in too, as you prefer).
# pycharm
.idea/*
docs/_build/doctrees/environment.pickle
I think you can just ignore `docs/_build/*`, if you want to do this.
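The suggested broader pattern would look like this in `.gitignore` (a sketch; the comment is illustrative):

```
# pycharm
.idea/*
# Sphinx build artifacts (covers doctrees/environment.pickle too)
docs/_build/*
```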
Ok, made a couple more minor comments for you to chew on while we wait for real-world data. As a side note, once we have said data, we will eventually need to add the regression-score-generating step to
Roger that (I've addressed all the comments except for one I wasn't sure about). Looks like adding the regression-score-generating step should be fairly straightforward. I've now asked Chris Cannam and Emmanouil Benetos for their algorithms' output too, so whoever gets back to me first "wins" :)
Force-pushed from b084e51 to a738ae1
prf() implements the computation of precision, recall, and f1. match_notes() finds the optimal matching between the ref and est notes.
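The precision/recall/F-measure computation described here is the standard one; a minimal sketch follows (the function name and count-based signature are illustrative, not the actual mir_eval API, which operates on note intervals and pitches directly):

```python
def precision_recall_f1_sketch(n_matched, n_est, n_ref):
    """Standard P/R/F1 from a count of matched note pairs.

    n_matched: number of est notes matched to a ref note
    n_est, n_ref: total estimated / reference note counts
    """
    precision = n_matched / n_est if n_est > 0 else 0.0
    recall = n_matched / n_ref if n_ref > 0 else 0.0
    if precision + recall > 0:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0  # degenerate case: nothing matched or empty inputs
    return precision, recall, f1
```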
Implement the evaluate function. Rename prf() to precision_recall_f1. Flip the bi-partite graph construction such that the first index of every returned tuple is the ref index, and the second is the est index. Compute the evaluation metrics with and without offsets. Change load_valued_intervals to return the pitch sequence as an np.ndarray.
Add transcription to sphinx doc generation. Modify tests to raise errors instead of warnings where relevant, and remove redundant tests. Use offset_min_tolerance, and make it an optional parameter.
Move match_notes into transcription.py. Remove with_offset and make the use of offsets implicit in the offset_ratio argument (if None then offsets are ignored). Also, change the behavior of precision_recall_f1 so that by default it DOES consider offsets. Finally, the output metrics have been renamed.
In addition to being moved, test_match_notes has also been updated.
Add strict option for using <. The default is strict=False, which means using <=. Use a more efficient method to compute the offset_hit_matrix. Rename cmp to cmp_func.
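The strict/cmp_func idea can be sketched with `operator` (illustrative only; the real code applies this comparison when building the hit matrices):

```python
import operator

def within_tolerance_sketch(diff, tolerance, strict=False):
    """strict=True uses '<'; strict=False (the default) uses '<='."""
    cmp_func = operator.lt if strict else operator.le
    return cmp_func(diff, tolerance)
```

With `strict=False`, a difference exactly equal to the tolerance still counts as a hit; `strict=True` excludes the boundary case.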
Generate est, ref, and output files for regression testing. Fix the unit test data (file-level constants) to match the expected results: est note 1 should not match ref note 1 when offsets are considered. Also update the expected unit test results, since est note 3 can match ref note 3 when strict=False.
Test Duan's matching code (which uses greedy note matching) to compare it to the graph based note matching. This also requires rounding down note onset and offset times to 2 decimal places.
Major update of the top-level docstring to explain all the differences compared to the MIREX 2015 matlab evaluation code. Remove the multiplication by the .5 factor when computing the offset_tolerance, since it turns out this is not done in the MIREX evaluation. Also remove the rounding down of onset/offset times.
Make N_DECIMALS a file-level constant. Improve the numerical stability of the pitch comparison, and use a simpler and optimized offset tolerance computation.
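One common way to make a pitch comparison numerically stable is to compare on a log (semitone) scale rather than on raw Hz differences; this sketch shows that general approach under stated assumptions (function name and half-semitone tolerance are illustrative, not the actual mir_eval implementation):

```python
import numpy as np

def pitch_match_sketch(ref_pitch_hz, est_pitch_hz, tol_semitones=0.5):
    """Compare two pitches in Hz on a semitone (log2) scale.

    Working in log space makes the tolerance musically uniform
    across registers: a quarter tone at 110 Hz and at 880 Hz
    yields the same semitone difference.
    """
    diff = 12.0 * abs(np.log2(est_pitch_hz / ref_pitch_hz))
    return diff < tol_semitones
```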
Add the following tests: test_match_notes_strict, test_invalid_pitch, test_inconsistent_int_pitch (for when intervals and pitches are not the same length), test_empty_ref, test_empty_est, test_precision_recall_f1_empty (for when either the ref or est is empty).
Replace "\" with "( .. )" for all line continuations. Fix the top-level docstring to describe the relative difference from the MIREX metrics in percentages rather than absolute values.
Force-pushed from a738ae1 to 1077366
Alright @craffel, I think we're where we want to be. If you think the commit messages need further editing feel free to edit them, though I've followed the recommended guidelines to the letter.
Looks good to me, feel free to merge!
Huzzah!
You can close #167 too :)
This will (eventually) contain all the code for transcription evaluation (and tests)