
Remove METRICS dictionary #54

Closed
craffel opened this issue Apr 25, 2014 · 6 comments

craffel commented Apr 25, 2014

tl;dr - the METRICS dictionary doesn't really serve the purpose I want it to, and the evaluators do, so it shouldn't exist.

In some cases, metrics within the same task might have a different number of arguments. This used to be the case in segment before it got split into boundary and structure. It's currently true in melody; voicing_measures just takes in voicing arrays while the rest take in voicing and frequency arrays (even though raw_pitch_accuracy and raw_chroma_accuracy don't actually use their est_voicing arg). If all metrics in each task submodule took the same arguments, you could run

for name, metric in mir_eval.task_submodule.METRICS.items():
    print(name, metric(ref, est))

or whatever the call signature is, which would be convenient. But we don't expect this to hold in general. Uniform iteration like that was kind of the main reason for the METRICS dictionary to exist, and in practice the evaluators submodule should essentially subsume this functionality.
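
For example, the melody metrics can't all be called with one generic argument list. A minimal sketch of this mismatch, with toy data in place of real annotations; the mir_eval.melody signatures shown here are assumed from the current API and may have differed at the time:

import numpy as np
import mir_eval

# Toy annotations, purely for illustration; real data would come from a loader.
times = np.arange(0, 1, 0.01)
ref_freq = 220.0 * np.ones_like(times)
ref_freq[:10] = 0  # mark a few frames as unvoiced
est_freq = 224.0 * np.ones_like(times)
est_freq[:20] = 0

# Resample both annotations onto a common time base
# (helper assumed from the current mir_eval.melody module).
ref_v, ref_c, est_v, est_c = mir_eval.melody.to_cent_voicing(
    times, ref_freq, times, est_freq)

# The metrics take different arguments, so a single generic loop over a
# METRICS dict cannot call them all uniformly:
vx_recall, vx_false_alarm = mir_eval.melody.voicing_measures(ref_v, est_v)
raw_pitch = mir_eval.melody.raw_pitch_accuracy(ref_v, ref_c, est_v, est_c)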

@craffel craffel added this to the 0.0.2 milestone Apr 25, 2014
@craffel craffel modified the milestones: 0.1, 0.0.2 Jul 3, 2014
@craffel craffel changed the title Handling metrics with different call structures Remove METRIC dictionary Jul 21, 2014
@craffel craffel changed the title Remove METRIC dictionary Remove METRICS dictionary Jul 21, 2014
craffel added a commit that referenced this issue Jul 22, 2014

craffel commented Aug 19, 2014

Hey all, could I get a little consensus here? Especially those of us who are currently using mir_eval: @ejhumphrey @mattmcvicar @bmcfee @justinsalamon @urinieto. The proposed replacement for the METRICS dictionary is the evaluators, specifically each evaluator's evaluate function. I'm finding it would be useful to have this function accessible outside of the evaluators, so that it could be used by the tests and by end-users. So, what do you think of putting an evaluate function in each task submodule instead, which takes in reference and estimated annotations (not filenames!), pre-processes them as necessary, and returns a dictionary mapping nice-sounding keys to the scores for each metric? Then, the evaluators would just load in the data and call this function.
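
For concreteness, a minimal sketch of what such a per-submodule evaluate function could look like for segment. The function body, the score keys, and the mir_eval.segment.detection call are assumptions for illustration, not the interface that was actually merged:

import collections
import mir_eval

def evaluate(ref_intervals, ref_labels, est_intervals, est_labels, trim=False):
    # Hypothetical per-submodule evaluate(): takes annotations (not
    # filenames), forwards tweakable parameters such as trim, and returns a
    # dictionary of nicely named scores.  Labels would feed the structure
    # metrics; they are unused in this abbreviated sketch.
    scores = collections.OrderedDict()
    precision, recall, f_measure = mir_eval.segment.detection(
        ref_intervals, est_intervals, window=0.5, trim=trim)
    scores['Precision@0.5'] = precision
    scores['Recall@0.5'] = recall
    scores['F-measure@0.5'] = f_measure
    return scores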

urinieto commented Aug 19, 2014

I would love to have something like that. I have used mir_eval in three different projects now, and I always end up thinking that some sort of behavior like the one you described would be pretty useful.

However, it might be a bit tricky to settle on the exact parameters for all the metrics (e.g. I like to have both the trimmed and non-trimmed versions of the boundaries eval for both the 0.5 and 3 second windows). Maybe this function should return a dictionary with all the metrics computed with the default params only (i.e. not accepting any parameters other than the reference and estimation).
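
For reference, the four boundary-detection variants mentioned above, sketched against the current mir_eval.segment.detection signature (assumed here; the 2014 module layout may have differed):

import numpy as np
import mir_eval

# Toy segment boundaries (intervals in seconds), for illustration only.
ref_intervals = np.array([[0.0, 10.0], [10.0, 20.0], [20.0, 30.0]])
est_intervals = np.array([[0.0, 11.0], [11.0, 30.0]])

results = {}
for window in (0.5, 3.0):
    for trim in (False, True):
        _, _, f = mir_eval.segment.detection(
            ref_intervals, est_intervals, window=window, trim=trim)
        key = 'F-measure@{:g}s{}'.format(window, ' (trimmed)' if trim else '')
        results[key] = f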

craffel commented Aug 19, 2014

So the evaluators currently just implement the standard set of metrics, taking in any parameters that might warrant changing. Take a look at the segment evaluator's evaluate function:
https://github.com/craffel/mir_eval/blob/43aa35903edea4249281e2fd49b57da9ec43ce0a/evaluators/segment_eval.py#L21
It takes in the trim parameter. The proposed submodule evaluate functions would work in the same way.
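
Continuing the hypothetical sketches above (all names there are illustrative), a caller could override such parameters directly:

# Parameters like trim pass straight through to the underlying metric;
# when omitted, the defaults apply.
ref_labels = ['A', 'B', 'A']  # toy labels matching the toy intervals above
est_labels = ['A', 'B']
scores_default = evaluate(ref_intervals, ref_labels, est_intervals, est_labels)
scores_trimmed = evaluate(ref_intervals, ref_labels, est_intervals, est_labels,
                          trim=True)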

justinsalamon commented

In my use of mir_eval I have also found myself needing the functionality of the evaluate function outside of the evaluator... so I too think this would be a good move.

craffel commented Aug 26, 2014

OK, created an issue for this: #70.

craffel added a commit that referenced this issue Aug 26, 2014
…ome minor docstring changes and kwarg signature

craffel commented Aug 26, 2014

Removing the METRICS dictionary will require refactoring the test scripts, as covered by #48 and #61.
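
A rough sketch of what a refactored regression test could look like once it calls a per-submodule evaluate function instead of iterating over METRICS. The file layout, the mir_eval.segment.evaluate signature, and mir_eval.io.load_labeled_intervals are assumptions for illustration:

import glob
import json

import numpy as np
import mir_eval

def test_segment_regression():
    # Hypothetical layout: each stored .json holds the expected scores for a
    # matching pair of reference/estimate .lab files.
    for scores_file in glob.glob('data/segment/output*.json'):
        with open(scores_file) as f:
            expected = json.load(f)
        ref_file = scores_file.replace('output', 'ref').replace('.json', '.lab')
        est_file = scores_file.replace('output', 'est').replace('.json', '.lab')
        ref_intervals, ref_labels = mir_eval.io.load_labeled_intervals(ref_file)
        est_intervals, est_labels = mir_eval.io.load_labeled_intervals(est_file)
        scores = mir_eval.segment.evaluate(ref_intervals, ref_labels,
                                           est_intervals, est_labels)
        # Compare every stored score against the freshly computed one.
        for name, value in expected.items():
            assert np.allclose(scores[name], value, atol=1e-12)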

@craffel craffel self-assigned this Sep 4, 2014
craffel added a commit that referenced this issue Sep 5, 2014
@craffel craffel mentioned this issue Sep 5, 2014
craffel added a commit that referenced this issue Oct 9, 2014
This issue was closed.