
[MRG] Add benchmarking script for multilabel metrics #2643

Merged
merged 2 commits into scikit-learn:master from jnothman:bench_mutilabel_metrics on Dec 9, 2013

Conversation

jnothman
Member

jnothman commented Dec 7, 2013

These are not very important metrics in the context of scikit-learn. Yet whenever metric implementations get changed, people seem to be interested in how the change affects execution time. This makes such reports easy to produce.

This benchmarks metrics for different multilabel target formats, also giving us an idea of their relative performance. Benchmarks are otherwise parametrised by (number of samples, classes, average density of positive labels), one of which may be plotted against time.
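
For context, a minimal sketch of what such a benchmark loop can look like (this is not the script added by this PR; the data generator and the particular metric and format choices below are illustrative assumptions):

import time
import numpy as np
import scipy.sparse as sp
from sklearn.metrics import accuracy_score, f1_score

def make_indicator(n_samples=1000, n_classes=4, density=0.2, seed=0):
    # Random dense multilabel indicator matrix with roughly the given density
    rng = np.random.RandomState(seed)
    return (rng.rand(n_samples, n_classes) < density).astype(int)

def bench(metric, y_true, y_pred, n_times=5):
    # Mean wall-clock time of `metric` over `n_times` calls
    start = time.time()
    for _ in range(n_times):
        metric(y_true, y_pred)
    return (time.time() - start) / n_times

y_true = make_indicator(seed=0)
y_pred = make_indicator(seed=1)

FORMATS = {'dense': np.asarray, 'csr': sp.csr_matrix}
METRICS = {'f1-macro': lambda a, b: f1_score(a, b, average='macro'),
           'subset accuracy': accuracy_score}

for fmt_name, to_fmt in sorted(FORMATS.items()):
    for metric_name, metric in sorted(METRICS.items()):
        t = bench(metric, to_fmt(y_true), to_fmt(y_pred))
        print('%-8s %-16s %.4fs' % (fmt_name, metric_name, t))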

@coveralls

Coverage Status

Coverage remained the same when pulling fa5acb2 on jnothman:bench_mutilabel_metrics into 66a5a4a on scikit-learn:master.

@larsmans
Member

larsmans commented Dec 7, 2013

LGTM. Ping @arjoly.

Care to do a quick review of #2642 for me? :)

def benchmark(metrics=[v for k, v in sorted(METRICS.items())],
              formats=[v for k, v in sorted(FORMATS.items())],
              samples=1000, classes=4, density=.2,
              n_times=5):
Member

Can you use a tuple instead of a list for the function arguments?

Member Author

You're concerned that they're mutable?

Member

Yes, I think it is better to use immutable defaults for function arguments.
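
To make the suggestion concrete, the signature with immutable defaults would look roughly like this (METRICS and FORMATS below are stand-ins for the script's module-level registries, so this is a sketch rather than the merged code):

# Stand-ins for the script's module-level registries of metrics and
# target-format constructors; the real script defines its own.
METRICS = {'f1': None, 'accuracy': None}
FORMATS = {'dense': None, 'csr': None}

# Tuples are immutable, so the defaults cannot be accidentally mutated
# between calls the way list defaults can.
def benchmark(metrics=tuple(v for k, v in sorted(METRICS.items())),
              formats=tuple(v for k, v in sorted(FORMATS.items())),
              samples=1000, classes=4, density=.2,
              n_times=5):
    pass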

@arjoly
Member

arjoly commented Dec 9, 2013

Except for the minor comments, +1 to merge.

@jnothman
Member Author

jnothman commented Dec 9, 2013

@arjoly Okay, yes, it's quick-and-dirty code. I don't think that's a big deal for benchmarks, but I'll get some of the lint out of it.

@arjoly
Member

arjoly commented Dec 9, 2013

Thanks!

@arjoly
Member

arjoly commented Dec 9, 2013

Your benchmark could be improved by adding dense C-layout and dense Fortran-layout.

@jnothman
Member Author

jnothman commented Dec 9, 2013

Your benchmark could be improved by adding dense C-layout and dense Fortran-layout.

Only if you want to see closely-overlapping curves...
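
To make the suggestion above concrete, adding those layouts to the benchmarked formats could look like the following sketch (the registry name and entries here are illustrative, not the merged code):

import numpy as np

# Hypothetical format-registry entries: coerce the dense indicator matrix
# to C-contiguous or Fortran-contiguous memory layout before timing.
EXTRA_FORMATS = {
    'dense C-order': np.ascontiguousarray,
    'dense F-order': np.asfortranarray,
}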

@coveralls

Coverage Status

Coverage remained the same when pulling 84ec4f9 on jnothman:bench_mutilabel_metrics into 66a5a4a on scikit-learn:master.

arjoly added a commit that referenced this pull request Dec 9, 2013
[MRG] Add benchmarking script for multilabel metrics
arjoly merged commit 5f57f85 into scikit-learn:master Dec 9, 2013
@arjoly
Member

arjoly commented Dec 9, 2013

Merged! Thanks for the bench!!
