Support reporting geometric mean by benchmark tags #132

Merged
vstinner merged 3 commits into psf:main on Jun 16, 2022

Conversation

mdboom
Collaborator

@mdboom mdboom commented May 24, 2022

Addresses python/pyperformance#208

This reports geometric mean organized by the tag(s) assigned to each benchmark.
This will allow us to include benchmarks in the pyperformance suite that we
don't necessarily want to include in "one big overall number" to represent progress.
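
As a rough illustration of the idea (not the actual pyperf implementation), per-benchmark speed ratios can be grouped by their tags and summarized with one geometric mean per group; the benchmark names, ratios, and tag values below are made up:

    import statistics
    from collections import defaultdict

    # Hypothetical data: (benchmark name, changed/reference time ratio, tags).
    results = [
        ("bench_a", 1.05, ["micro"]),
        ("bench_b", 0.90, ["apps"]),
        ("bench_c", 1.20, ["micro", "apps"]),
    ]

    groups = defaultdict(list)
    for name, ratio, tags in results:
        groups["all"].append(ratio)      # overall geometric mean
        for tag in tags:
            groups[tag].append(ratio)    # one group per tag

    for tag, ratios in sorted(groups.items()):
        print(f"Geometric mean ({tag}): {statistics.geometric_mean(ratios):.2f}x")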

mdboom added a commit to mdboom/pyperformance that referenced this pull request May 24, 2022
This requires the following changes to pyperf first: psf/pyperf#132
pyperf/_metadata.py: two review comments (outdated, resolved)
@vstinner
Member

It seems like you want to add a new metadata key. In that case, you should specify it in the documentation: https://pyperf.readthedocs.io/en/latest/api.html#metadata
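
For illustration only, a minimal sketch of how a benchmark might attach such a metadata entry through pyperf's Runner; the "tags" key name and its value format are assumptions based on this PR, not documented API:

    import pyperf

    def list_mult():
        return [1, 2, 3] * 1000

    runner = pyperf.Runner()
    # Assumption: tags travel as a regular metadata entry; the key name
    # "tags" and the value format come from this PR and may change.
    runner.metadata['tags'] = 'micro'
    runner.bench_func('list_mult', list_mult)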

I dislike changing the default formatting to add "(all)". Can you try to omit "(all)"? For example, if no benchmark has tags, it's weird to display the magic "all" tag.

            +----------------------+---------------------+-----------------------+
            | Benchmark            | mult_list_py36_tags | mult_list_py37_tags   |
            +======================+=====================+=======================+
            | [1]*1000             | 2.13 us             | 2.09 us: 1.02x faster |
            +----------------------+---------------------+-----------------------+
            | [1,2]*1000           | 3.70 us             | 5.28 us: 1.42x slower |
            +----------------------+---------------------+-----------------------+
            | [1,2,3]*1000         | 4.61 us             | 6.05 us: 1.31x slower |
            +----------------------+---------------------+-----------------------+
            | Geometric mean (all) | (ref)               | 1.22x slower          |
            +----------------------+---------------------+-----------------------+
            | Geometric mean (bar) | (ref)               | 1.37x slower          |
            +----------------------+---------------------+-----------------------+
            | Geometric mean (foo) | (ref)               | 1.18x slower          |
            +----------------------+---------------------+-----------------------+

In this table, it's also not easy for me to understand which benchmarks are used to compute the geometric means for each tag, since benchmark tags are not listed. Would it make sense to list tags?
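
For reference, the "(all)" row follows directly from the timings shown in the table, while the benchmarks behind the "(bar)" and "(foo)" rows are exactly what the table does not show; a quick check:

    import statistics

    # Ratios taken from the table above: py37 time / py36 time.
    ratios = [2.09 / 2.13, 5.28 / 3.70, 6.05 / 4.61]
    print(statistics.geometric_mean(ratios))  # ~1.22 -> "1.22x slower"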

@vstinner
Member

Do you have real examples of tags on benchmarks? I mean what are real tag values?

@vstinner
Member

> In this table, it's also not easy for me to understand which benchmarks are used to compute the geometric means for each tag, since benchmark tags are not listed. Would it make sense to list tags?

Another option is to render one table per tag: it would only list benchmarks matching this tag, and so the "geometric mean" final row would summarize the table. And there would always be a last table with all benchmarks.
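
A rough sketch of that layout, with a hypothetical render_table callable standing in for pyperf's actual table rendering:

    def render_per_tag_tables(benchmarks, render_table):
        # `benchmarks` is assumed to expose .name and .tags (list of strings);
        # `render_table` is a hypothetical callable that formats one table.
        tags = sorted({tag for bench in benchmarks for tag in bench.tags})
        for tag in tags:
            tagged = [b for b in benchmarks if tag in b.tags]
            render_table(title=f"Benchmarks tagged '{tag}'", benchmarks=tagged)
        # The last table always covers the whole suite, so its geometric
        # mean row summarizes everything.
        render_table(title="All benchmarks", benchmarks=benchmarks)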

@mdboom
Collaborator Author

mdboom commented May 25, 2022

> Do you have real examples of tags on benchmarks? I mean what are real tag values?

We're mostly doing this work in anticipation of cleaning up the tags to make them more useful. The motivation is to avoid overoptimizing for microbenchmarks, of which there are currently many in the suite. There's further discussion of how we might use tags going forward.

> Another option is to render one table per tag: it would only list benchmarks matching this tag, and so the "geometric mean" final row would summarize the table. And there would always be a last table with all benchmarks.

I like this idea. It would also resolve your other comment about 'all' being surprising in the untagged case.

Addresses python/pyperformance#208

This reports geometric mean organized by the tag(s) assigned to each benchmark.
This will allow us to include benchmarks in the pyperformance suite that we
don't necessarily want to include in "one big overall number" to represent progress.
@mdboom
Collaborator Author

mdboom commented Jun 13, 2022

@vstinner: Do these changes work for you?

+----------------+---------------------+-----------------------+
| Geometric mean | (ref)               | 1.22x slower          |
+----------------+---------------------+-----------------------+
@vstinner
Member


Oh wow, that looks great, thank you!

@vstinner vstinner left a comment


LGTM.

@vstinner
Member

Sadly, the tests fail:

- FileNotFoundError: [Errno 2] No such file or directory: '/home/runner/work/pyperf/pyperf/pyperf/tests/mult_list_py36_tags.json'

@mdboom
Collaborator Author

mdboom commented Jun 16, 2022

> Sadly, the tests fail:
>
> - FileNotFoundError: [Errno 2] No such file or directory: '/home/runner/work/pyperf/pyperf/pyperf/tests/mult_list_py36_tags.json'

Sorry -- forgot to commit the new test files -- let's see how this works.

EDIT: I guess this needs @vstinner or someone to re-approve the CI run.

@mdboom mdboom requested a review from vstinner June 16, 2022 19:29
@vstinner vstinner merged commit 2430bf8 into psf:main Jun 16, 2022
mdboom added a commit to mdboom/pyperformance that referenced this pull request Aug 10, 2022
This requires the following changes to pyperf first: psf/pyperf#132
ericsnowcurrently pushed a commit to python/pyperformance that referenced this pull request Aug 15, 2022
* Support reporting geometric mean by tags

This requires the following changes to pyperf first: psf/pyperf#132

* Ensure `tags` is always a list

* Use property

* Update pyperf