Performance testing infrastructure. #10117

jmchilton · 2020-08-17T13:43:39Z

No description provided.

.github/workflows/perf_tests.yaml

hexylena · 2020-12-08T09:12:16Z

I like the idea of using statsd but it seems excessive and I'm worried the telegraf scraping is at too wide for test cases and it seems like unnecessary infrastructure in the context of narrow API

telegraf/statsd does seem a bit excessive as you suggest, have you considered something like https://github.com/airspeed-velocity/asv/ ? I tested it out with gxadmin (not a normal use case) and it was pretty nice for per-commit/per-infra setup performance numbers.

mvdbeek · 2020-12-08T09:27:12Z

That does look interesting, but the docs are a bit basic. Did you manage to do timings around log statements ? The benchmarks all seem to revolve around total runtime when asv runs the actual code, which doesn't seem like it would be applicable / detailed enough ?

hexylena · 2020-12-08T09:31:38Z

Hey @mvdbeek. I didn't try and do timings around log statements, only around select function calls. I think it grew out of benchmarking individual numpy function calls which felt like a good fit for galaxy, but I didn't look closely at the PR and if you're doing it based off of the diff between two log statements, maybe it isn't such an easy solution.

That said, if you want to use their viz infrastructure, since you're already writing out JSON, you could just write something like their JSON format (ugly I know) and get the nice auto-generated website for free?

.github/workflows/perf_tests.yaml

jmchilton · 2020-12-15T17:13:20Z

I spent some time looking at allure for another context, it wasn't exactly what I'd want for this but it might be able to get there with the right plugin and some bootstrap. The numpy benchmarking tool looks awesome though.

What this PR currently produces is comparisons of the PR branch versus the target that look like:

You can download these artifacts from the Github action right now. There is another version that is just the timings without the comparison that I wanted to include on all the tests - but it really slowed down the tests immensely and to the point where they didn't get close to finishing it looks like so I reverted that change in the latest commit and just did the PR vs target comparison for the performance tests instead of for all the API tests.

jmchilton · 2021-01-04T16:20:07Z

Older version of this PR backed up (https://github.com/jmchilton/galaxy/pull/new/performance_tests_backup). Did a very stripped down rebase into a single commit just now.

jdavcs · 2021-01-07T08:02:35Z

Can I runtest_workflow_framework_performance.py as a standalone test? I'm getting a logging error: ValueError: I/O operation on closed file - is this because I missed an option I should've set or because it's not supposed to be run separately?

jmchilton · 2021-01-14T20:32:47Z

It works for me to just run it directly with pytest (https://gist.github.com/jmchilton/e6007becb21db94db4c392b861094f6d). Is there any more context around that error?

jdavcs · 2021-01-14T20:51:28Z

Thank you for the gist - that's exactly the output I got. I was referring to this error (which, I assume, is not relevant) - https://gist.github.com/jmchilton/e6007becb21db94db4c392b861094f6d#file-gistfile1-txt-L36
I'll review this tonight, or tomorrow at the latest.

scripts/tests_markdown.tpl

scripts/tests_markdown.py

jdavcs · 2021-02-10T02:19:26Z

scripts/tests_markdown_compare.tpl

+        | **Sum** *(ms)* | ``{{ has_metrics.values() | map(attribute="sum") | join("`` | ``") }}`` |
+        | **Median** *(ms)* | ``{{ has_metrics.values() | map(attribute="median") | join("`` | ``") }}`` |
+        | **Mean** *(ms)* | ``{{ has_metrics.values() | map(attribute="mean") | join("`` | ``") }}`` |
+        | **Standard Deviation** | ``{{ has_metrics.values() | map(attribute="stdev") | join("`` | ``") }}`` |


same formatting suggestion as in tests_markdown.tpl

.github/workflows/perf_comp_api_tests.yaml

scripts/tests_markdown.py

jdavcs

This is so very cool! (that is, once I figured out how to run the tests and generate the reports!) I'm very sorry it took me so long to review this - now it has to be rebased. All my comments are minor suggestions only.

jmchilton · 2021-02-13T17:02:15Z

Exploiting that approve to merge without a final check because I need to show some progress at PI meeting Monday, but I just rebased with the above comments. Thanks for the detailed review @ic4f!

jdavcs · 2021-02-15T16:27:41Z

Thank you for putting all that work into addressing those minor comments! Again, sorry for the delayed review @jmchilton !

jmchilton added kind/enhancement area/testing area/performance labels Aug 17, 2020

galaxybot added the status/WIP label Aug 17, 2020

jmchilton force-pushed the performance_tests branch 8 times, most recently from bedb635 to 64691f1 Compare August 21, 2020 15:58

jmchilton force-pushed the performance_tests branch from 74406f3 to 3345511 Compare December 2, 2020 14:28

github-actions bot added the area/testing/selenium label Dec 2, 2020

jmchilton force-pushed the performance_tests branch from f618487 to 3152de7 Compare December 2, 2020 16:44

github-actions bot added area/dependencies area/scripts area/testing/api area/testing/integration labels Dec 2, 2020

jmchilton force-pushed the performance_tests branch 2 times, most recently from 0753f12 to bbaaacd Compare December 2, 2020 22:43

jdavcs reviewed Dec 7, 2020

View reviewed changes

.github/workflows/perf_tests.yaml Outdated Show resolved Hide resolved

jdavcs reviewed Dec 7, 2020

View reviewed changes

.github/workflows/perf_tests.yaml Outdated Show resolved Hide resolved

jdavcs reviewed Dec 8, 2020

View reviewed changes

.github/workflows/perf_tests.yaml Outdated Show resolved Hide resolved

jmchilton force-pushed the performance_tests branch 2 times, most recently from 8d40a69 to 709b1c6 Compare December 10, 2020 20:47

jmchilton mentioned this pull request Dec 14, 2020

Performance Testing Metrics Aggregation and Comparison #10915

Open

jmchilton force-pushed the performance_tests branch from 7d51824 to 7a87575 Compare January 4, 2021 16:19

jmchilton force-pushed the performance_tests branch from 7a87575 to e86f343 Compare January 4, 2021 16:50

jmchilton changed the title ~~[WIP] Performance testing infrastructure.~~ Performance testing infrastructure. Jan 4, 2021

jmchilton removed the status/WIP label Jan 4, 2021

jmchilton requested a review from jdavcs January 4, 2021 20:50

jmchilton force-pushed the performance_tests branch from e86f343 to 6ae49af Compare January 21, 2021 20:35