
CI: add benchmark test suite #8538

Merged: 6 commits from rouault:benchmark into OSGeo:master, Oct 19, 2023

Conversation

@rouault (Member) commented Oct 10, 2023

  • Uses pytest-benchmark (a minimal sketch of the pattern follows this list)
  • Adds an autotest/benchmark module (currently with only a few tests for the GTiff and GPKG drivers)
  • Adds a new CI benchmark configuration, which builds the current tree and a reference tree, and runs the benchmark test suite in both
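
For orientation, pytest-benchmark tests request a benchmark fixture and hand it the callable to time. A minimal sketch of what a GTiff benchmark could look like (the input file and expected checksum are illustrative, not taken from this PR):

from osgeo import gdal


def test_gtiff_read(benchmark):
    def read():
        ds = gdal.Open("data/byte.tif")  # illustrative input file
        return ds.GetRasterBand(1).Checksum()

    # benchmark() calls read() over several rounds and records the statistics
    result = benchmark(read)
    assert result == 4672  # illustrative expected checksum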

@rouault force-pushed the benchmark branch 2 times, most recently from 86c21dd to ddb00d5 on October 10, 2023 16:52
@rouault (Member, Author) commented Oct 10, 2023

CC @dbaston

@rouault force-pushed the benchmark branch 4 times, most recently from 1b8bd41 to f6466d9 on October 10, 2023 17:50
return filename


def test_ogr_gpkg_spatial_index(source_file):
A reviewer (Member) commented on the hunk above:

I think bringing in the benchmark fixture explicitly would let you more clearly exclude the setup cost. It's not clear to me if the runtime of source_file is part of the timing or not.

@rouault (Member, Author) replied:

> It's not clear to me if the runtime of source_file is part of the timing or not.

It is not; comparing timings with the somewhat more verbose version below, source_file is run once, as expected:

from osgeo import ogr


def test_ogr_gpkg_spatial_index(source_file, benchmark):
    # Decorating do() with the benchmark fixture registers it as the timed callable
    @benchmark
    def do():
        ds = ogr.Open(source_file)
        lyr = ds.GetLayer(0)
        lyr.SetSpatialFilterRect(1000, 1000, 10000, 10000)
        count = 0
        for f in lyr:
            count += 1
        assert count == 10000 - 1000 + 1
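
For reference, an explicit alternative along the lines the reviewer suggests is benchmark.pedantic, which keeps file creation out of the timed region entirely; in this sketch, create_gpkg_file is a hypothetical stand-in for the fixture's setup logic:

from osgeo import ogr


def test_ogr_gpkg_spatial_index_pedantic(benchmark, tmp_path):
    def setup():
        # Not timed: pedantic() calls setup() before each round and passes
        # its return value to the target as (args, kwargs).
        filename = create_gpkg_file(tmp_path)  # hypothetical helper
        return (filename,), {}

    def run(filename):
        ds = ogr.Open(filename)
        lyr = ds.GetLayer(0)
        lyr.SetSpatialFilterRect(1000, 1000, 10000, 10000)
        assert sum(1 for _ in lyr) == 10000 - 1000 + 1

    benchmark.pedantic(run, setup=setup, rounds=5)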

autotest/conftest.py (outdated review thread, resolved)
@@ -269,6 +276,13 @@ jobs:
TEST_CMD="ctest -V -j $(nproc)"
fi

if test "${{ matrix.id }}" = "benchmarks"; then
A reviewer (Member) commented on the hunk above:

Move into test.sh?

@rouault (Member, Author) replied:

I don't think this can be done under Docker. Anyway, it doesn't look like this branch is even hit (perhaps because we already run in a container/VM that doesn't expose the turbo-boost setting).

@rouault (Member, Author) commented Oct 10, 2023

d35198a fails as expected (https://github.com/OSGeo/gdal/actions/runs/6473107275/job/17575133870?pr=8538):

 Performance has regressed:
	test_gtiff_byte (0001_ref) - Field 'mean' has failed PercentageRegressionCheck: 100.708833475 > 5.000000000
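
That message is pytest-benchmark's --benchmark-compare-fail check firing. The 5% threshold on the 'mean' field in the output suggests the comparison step amounts to something like this sketch (the test path and saved-run name are assumptions based on the output above):

import pytest

# Compare the current run against the saved reference run and fail if the
# 'mean' field has regressed by more than 5% (matching the message above).
pytest.main(
    [
        "autotest/benchmark",
        "--benchmark-compare=0001_ref",
        "--benchmark-compare-fail=mean:5%",
    ]
)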

@rouault (Member, Author) commented Oct 10, 2023

Changing to use the min statistic, as suggested by @DFEvans on the mailing list.
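
Presumably this amounts to gating the comparison on the 'min' field rather than 'mean', along these lines (same assumed paths as the sketch above):

import pytest

# 'min' is less sensitive than 'mean' to one-off scheduling hiccups on
# shared CI runners, at the cost of ignoring run-to-run variance.
pytest.main(
    [
        "autotest/benchmark",
        "--benchmark-compare=0001_ref",
        "--benchmark-compare-fail=min:5%",
    ]
)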

@coveralls (Collaborator) commented Oct 10, 2023

Coverage Status

coverage: 67.761% (+0.001%) from 67.76% when pulling 974175b on rouault:benchmark into f3bc13d on OSGeo:master.

@rouault (Member, Author) commented Oct 11, 2023

@dbaston I've written more tests, and it seems we now hit unreliable timings. Initial failed run in https://github.com/OSGeo/gdal/actions/runs/6482985166/job/17603597567, using the naive procedure (running the reference test suite, then the PR's one).

I've attempted to modify the test procedure (cf. the updated benchmark/test.sh) to do more runs to mitigate this, but as https://github.com/OSGeo/gdal/actions/runs/6485533623/job/17611807035?pr=8538 shows, this obviously doesn't work: the first comparison run has just one test slightly over my arbitrary 5% criterion, but the retry has 2 tests 11% and 21% slower.

Any ideas (other than significantly increasing the tolerance threshold to, say, 30% to hopefully be robust, at the cost of no longer catching small perf regressions), or are we hitting a dead end?

Hmm, looking at https://pytest-benchmark.readthedocs.io/en/stable/faq.html, maybe use an alternate timer: "You could use something like time.process_time (Python 3.3+ only) as the timer. Process time doesn't include sleeping or waiting for I/O." This assumes the differences in timings result from the VM being randomly scheduled out by the hypervisor.
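
A quick illustration of the difference between the timers: time.process_time counts only CPU time charged to the process, so intervals where the runner is scheduled out (simulated here with a sleep) disappear from the measurement, which is what --benchmark-timer=time.process_time exploits:

import time

t_wall = time.perf_counter()
t_cpu = time.process_time()
time.sleep(0.5)  # stands in for the VM being scheduled out by the hypervisor
print(f"perf_counter: {time.perf_counter() - t_wall:.2f}s")  # ~0.50s (wall clock)
print(f"process_time: {time.process_time() - t_cpu:.2f}s")  # ~0.00s (CPU only)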

@rouault (Member, Author) commented Oct 11, 2023

More stable timings with --benchmark-timer=time.process_time in https://github.com/OSGeo/gdal/actions/runs/6486353872/job/17614377148?pr=8538.

@rouault (Member, Author) commented Oct 11, 2023

A retry failed: https://github.com/OSGeo/gdal/actions/runs/6486353872/job/17617138599?pr=8538. Up to 16% slowdown on one test. Bumping tolerance to 20%...

@rouault (Member, Author) commented Oct 19, 2023

merging this

@rouault merged commit 32ec2a9 into OSGeo:master on Oct 19, 2023. 31 checks passed.