Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add memory benchmarks #5960

Merged
merged 7 commits into from
May 21, 2024
Merged

Add memory benchmarks #5960

merged 7 commits into from
May 21, 2024

Conversation

stephenworsley
Copy link
Contributor

🚀 Pull Request

Description

Extends existing benchmarks by adding a version which tracks memory for functions where iris may be responsible for memory handling (e.g. by calling dask in a specific way, as resolved in #5767). Memory benchmarks are repeated so that memory leaks may be better detected.

@stephenworsley stephenworsley added the benchmark_this Request that this pull request be benchmarked to check if it introduces performance shifts label May 20, 2024
Copy link

codecov bot commented May 20, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 89.78%. Comparing base (0909918) to head (b63926c).

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #5960   +/-   ##
=======================================
  Coverage   89.78%   89.78%           
=======================================
  Files          93       93           
  Lines       23007    23007           
  Branches     5017     5017           
=======================================
  Hits        20657    20657           
  Misses       1620     1620           
  Partials      730      730           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@trexfeathers trexfeathers added benchmark_this Request that this pull request be benchmarked to check if it introduces performance shifts and removed benchmark_this Request that this pull request be benchmarked to check if it introduces performance shifts labels May 20, 2024
@stephenworsley stephenworsley added benchmark_this Request that this pull request be benchmarked to check if it introduces performance shifts and removed benchmark_this Request that this pull request be benchmarked to check if it introduces performance shifts labels May 20, 2024
@stephenworsley stephenworsley marked this pull request as ready for review May 20, 2024 10:35
Copy link
Contributor

⏱️ Performance Benchmark Report: 3bf0041

Performance shifts

Full benchmark results

Benchmarks that have stayed the same:

| Change   | Before [c956403c]    | After [3bf00417]    | Ratio   | Benchmark (Parameter)                                                                                |
|----------|----------------------|---------------------|---------|------------------------------------------------------------------------------------------------------|
|          | 53.8±1ms             | 53.7±0.6ms          | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_COUNT(False)                                       |
|          | 55.2±0.7ms           | 54.1±0.6ms          | 0.98    | aggregate_collapse.Aggregation.time_aggregated_by_COUNT(True)                                        |
|          | 191±2ms              | 189±2ms             | 0.99    | aggregate_collapse.Aggregation.time_aggregated_by_FAST_PERCENTILE(False)                             |
|          | 191±4ms              | 189±3ms             | 0.99    | aggregate_collapse.Aggregation.time_aggregated_by_FAST_PERCENTILE(True)                              |
|          | 36.6±0.4ms           | 36.0±0.4ms          | 0.98    | aggregate_collapse.Aggregation.time_aggregated_by_GMEAN(False)                                       |
|          | 37.5±0.8ms           | 37.2±0.8ms          | 0.99    | aggregate_collapse.Aggregation.time_aggregated_by_GMEAN(True)                                        |
|          | 36.7±0.5ms           | 37.1±0.5ms          | 1.01    | aggregate_collapse.Aggregation.time_aggregated_by_HMEAN(False)                                       |
|          | 37.3±0.4ms           | 37.0±0.7ms          | 0.99    | aggregate_collapse.Aggregation.time_aggregated_by_HMEAN(True)                                        |
|          | 46.6±1ms             | 46.6±0.9ms          | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_MAX(False)                                         |
|          | 47.7±0.6ms           | 47.4±0.8ms          | 0.99    | aggregate_collapse.Aggregation.time_aggregated_by_MAX(True)                                          |
|          | 120±0.5ms            | 120±2ms             | 1.01    | aggregate_collapse.Aggregation.time_aggregated_by_MAX_RUN(False)                                     |
|          | 120±1ms              | 119±1ms             | 0.99    | aggregate_collapse.Aggregation.time_aggregated_by_MAX_RUN(True)                                      |
|          | 51.4±0.8ms           | 51.2±0.9ms          | 0.99    | aggregate_collapse.Aggregation.time_aggregated_by_MEAN(False)                                        |
|          | 52.1±0.9ms           | 52.0±0.7ms          | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_MEAN(True)                                         |
|          | 36.1±0.4ms           | 36.2±1ms            | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_MEDIAN(False)                                      |
|          | 37.2±0.5ms           | 37.2±0.5ms          | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_MEDIAN(True)                                       |
|          | 46.8±0.4ms           | 46.7±1ms            | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_MIN(False)                                         |
|          | 47.6±0.8ms           | 47.7±0.7ms          | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_MIN(True)                                          |
|          | 1.33±0.01s           | 1.32±0.01s          | 0.99    | aggregate_collapse.Aggregation.time_aggregated_by_PEAK(False)                                        |
|          | 1.31±0.01s           | 1.30±0.01s          | 0.99    | aggregate_collapse.Aggregation.time_aggregated_by_PEAK(True)                                         |
|          | 664±10ms             | 661±10ms            | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_PERCENTILE(False)                                  |
|          | 673±20ms             | 683±10ms            | 1.01    | aggregate_collapse.Aggregation.time_aggregated_by_PERCENTILE(True)                                   |
|          | 35.0±0.4ms           | 34.9±0.3ms          | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_PROPORTION(False)                                  |
|          | 35.5±0.5ms           | 35.6±0.4ms          | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_PROPORTION(True)                                   |
|          | 61.3±0.7ms           | 61.2±0.7ms          | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_RMS(False)                                         |
|          | 61.9±0.9ms           | 61.4±0.5ms          | 0.99    | aggregate_collapse.Aggregation.time_aggregated_by_RMS(True)                                          |
|          | 66.5±0.9ms           | 65.3±0.8ms          | 0.98    | aggregate_collapse.Aggregation.time_aggregated_by_STD_DEV(False)                                     |
|          | 66.6±1ms             | 66.3±0.9ms          | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_STD_DEV(True)                                      |
|          | 61.5±0.5ms           | 61.4±0.9ms          | 1.00    | aggregate_collapse.Aggregation.time_aggregated_by_VARIANCE(False)                                    |
|          | 62.7±0.9ms           | 62.0±0.9ms          | 0.99    | aggregate_collapse.Aggregation.time_aggregated_by_VARIANCE(True)                                     |
|          | 20.4±0.4ms           | 19.7±0.5ms          | 0.97    | aggregate_collapse.Aggregation.time_collapsed_by_COUNT(False)                                        |
|          | 23.9±0.5ms           | 23.6±0.5ms          | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_COUNT(True)                                         |
|          | 132±2ms              | 130±0.9ms           | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_FAST_PERCENTILE(False)                              |
|          | 144±3ms              | 142±2ms             | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_FAST_PERCENTILE(True)                               |
|          | 18.5±0.6ms           | 18.3±0.4ms          | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_GMEAN(False)                                        |
|          | 22.4±0.3ms           | 21.9±0.3ms          | 0.97    | aggregate_collapse.Aggregation.time_collapsed_by_GMEAN(True)                                         |
|          | 18.5±0.4ms           | 18.3±0.6ms          | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_HMEAN(False)                                        |
|          | 22.3±0.7ms           | 22.3±0.5ms          | 1.00    | aggregate_collapse.Aggregation.time_collapsed_by_HMEAN(True)                                         |
|          | 19.0±0.4ms           | 18.8±0.5ms          | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_MAX(False)                                          |
|          | 22.7±0.4ms           | 22.7±0.7ms          | 1.00    | aggregate_collapse.Aggregation.time_collapsed_by_MAX(True)                                           |
|          | 34.5±1ms             | 34.3±1ms            | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_MAX_RUN(False)                                      |
|          | 38.2±1ms             | 38.0±2ms            | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_MAX_RUN(True)                                       |
|          | 19.3±0.3ms           | 19.6±0.4ms          | 1.01    | aggregate_collapse.Aggregation.time_collapsed_by_MEAN(False)                                         |
|          | 23.0±0.6ms           | 23.1±0.3ms          | 1.00    | aggregate_collapse.Aggregation.time_collapsed_by_MEAN(True)                                          |
|          | 19.3±0.3ms           | 19.1±0.6ms          | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_MEDIAN(False)                                       |
|          | 23.4±0.6ms           | 22.8±0.5ms          | 0.98    | aggregate_collapse.Aggregation.time_collapsed_by_MEDIAN(True)                                        |
|          | 18.8±0.4ms           | 18.7±0.5ms          | 1.00    | aggregate_collapse.Aggregation.time_collapsed_by_MIN(False)                                          |
|          | 22.5±0.3ms           | 22.7±0.5ms          | 1.01    | aggregate_collapse.Aggregation.time_collapsed_by_MIN(True)                                           |
|          | 550±7ms              | 548±2ms             | 1.00    | aggregate_collapse.Aggregation.time_collapsed_by_PEAK(False)                                         |
|          | 555±5ms              | 552±4ms             | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_PEAK(True)                                          |
|          | 149±2ms              | 149±2ms             | 1.00    | aggregate_collapse.Aggregation.time_collapsed_by_PERCENTILE(False)                                   |
|          | 165±1ms              | 165±1ms             | 1.00    | aggregate_collapse.Aggregation.time_collapsed_by_PERCENTILE(True)                                    |
|          | 18.2±0.6ms           | 17.8±0.8ms          | 0.97    | aggregate_collapse.Aggregation.time_collapsed_by_PROPORTION(False)                                   |
|          | 22.0±0.5ms           | 21.8±0.4ms          | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_PROPORTION(True)                                    |
|          | 21.6±0.6ms           | 21.2±0.3ms          | 0.98    | aggregate_collapse.Aggregation.time_collapsed_by_RMS(False)                                          |
|          | 24.7±0.8ms           | 24.7±0.3ms          | 1.00    | aggregate_collapse.Aggregation.time_collapsed_by_RMS(True)                                           |
|          | 21.5±0.3ms           | 21.4±0.3ms          | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_STD_DEV(False)                                      |
|          | 25.2±0.3ms           | 25.0±0.4ms          | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_STD_DEV(True)                                       |
|          | 20.9±0.5ms           | 20.7±0.3ms          | 0.99    | aggregate_collapse.Aggregation.time_collapsed_by_VARIANCE(False)                                     |
|          | 24.9±0.3ms           | 24.4±0.5ms          | 0.98    | aggregate_collapse.Aggregation.time_collapsed_by_VARIANCE(True)                                      |
|          | 83.4±1ms             | 83.1±1ms            | 1.00    | aggregate_collapse.WeightedAggregation.time_w_aggregated_by_MEAN(False)                              |
|          | 84.6±0.7ms           | 83.7±1ms            | 0.99    | aggregate_collapse.WeightedAggregation.time_w_aggregated_by_MEAN(True)                               |
|          | 95.1±1ms             | 94.2±0.7ms          | 0.99    | aggregate_collapse.WeightedAggregation.time_w_aggregated_by_RMS(False)                               |
|          | 96.3±2ms             | 94.9±1ms            | 0.98    | aggregate_collapse.WeightedAggregation.time_w_aggregated_by_RMS(True)                                |
|          | 57.5±0.6ms           | 57.4±0.5ms          | 1.00    | aggregate_collapse.WeightedAggregation.time_w_aggregated_by_SUM(False)                               |
|          | 59.1±1ms             | 57.4±0.9ms          | 0.97    | aggregate_collapse.WeightedAggregation.time_w_aggregated_by_SUM(True)                                |
|          | 29.6±0.4ms           | 29.5±0.6ms          | 1.00    | aggregate_collapse.WeightedAggregation.time_w_collapsed_by_MEAN(False)                               |
|          | 32.8±0.8ms           | 33.1±0.6ms          | 1.01    | aggregate_collapse.WeightedAggregation.time_w_collapsed_by_MEAN(True)                                |
|          | 31.5±0.7ms           | 31.0±0.6ms          | 0.98    | aggregate_collapse.WeightedAggregation.time_w_collapsed_by_RMS(False)                                |
|          | 34.7±0.5ms           | 35.0±0.7ms          | 1.01    | aggregate_collapse.WeightedAggregation.time_w_collapsed_by_RMS(True)                                 |
|          | 25.9±0.3ms           | 25.5±0.3ms          | 0.98    | aggregate_collapse.WeightedAggregation.time_w_collapsed_by_SUM(False)                                |
|          | 29.6±0.4ms           | 29.4±0.4ms          | 0.99    | aggregate_collapse.WeightedAggregation.time_w_collapsed_by_SUM(True)                                 |
|          | 325±3ms              | 320±3ms             | 0.98    | aggregate_collapse.WeightedAggregation.time_w_collapsed_by_WPERCENTILE(False)                        |
|          | 345±4ms              | 341±4ms             | 0.99    | aggregate_collapse.WeightedAggregation.time_w_collapsed_by_WPERCENTILE(True)                         |
|          | 1.13±0.01ms          | 1.13±0.01ms         | 1.00    | cube.CubeCreation.time_create(False, 'construct')                                                    |
|          | 407±5μs              | 404±3μs             | 0.99    | cube.CubeCreation.time_create(False, 'instantiate')                                                  |
|          | 957±9μs              | 965±20μs            | 1.01    | cube.CubeCreation.time_create(True, 'construct')                                                     |
|          | 590±10μs             | 593±5μs             | 1.01    | cube.CubeCreation.time_create(True, 'instantiate')                                                   |
|          | 222±4ms              | 223±3ms             | 1.00    | cube.CubeEquality.time_equality(False, False, 'all_equal')                                           |
|          | 112±1ms              | 110±1ms             | 0.98    | cube.CubeEquality.time_equality(False, False, 'coord_inequality')                                    |
|          | 234±4ms              | 232±2ms             | 0.99    | cube.CubeEquality.time_equality(False, False, 'data_inequality')                                     |
|          | 16.9±0.1μs           | 17.0±0.3μs          | 1.01    | cube.CubeEquality.time_equality(False, False, 'metadata_inequality')                                 |
|          | 307±3ms              | 303±4ms             | 0.99    | cube.CubeEquality.time_equality(False, True, 'all_equal')                                            |
|          | 198±4ms              | 196±3ms             | 0.99    | cube.CubeEquality.time_equality(False, True, 'coord_inequality')                                     |
|          | 316±8ms              | 312±2ms             | 0.99    | cube.CubeEquality.time_equality(False, True, 'data_inequality')                                      |
|          | 17.0±0.2μs           | 17.1±0.3μs          | 1.00    | cube.CubeEquality.time_equality(False, True, 'metadata_inequality')                                  |
|          | 223±1ms              | 223±2ms             | 1.00    | cube.CubeEquality.time_equality(True, False, 'all_equal')                                            |
|          | 112±1ms              | 111±2ms             | 0.99    | cube.CubeEquality.time_equality(True, False, 'coord_inequality')                                     |
|          | 235±2ms              | 233±3ms             | 0.99    | cube.CubeEquality.time_equality(True, False, 'data_inequality')                                      |
|          | 54.1±0.5μs           | 54.3±0.9μs          | 1.00    | cube.CubeEquality.time_equality(True, False, 'metadata_inequality')                                  |
|          | 306±4ms              | 305±4ms             | 0.99    | cube.CubeEquality.time_equality(True, True, 'all_equal')                                             |
|          | 196±1ms              | 196±2ms             | 1.00    | cube.CubeEquality.time_equality(True, True, 'coord_inequality')                                      |
|          | 315±3ms              | 317±3ms             | 1.00    | cube.CubeEquality.time_equality(True, True, 'data_inequality')                                       |
|          | 55.3±0.6μs           | 55.6±0.5μs          | 1.00    | cube.CubeEquality.time_equality(True, True, 'metadata_inequality')                                   |
|          | 397±8ns              | 416±40ns            | 1.05    | experimental.ugrid.regions_combine.CombineRegionsComputeRealData.time_compute_data(50)               |
|          | 258±3ms              | 259±2ms             | 1.00    | experimental.ugrid.regions_combine.CombineRegionsComputeRealData.time_compute_data(500)              |
|          | 14.6±0.09ms          | 14.6±0.4ms          | 1.00    | experimental.ugrid.regions_combine.CombineRegionsCreateCube.time_create_combined_cube(50)            |
|          | 17.0±1ms             | 16.7±0.6ms          | 0.98    | experimental.ugrid.regions_combine.CombineRegionsCreateCube.time_create_combined_cube(500)           |
|          | 1.0                  | 1.0                 | 1.00    | experimental.ugrid.regions_combine.CombineRegionsCreateCube.track_addedmem_create_combined_cube(50)  |
|          | 11.8                 | 11.8                | 1.00    | experimental.ugrid.regions_combine.CombineRegionsCreateCube.track_addedmem_create_combined_cube(500) |
|          | 106±1ms              | 106±2ms             | 1.00    | experimental.ugrid.regions_combine.CombineRegionsFileStreamedCalc.time_stream_file2file(50)          |
|          | 714±6ms              | 708±4ms             | 0.99    | experimental.ugrid.regions_combine.CombineRegionsFileStreamedCalc.time_stream_file2file(500)         |
|          | 66.8±0.8ms           | 66.3±1ms            | 0.99    | experimental.ugrid.regions_combine.CombineRegionsSaveData.time_save(50)                              |
|          | 665±5ms              | 664±5ms             | 1.00    | experimental.ugrid.regions_combine.CombineRegionsSaveData.time_save(500)                             |
|          | 2.1752849999999997   | 2.1752849999999997  | 1.00    | experimental.ugrid.regions_combine.CombineRegionsSaveData.track_filesize_saved(50)                   |
|          | 216.01528499999998   | 216.01528499999998  | 1.00    | experimental.ugrid.regions_combine.CombineRegionsSaveData.track_filesize_saved(500)                  |
|          | 663±5μs              | 665±10μs            | 1.00    | import_iris.Iris.time__concatenate                                                                   |
|          | 184±3μs              | 179±1μs             | 0.97    | import_iris.Iris.time__constraints                                                                   |
|          | 112±2μs              | 112±1μs             | 1.00    | import_iris.Iris.time__data_manager                                                                  |
|          | 94.7±0.5μs           | 94.3±0.7μs          | 1.00    | import_iris.Iris.time__deprecation                                                                   |
|          | 138±1μs              | 138±1μs             | 1.00    | import_iris.Iris.time__lazy_data                                                                     |
|          | 902±6μs              | 899±10μs            | 1.00    | import_iris.Iris.time__merge                                                                         |
|          | 78.3±0.8μs           | 78.5±0.8μs          | 1.00    | import_iris.Iris.time__representation                                                                |
|          | 485±7μs              | 483±5μs             | 0.99    | import_iris.Iris.time_analysis                                                                       |
|          | 140±1μs              | 140±0.9μs           | 1.01    | import_iris.Iris.time_analysis__area_weighted                                                        |
|          | 109±0.8μs            | 111±0.9μs           | 1.01    | import_iris.Iris.time_analysis__grid_angles                                                          |
|          | 248±4μs              | 243±1μs             | 0.98    | import_iris.Iris.time_analysis__interpolation                                                        |
|          | 189±3μs              | 186±1μs             | 0.99    | import_iris.Iris.time_analysis__regrid                                                               |
|          | 112±0.9μs            | 112±0.8μs           | 1.00    | import_iris.Iris.time_analysis__scipy_interpolate                                                    |
|          | 139±1μs              | 140±2μs             | 1.00    | import_iris.Iris.time_analysis_calculus                                                              |
|          | 329±6μs              | 327±1μs             | 0.99    | import_iris.Iris.time_analysis_cartography                                                           |
|          | 95.1±2μs             | 95.6±0.7μs          | 1.01    | import_iris.Iris.time_analysis_geomerty                                                              |
|          | 219±4μs              | 218±2μs             | 1.00    | import_iris.Iris.time_analysis_maths                                                                 |
|          | 99.2±1μs             | 99.3±0.5μs          | 1.00    | import_iris.Iris.time_analysis_stats                                                                 |
|          | 175±1μs              | 177±4μs             | 1.01    | import_iris.Iris.time_analysis_trajectory                                                            |
|          | 302±6μs              | 307±3μs             | 1.02    | import_iris.Iris.time_aux_factory                                                                    |
|          | 84.3±0.5μs           | 84.9±0.9μs          | 1.01    | import_iris.Iris.time_common                                                                         |
|          | 163±2μs              | 165±2μs             | 1.01    | import_iris.Iris.time_common_lenient                                                                 |
|          | 992±20μs             | 981±20μs            | 0.99    | import_iris.Iris.time_common_metadata                                                                |
|          | 135±2μs              | 134±1μs             | 0.99    | import_iris.Iris.time_common_mixin                                                                   |
|          | 1.18±0.1ms           | 1.17±0.01ms         | 0.99    | import_iris.Iris.time_common_resolve                                                                 |
|          | 203±1μs              | 201±2μs             | 0.99    | import_iris.Iris.time_config                                                                         |
|          | 115±1μs              | 116±0.9μs           | 1.01    | import_iris.Iris.time_coord_categorisation                                                           |
|          | 363±10μs             | 354±6μs             | 0.98    | import_iris.Iris.time_coord_systems                                                                  |
|          | 753±10μs             | 743±10μs            | 0.99    | import_iris.Iris.time_coords                                                                         |
|          | 664±10μs             | 664±10μs            | 1.00    | import_iris.Iris.time_cube                                                                           |
|          | 227±2μs              | 226±2μs             | 1.00    | import_iris.Iris.time_exceptions                                                                     |
|          | 78.6±0.4μs           | 80.3±0.5μs          | 1.02    | import_iris.Iris.time_experimental                                                                   |
|          | 187±3μs              | 192±2μs             | 1.02    | import_iris.Iris.time_fileformats                                                                    |
|          | 248±1μs              | 249±2μs             | 1.00    | import_iris.Iris.time_fileformats__ff                                                                |
|          | 2.72±0.04ms          | 2.71±0.02ms         | 1.00    | import_iris.Iris.time_fileformats__ff_cross_references                                               |
|          | 79.7±0.6μs           | 81.1±0.4μs          | 1.02    | import_iris.Iris.time_fileformats__pp_lbproc_pairs                                                   |
|          | 116±2μs              | 116±0.9μs           | 1.00    | import_iris.Iris.time_fileformats_abf                                                                |
|          | 357±8μs              | 356±6μs             | 1.00    | import_iris.Iris.time_fileformats_cf                                                                 |
|          | 5.41±0.08ms          | 5.39±0.07ms         | 1.00    | import_iris.Iris.time_fileformats_dot                                                                |
|          | 76.4±0.7μs           | 76.6±1μs            | 1.00    | import_iris.Iris.time_fileformats_name                                                               |
|          | 261±2μs              | 262±0.9μs           | 1.00    | import_iris.Iris.time_fileformats_name_loaders                                                       |
|          | 120±2μs              | 121±1μs             | 1.01    | import_iris.Iris.time_fileformats_netcdf                                                             |
|          | 124±1μs              | 124±1μs             | 1.00    | import_iris.Iris.time_fileformats_nimrod                                                             |
|          | 212±3μs              | 210±2μs             | 0.99    | import_iris.Iris.time_fileformats_nimrod_load_rules                                                  |
|          | 779±9μs              | 777±5μs             | 1.00    | import_iris.Iris.time_fileformats_pp                                                                 |
|          | 182±1μs              | 183±3μs             | 1.01    | import_iris.Iris.time_fileformats_pp_load_rules                                                      |
|          | 134±2μs              | 135±1μs             | 1.01    | import_iris.Iris.time_fileformats_pp_save_rules                                                      |
|          | 517±2μs              | 516±3μs             | 1.00    | import_iris.Iris.time_fileformats_rules                                                              |
|          | 221±3μs              | 219±2μs             | 0.99    | import_iris.Iris.time_fileformats_structured_array_identification                                    |
|          | 84.0±0.9μs           | 85.2±0.5μs          | 1.01    | import_iris.Iris.time_fileformats_um                                                                 |
|          | 165±3μs              | 162±0.9μs           | 0.98    | import_iris.Iris.time_fileformats_um__fast_load                                                      |
|          | 139±1μs              | 139±0.9μs           | 1.00    | import_iris.Iris.time_fileformats_um__fast_load_structured_fields                                    |
|          | 76.8±1μs             | 78.1±0.5μs          | 1.02    | import_iris.Iris.time_fileformats_um__ff_replacement                                                 |
|          | 82.7±0.8μs           | 84.3±1μs            | 1.02    | import_iris.Iris.time_fileformats_um__optimal_array_structuring                                      |
|          | 990±6μs              | 992±4μs             | 1.00    | import_iris.Iris.time_fileformats_um_cf_map                                                          |
|          | 138±0.6μs            | 139±0.9μs           | 1.01    | import_iris.Iris.time_io                                                                             |
|          | 174±2μs              | 172±2μs             | 0.99    | import_iris.Iris.time_io_format_picker                                                               |
|          | 203±0.9μs            | 205±3μs             | 1.01    | import_iris.Iris.time_iris                                                                           |
|          | 130±2μs              | 130±2μs             | 1.00    | import_iris.Iris.time_iterate                                                                        |
|          | 8.53±0.07ms          | 8.50±0.05ms         | 1.00    | import_iris.Iris.time_palette                                                                        |
|          | 343±3μs              | 345±3μs             | 1.01    | import_iris.Iris.time_plot                                                                           |
|          | 106±2μs              | 107±1μs             | 1.01    | import_iris.Iris.time_quickplot                                                                      |
|          | 2.14±0.01ms          | 2.15±0.01ms         | 1.01    | import_iris.Iris.time_std_names                                                                      |
|          | 1.76±0.01ms          | 1.77±0.02ms         | 1.01    | import_iris.Iris.time_symbols                                                                        |
|          | 96.1±0.8ms           | 94.9±0.7ms          | 0.99    | import_iris.Iris.time_tests                                                                          |
|          | 234±2μs              | 229±3μs             | 0.98    | import_iris.Iris.time_third_party_cartopy                                                            |
|          | 4.85±0.02ms          | 4.85±0.03ms         | 1.00    | import_iris.Iris.time_third_party_cf_units                                                           |
|          | 108±1μs              | 108±2μs             | 1.00    | import_iris.Iris.time_third_party_cftime                                                             |
|          | 2.86±0.08ms          | 2.81±0.01ms         | 0.98    | import_iris.Iris.time_third_party_matplotlib                                                         |
|          | 1.06±0.01ms          | 1.05±0.01ms         | 0.99    | import_iris.Iris.time_third_party_numpy                                                              |
|          | 161±3μs              | 161±1μs             | 1.00    | import_iris.Iris.time_third_party_scipy                                                              |
|          | 102±1μs              | 102±0.8μs           | 0.99    | import_iris.Iris.time_time                                                                           |
|          | 320±3μs              | 323±1μs             | 1.01    | import_iris.Iris.time_util                                                                           |
|          | 74.1±0.9μs           | 75.6±1μs            | 1.02    | iterate.IZip.time_izip                                                                               |
|          | 8.11±0.1ms           | 8.17±0.1ms          | 1.01    | load.LoadAndRealise.time_load((1280, 960, 5), False, 'FF')                                           |
|          | 24.3±0.4ms           | 24.3±0.6ms          | 1.00    | load.LoadAndRealise.time_load((1280, 960, 5), False, 'NetCDF')                                       |
|          | 8.90±0.09ms          | 8.96±0.1ms          | 1.01    | load.LoadAndRealise.time_load((1280, 960, 5), False, 'PP')                                           |
|          | 8.17±0.1ms           | 8.08±0.06ms         | 0.99    | load.LoadAndRealise.time_load((1280, 960, 5), True, 'FF')                                            |
|          | 21.2±0.3ms           | 21.2±0.3ms          | 1.00    | load.LoadAndRealise.time_load((1280, 960, 5), True, 'NetCDF')                                        |
|          | 8.90±0.08ms          | 8.87±0.1ms          | 1.00    | load.LoadAndRealise.time_load((1280, 960, 5), True, 'PP')                                            |
|          | 1.37±0.01s           | 1.36±0.01s          | 0.99    | load.LoadAndRealise.time_load((2, 2, 1000), False, 'FF')                                             |
|          | 20.9±0.5ms           | 20.9±0.2ms          | 1.00    | load.LoadAndRealise.time_load((2, 2, 1000), False, 'NetCDF')                                         |
|          | 1.53±0.02s           | 1.51±0.02s          | 0.99    | load.LoadAndRealise.time_load((2, 2, 1000), False, 'PP')                                             |
|          | 1.34±0.01s           | 1.37±0.01s          | 1.02    | load.LoadAndRealise.time_load((2, 2, 1000), True, 'FF')                                              |
|          | 20.9±0.2ms           | 20.8±0.3ms          | 0.99    | load.LoadAndRealise.time_load((2, 2, 1000), True, 'NetCDF')                                          |
|          | 1.52±0.02s           | 1.52±0.01s          | 1.00    | load.LoadAndRealise.time_load((2, 2, 1000), True, 'PP')                                              |
|          | 3.90±0.03ms          | 3.95±0.03ms         | 1.01    | load.LoadAndRealise.time_load((50, 50, 2), False, 'FF')                                              |
|          | 19.9±0.3ms           | 19.5±0.2ms          | 0.98    | load.LoadAndRealise.time_load((50, 50, 2), False, 'NetCDF')                                          |
|          | 4.18±0.01ms          | 4.18±0.05ms         | 1.00    | load.LoadAndRealise.time_load((50, 50, 2), False, 'PP')                                              |
|          | 3.92±0.03ms          | 3.93±0.03ms         | 1.00    | load.LoadAndRealise.time_load((50, 50, 2), True, 'FF')                                               |
|          | 19.9±0.2ms           | 19.6±0.2ms          | 0.98    | load.LoadAndRealise.time_load((50, 50, 2), True, 'NetCDF')                                           |
|          | 4.17±0.02ms          | 4.21±0.06ms         | 1.01    | load.LoadAndRealise.time_load((50, 50, 2), True, 'PP')                                               |
|          | 31.1±2ms             | 32.2±2ms            | 1.03    | load.LoadAndRealise.time_realise((1280, 960, 5), False, 'FF')                                        |
|          | 19.7±0.3ms           | 19.6±0.5ms          | 0.99    | load.LoadAndRealise.time_realise((1280, 960, 5), False, 'NetCDF')                                    |
|          | 13.3±1ms             | 13.2±2ms            | 0.99    | load.LoadAndRealise.time_realise((1280, 960, 5), False, 'PP')                                        |
|          | 25.7±1ms             | 25.8±1ms            | 1.01    | load.LoadAndRealise.time_realise((1280, 960, 5), True, 'FF')                                         |
|          | 70.6±2ms             | 70.3±2ms            | 1.00    | load.LoadAndRealise.time_realise((1280, 960, 5), True, 'NetCDF')                                     |
|          | 26.0±2ms             | 25.8±0.7ms          | 0.99    | load.LoadAndRealise.time_realise((1280, 960, 5), True, 'PP')                                         |
|          | 447±2ms              | 445±4ms             | 1.00    | load.LoadAndRealise.time_realise((2, 2, 1000), False, 'FF')                                          |
|          | 2.90±0.1ms           | 2.86±0.06ms         | 0.98    | load.LoadAndRealise.time_realise((2, 2, 1000), False, 'NetCDF')                                      |
|          | 450±3ms              | 451±5ms             | 1.00    | load.LoadAndRealise.time_realise((2, 2, 1000), False, 'PP')                                          |
|          | 453±1ms              | 452±6ms             | 1.00    | load.LoadAndRealise.time_realise((2, 2, 1000), True, 'FF')                                           |
|          | 2.93±0.1ms           | 2.84±0.1ms          | 0.97    | load.LoadAndRealise.time_realise((2, 2, 1000), True, 'NetCDF')                                       |
|          | 456±3ms              | 457±4ms             | 1.00    | load.LoadAndRealise.time_realise((2, 2, 1000), True, 'PP')                                           |
|          | 1.47±0.08ms          | 1.51±0.07ms         | 1.03    | load.LoadAndRealise.time_realise((50, 50, 2), False, 'FF')                                           |
|          | 2.94±0.1ms           | 2.82±0.04ms         | 0.96    | load.LoadAndRealise.time_realise((50, 50, 2), False, 'NetCDF')                                       |
|          | 1.55±0.05ms          | 1.54±0.08ms         | 0.99    | load.LoadAndRealise.time_realise((50, 50, 2), False, 'PP')                                           |
|          | 1.57±0.06ms          | 1.56±0.06ms         | 1.00    | load.LoadAndRealise.time_realise((50, 50, 2), True, 'FF')                                            |
|          | 2.88±0.1ms           | 2.94±0.1ms          | 1.02    | load.LoadAndRealise.time_realise((50, 50, 2), True, 'NetCDF')                                        |
|          | 1.58±0.05ms          | 1.60±0.08ms         | 1.02    | load.LoadAndRealise.time_realise((50, 50, 2), True, 'PP')                                            |
|          | 355±2ms              | 354±3ms             | 1.00    | load.ManyVars.time_many_var_load                                                                     |
|          | 8.20±0.08ms          | 8.25±0.09ms         | 1.01    | load.STASHConstraint.time_stash_constraint((1280, 960, 5), 'FF')                                     |
|          | 9.03±0.2ms           | 9.06±0.1ms          | 1.00    | load.STASHConstraint.time_stash_constraint((1280, 960, 5), 'PP')                                     |
|          | 1.37±0.01s           | 1.38±0.02s          | 1.01    | load.STASHConstraint.time_stash_constraint((2, 2, 1000), 'FF')                                       |
|          | 1.54±0.01s           | 1.53±0.01s          | 0.99    | load.STASHConstraint.time_stash_constraint((2, 2, 1000), 'PP')                                       |
|          | 3.94±0.04ms          | 3.99±0.03ms         | 1.01    | load.STASHConstraint.time_stash_constraint((2, 2, 2), 'FF')                                          |
|          | 4.26±0.04ms          | 4.28±0.03ms         | 1.00    | load.STASHConstraint.time_stash_constraint((2, 2, 2), 'PP')                                          |
|          | 8.04±0.1ms           | 8.18±0.08ms         | 1.02    | load.StructuredFF.time_structured_load((1280, 960, 5), False)                                        |
|          | 4.79±0.03ms          | 4.81±0.07ms         | 1.00    | load.StructuredFF.time_structured_load((1280, 960, 5), True)                                         |
|          | 1.36±0.02s           | 1.35±0.01s          | 1.00    | load.StructuredFF.time_structured_load((2, 2, 1000), False)                                          |
|          | 379±7ms              | 376±6ms             | 0.99    | load.StructuredFF.time_structured_load((2, 2, 1000), True)                                           |
|          | 3.90±0.03ms          | 3.92±0.03ms         | 1.00    | load.StructuredFF.time_structured_load((2, 2, 2), False)                                             |
|          | 3.55±0.02ms          | 3.56±0.02ms         | 1.00    | load.StructuredFF.time_structured_load((2, 2, 2), True)                                              |
|          | 150±0.7ms            | 152±1ms             | 1.02    | load.TimeConstraint.time_time_constraint(20, 'FF')                                                   |
|          | 23.2±0.3ms           | 23.5±0.2ms          | 1.01    | load.TimeConstraint.time_time_constraint(20, 'NetCDF')                                               |
|          | 168±2ms              | 167±1ms             | 1.00    | load.TimeConstraint.time_time_constraint(20, 'PP')                                                   |
|          | 29.8±0.4ms           | 29.8±0.1ms          | 1.00    | load.TimeConstraint.time_time_constraint(3, 'FF')                                                    |
|          | 22.7±0.1ms           | 22.8±0.2ms          | 1.00    | load.TimeConstraint.time_time_constraint(3, 'NetCDF')                                                |
|          | 32.1±0.4ms           | 32.0±0.1ms          | 1.00    | load.TimeConstraint.time_time_constraint(3, 'PP')                                                    |
|          | 17.4±0.3ms           | 17.2±0.2ms          | 0.98    | load.ugrid.BasicLoading.time_load_file(1)                                                            |
|          | 41.6±0.3ms           | 41.9±0.5ms          | 1.01    | load.ugrid.BasicLoading.time_load_file(200000)                                                       |
|          | 14.2±0.3ms           | 14.0±0.4ms          | 0.99    | load.ugrid.BasicLoading.time_load_mesh(1)                                                            |
|          | 22.5±0.6ms           | 22.5±0.5ms          | 1.00    | load.ugrid.BasicLoading.time_load_mesh(200000)                                                       |
|          | 17.6±0.6ms           | 17.8±0.3ms          | 1.01    | load.ugrid.BasicLoadingTime.time_load_file(1)                                                        |
|          | 20.8±0.6ms           | 20.2±0.3ms          | 0.97    | load.ugrid.BasicLoadingTime.time_load_file(200000)                                                   |
|          | 14.4±0.2ms           | 14.1±0.3ms          | 0.98    | load.ugrid.BasicLoadingTime.time_load_mesh(1)                                                        |
|          | 17.3±0.5ms           | 17.4±0.4ms          | 1.00    | load.ugrid.BasicLoadingTime.time_load_mesh(200000)                                                   |
|          | 18.9±0.4ms           | 18.8±0.4ms          | 0.99    | load.ugrid.Callback.time_load_file_callback(1)                                                       |
|          | 51.4±0.4ms           | 50.7±1ms            | 0.99    | load.ugrid.Callback.time_load_file_callback(200000)                                                  |
|          | 18.5±0.3ms           | 18.2±0.2ms          | 0.98    | load.ugrid.CallbackTime.time_load_file_callback(1)                                                   |
|          | 22.6±0.4ms           | 22.0±0.4ms          | 0.97    | load.ugrid.CallbackTime.time_load_file_callback(200000)                                              |
|          | 2.88±0.1ms           | 2.80±0.2ms          | 0.97    | load.ugrid.DataRealisation.time_realise_data(10000)                                                  |
|          | 5.48±0.7ms           | 4.11±1ms            | ~0.75   | load.ugrid.DataRealisation.time_realise_data(200000)                                                 |
|          | 38.9±1ms             | 38.0±0.8ms          | 0.98    | load.ugrid.DataRealisationTime.time_realise_data(10000)                                              |
|          | 802±4ms              | 803±9ms             | 1.00    | load.ugrid.DataRealisationTime.time_realise_data(200000)                                             |
|          | 186±3ms              | 186±1ms             | 1.00    | merge_concat.Concatenate.time_concatenate                                                            |
|          | 226.9                | 226.9               | 1.00    | merge_concat.Concatenate.track_mem_merge                                                             |
|          | 47.6±0.6ms           | 47.2±0.8ms          | 0.99    | merge_concat.Merge.time_merge                                                                        |
|          | 10.8                 | 10.8                | 1.00    | merge_concat.Merge.track_mem_merge                                                                   |
|          | 6.60±0.03ms          | 6.57±0.05ms         | 0.99    | plot.AuxSort.time_aux_sort                                                                           |
|          | 77.6±4ms             | 78.2±3ms            | 1.01    | regridding.CurvilinearRegridding.time_regrid_pic                                                     |
|          | 144.8                | 144.8               | 1.00    | regridding.CurvilinearRegridding.track_mem_regrid_pic                                                |
|          | 98.0±0.7ms           | 99.0±0.8ms          | 1.01    | regridding.HorizontalChunkedRegridding.time_regrid_area_w                                            |
|          | 49.2±2ms             | 49.6±2ms            | 1.01    | regridding.HorizontalChunkedRegridding.time_regrid_area_w_new_grid                                   |
|          | 111.5                | 111.5               | 1.00    | regridding.HorizontalChunkedRegridding.track_mem_regrid_area_w                                       |
|          | 151.6                | 151.6               | 1.00    | regridding.HorizontalChunkedRegridding.track_mem_regrid_area_w_new_grid                              |
|          | 4.07±0.05ms          | 4.04±0.04ms         | 0.99    | save.NetcdfSave.time_netcdf_save_cube(50, False)                                                     |
|          | 72.1±0.9ms           | 72.6±1ms            | 1.01    | save.NetcdfSave.time_netcdf_save_cube(50, True)                                                      |
|          | 52.0±0.9ms           | 52.8±0.6ms          | 1.01    | save.NetcdfSave.time_netcdf_save_cube(600, False)                                                    |
|          | 568±5ms              | 569±4ms             | 1.00    | save.NetcdfSave.time_netcdf_save_cube(600, True)                                                     |
|          | 90.3±0.9ns           | 91.4±7ns            | 1.01    | save.NetcdfSave.time_netcdf_save_mesh(50, False)                                                     |
|          | 55.5±0.8ms           | 55.6±1ms            | 1.00    | save.NetcdfSave.time_netcdf_save_mesh(50, True)                                                      |
|          | 89.3±0.5ns           | 90.6±6ns            | 1.01    | save.NetcdfSave.time_netcdf_save_mesh(600, False)                                                    |
|          | 502±2ms              | 503±5ms             | 1.00    | save.NetcdfSave.time_netcdf_save_mesh(600, True)                                                     |
|          | 42.7±1ms             | 43.5±1ms            | 1.02    | stats.PearsonR.time_lazy                                                                             |
|          | 19.0±0.3ms           | 19.0±0.5ms          | 1.00    | stats.PearsonR.time_real                                                                             |
|          | 19.4                 | 19.4                | 1.00    | stats.PearsonR.track_lazy                                                                            |
|          | 17.8                 | 17.8                | 1.00    | stats.PearsonR.track_real                                                                            |
|          | 22.8±1ms             | 22.6±1ms            | 0.99    | trajectory.TrajectoryInterpolation.time_trajectory_linear                                            |
|          | 58.5±0.5ms           | 58.6±0.6ms          | 1.00    | trajectory.TrajectoryInterpolation.time_trajectory_nearest                                           |
|          | 32.1                 | 32.1                | 1.00    | trajectory.TrajectoryInterpolation.track_trajectory_linear                                           |
|          | 21.5                 | 21.5                | 1.00    | trajectory.TrajectoryInterpolation.track_trajectory_nearest                                          |

Generated by GHA run 9157401305

Copy link
Member

@pp-mo pp-mo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some questions did occur to me about whether we are measuring the right things,
but I think this is a good start, so let's merge + see if the results prove stable enough to use as regular checks for performance changes.

@pp-mo
Copy link
Member

pp-mo commented May 21, 2024

Note: I added a test of an operation like this to the evaluation branch for, and results are interesting
From that it would seem that we are measuring somehow the effects of memory allocations, not simply the original requests,
because

  1. results show variation with repeat runs, and
  2. repeated operations which regularly increase allocated space don't see a regular rise in the measured peak memory.

@pp-mo pp-mo merged commit 22c98e8 into SciTools:main May 21, 2024
21 checks passed
stephenworsley added a commit to stephenworsley/iris that referenced this pull request Jun 10, 2024
* main: (759 commits)
  Bump scitools/workflows from 2024.05.1 to 2024.06.0 (SciTools#5986)
  [pre-commit.ci] pre-commit autoupdate (SciTools#5980)
  Updated environment lockfiles (SciTools#5983)
  Bump scitools/workflows from 2024.05.0 to 2024.05.1 (SciTools#5984)
  Make `slices_over` tests go faster (SciTools#5973)
  Updated environment lockfiles (SciTools#5979)
  Update lock files with associated fixes (SciTools#5953)
  List 25 slowest tests (SciTools#5969)
  used a note to highlight some text (SciTools#5971)
  Lazy `iris.cube.Cube.rolling_window` (SciTools#5795)
  Add memory benchmarks (SciTools#5960)
  Whatsnew for several benchmark developments. (SciTools#5961)
  Remove "on-demand" from some benchmarks (SciTools#5959)
  Add bm_runner 'trialrun' subcommand. (SciTools#5957)
  Automatically install iris-test-data for benchmark data generation (SciTools#5958)
  Added benchmarks for collapse and aggregate (SciTools#5954)
  Use tracemalloc for memory measurements. (SciTools#5948)
  Provide a Nox `benchmarks` session as the recommended entry point (SciTools#5951)
  [pre-commit.ci] pre-commit autoupdate (SciTools#5952)
  Remove unit benchmarks (SciTools#5949)
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
benchmark_this Request that this pull request be benchmarked to check if it introduces performance shifts
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants