feat: 2d augmentation benchmarks #2853

Merged
merged 26 commits into from
Mar 27, 2024

johnnv1
Member

@johnnv1 johnnv1 commented Mar 23, 2024

  • added the 2D augmentation benchmarks
  • set up the optimizer backend as a pytest option, so we can select which backend to use, e.g. inductor, onnxrt, or eager within torch.compile, or JIT itself via torch.jit.script
  • added a Docker image for the benchmarks
  • fixes to the augmentation docs
  • fixes to planckian_jitter, related to "RandomPlanckianJitter incorrect values on gpu other than 0" (#2791)
  • documented the benchmarks in the contributing guide
  • added CPU and GPU info to the pytest header
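
For readers unfamiliar with how a pytest option can select the optimizer backend, the following is a minimal sketch and not the code merged in this PR. The option name `--optimizer` matches the command shown in this thread; the default value and the helper names `parse_backends` and `apply_optimizer` are assumptions made for illustration. `pytest_addoption` and `pytest_report_header` are standard pytest hooks, and `torch` is imported lazily so the file loads even where it is not installed.

```python
# conftest.py (illustrative sketch; helper names and the default are assumptions)
import platform


def pytest_addoption(parser):
    # Comma-separated list of backends, e.g. --optimizer=inductor,eager
    parser.addoption(
        "--optimizer",
        action="store",
        default="inductor",  # assumed default, not confirmed by the PR
        help="Comma-separated optimizer backends to benchmark.",
    )


def parse_backends(raw):
    """Split 'inductor,eager' into ['inductor', 'eager'], dropping blanks."""
    return [name.strip() for name in raw.split(",") if name.strip()]


def apply_optimizer(op, backend):
    """Wrap `op` with the requested backend, or return it unchanged."""
    if backend in (None, ""):
        return op  # no optimization requested
    import torch  # imported lazily, only when a backend is actually used

    if backend == "jit":
        return torch.jit.script(op)
    # Any other name is passed through as a torch.compile backend.
    return torch.compile(op, backend=backend)


def pytest_report_header(config):
    # Extra lines printed at the top of the pytest session.
    cpu = platform.processor() or "unknown"
    return [f"cpu info: {cpu} ({platform.machine()})"]
```

A benchmark could then be parametrized over `parse_backends(config.getoption("--optimizer"))` and call `apply_optimizer` once per case before timing.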

To run all benchmarks with the default configs, you can use

$ make benchmark-docker

which builds a Docker image based on PyTorch with CUDA 12.1 and runs the benchmark suite.
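
As a rough illustration of what the make target ends up running, the sketch below rebuilds the pytest invocation that appears in the results log in this thread from two inputs, the backends and the extra options. The function name `build_pytest_command` is hypothetical; the flags are copied from that log, and how the Makefile actually assembles them is an assumption.

```python
import shlex


def build_pytest_command(backends="inductor,eager", opts="--device=cuda -vv"):
    """Assemble the benchmark command as a list of arguments."""
    cmd = [
        "pytest",
        "benchmarks/",
        "--benchmark-warmup=on",
        "--benchmark-warmup-iterations=100",
        "--benchmark-calibration-precision=10",
        "--benchmark-group-by=func",
        f"--optimizer={backends}",
    ]
    # Extra options arrive as one shell-style string, so split them safely.
    cmd += shlex.split(opts)
    return cmd


print(" ".join(build_pytest_command()))
```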

@johnnv1 johnnv1 added the code heatlh 💊 Improvement the package code health label Mar 23, 2024
@johnnv1
Member Author

johnnv1 commented Mar 23, 2024

Results on my machine:

Augmentation input shapes are given in the order B, C, H, W. For example, test_aug_2d_affine[5-1-128-128-cuda-float32-inductor] has B=5, C=1, H=W=128.
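
To make the naming scheme concrete, here is a small sketch that splits a benchmark ID into its parts. It assumes every ID has the form `name[B-C-H-W-device-dtype-backend]`, as in the results in this comment; `BenchCase` and `parse_case` are hypothetical names, not part of kornia or pytest-benchmark.

```python
import re
from typing import NamedTuple


class BenchCase(NamedTuple):
    name: str
    batch: int
    channels: int
    height: int
    width: int
    device: str
    dtype: str
    backend: str


# Test function name followed by the bracketed, dash-separated parameters.
_ID = re.compile(r"^(?P<name>\w+)\[(?P<params>[^\]]+)\]$")


def parse_case(test_id):
    """Parse 'test_x[B-C-H-W-device-dtype-backend]' into a BenchCase."""
    match = _ID.match(test_id)
    if match is None:
        raise ValueError(f"unrecognized benchmark id: {test_id!r}")
    b, c, h, w, device, dtype, backend = match["params"].split("-")
    return BenchCase(match["name"], int(b), int(c), int(h), int(w), device, dtype, backend)


case = parse_case("test_aug_2d_affine[5-1-128-128-cuda-float32-inductor]")
print(case.batch, case.channels, case.height, case.width, case.backend)
```

Grouping parsed cases by `(name, batch, channels)` is one way to line up the inductor and eager rows of the same configuration when comparing results.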

$ make benchmark-docker BENCHMARK_BACKENDS='inductor,eager' BENCHMARK_OPTS='--device=cuda -vv'
docker run -e "TERM=xterm-256color" \
		   -e "BACKENDS=inductor,eager" \
		   -e "OPTS=--device=cuda -vv" \
		   --gpus all\
		   -it kornia-benchmark:latest
# We want to always run within warmup because torch optimizer backend
pytest benchmarks/ --benchmark-warmup=on --benchmark-warmup-iterations=100 --benchmark-calibration-precision=10 --benchmark-group-by=func --optimizer=inductor,eager --device=cuda -vv 
Setting up torch compile...
====================================================================================================== test session starts =======================================================================================================
platform linux -- Python 3.10.13, pytest-8.1.1, pluggy-1.4.0 -- /opt/conda/bin/python
cachedir: .pytest_cache
hypothesis profile 'default' -> database=DirectoryBasedExampleDatabase(PosixPath('/kornia-benchmarks/.hypothesis/examples'))
benchmark: 4.0.0 (defaults: timer=time.perf_counter disable_gc=False min_rounds=5 min_time=0.000005 max_time=1.0 calibration_precision=10 warmup=True warmup_iterations=100)

cpu info:
    - Model name: AMD Ryzen 5 5600X 6-Core Processor
    - Architecture: x86_64
    - CPU(s): 12
    - Thread(s) per core: 2
    - CPU max MHz: 4650.2920
    - CPU min MHz: 2200.0000
gpu info: {'GPU 0': 'NVIDIA GeForce RTX 3060 Ti'}
main deps:
    - kornia-0.7.3-dev
    - torch-2.2.1
        - commit: 6c8c5ad5eaf47a62fafbb4a2747198cbffbf1ff0
        - cuda: 12.1
        - nvidia-driver: 525.147.05
x deps:
    - accelerate-0.28.0
dev deps:
    - kornia_rs-0.1.2
    - onnx-1.15.0
gcc info: (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0
available optimizers: {'', 'jit', 'openxla', 'onnxrt', 'inductor', 'cudagraphs', None, 'openxla_eval', 'tvm'}

rootdir: /kornia-benchmarks
plugins: hypothesis-6.98.10, timeout-2.3.1, benchmark-4.0.0
test_aug_2d_affine
-------------------------------------------------------------------------------------- benchmark 'test_aug_2d_affine': 8 tests ---------------------------------------------------------------------------------------
Name (time in ms)                                            Min               Max              Mean            StdDev            Median               IQR            Outliers       OPS            Rounds  Iterations
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_affine[1-1-128-128-cuda-float32-inductor]     1.0362 (1.0)      1.0552 (1.0)      1.0471 (1.0)      0.0084 (1.0)      1.0506 (1.0)      0.0146 (1.34)          1;0  954.9922 (1.0)           5           1
test_aug_2d_affine[1-3-128-128-cuda-float32-inductor]     1.0939 (1.06)     1.1263 (1.07)     1.1076 (1.06)     0.0121 (1.44)     1.1086 (1.06)     0.0142 (1.30)          2;0  902.8180 (0.95)          5           1
test_aug_2d_affine[5-1-128-128-cuda-float32-inductor]     1.1084 (1.07)     1.1817 (1.12)     1.1307 (1.08)     0.0300 (3.56)     1.1177 (1.06)     0.0331 (3.04)          1;0  884.4396 (0.93)          5           1
test_aug_2d_affine[5-3-128-128-cuda-float32-inductor]     1.1422 (1.10)     1.1656 (1.10)     1.1512 (1.10)     0.0118 (1.41)     1.1429 (1.09)     0.0209 (1.92)          1;0  868.6218 (0.91)          5           1
test_aug_2d_affine[5-1-128-128-cuda-float32-eager]        1.6639 (1.61)     2.2153 (2.10)     1.6827 (1.61)     0.0253 (3.00)     1.6800 (1.60)     0.0113 (1.04)         9;11  594.2927 (0.62)        600           1
test_aug_2d_affine[1-1-128-128-cuda-float32-eager]        1.6705 (1.61)     2.9471 (2.79)     1.7555 (1.68)     0.0881 (10.45)    1.7759 (1.69)     0.0981 (9.01)         19;9  569.6273 (0.60)        597           1
test_aug_2d_affine[1-3-128-128-cuda-float32-eager]        1.6724 (1.61)     2.2103 (2.09)     1.6922 (1.62)     0.0267 (3.17)     1.6888 (1.61)     0.0109 (1.0)         13;20  590.9621 (0.62)        597           1
test_aug_2d_affine[5-3-128-128-cuda-float32-eager]        1.7584 (1.70)     2.0057 (1.90)     1.7792 (1.70)     0.0188 (2.23)     1.7765 (1.69)     0.0121 (1.11)        34;22  562.0379 (0.59)        594           1
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_auto_contrast
---------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_auto_contrast': 8 tests ----------------------------------------------------------------------------------------------
Name (time in us)                                                     Min                   Max                Mean             StdDev              Median               IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_auto_contrast[1-3-128-128-cuda-float32-inductor]     352.8110 (1.0)        399.9600 (1.0)      362.6452 (1.0)       4.9935 (1.0)      361.2710 (1.0)      4.6495 (1.09)      552;210        2.7575 (1.0)        2748           1
test_aug_2d_auto_contrast[1-1-128-128-cuda-float32-inductor]     355.1600 (1.01)       417.1000 (1.04)     365.2143 (1.01)      5.1676 (1.03)     364.0255 (1.01)     4.2610 (1.0)       565;223        2.7381 (0.99)       2698           1
test_aug_2d_auto_contrast[5-3-128-128-cuda-float32-inductor]     356.4710 (1.01)       446.6700 (1.12)     367.0811 (1.01)      6.3651 (1.27)     365.7455 (1.01)     6.2590 (1.47)      573;136        2.7242 (0.99)       2694           1
test_aug_2d_auto_contrast[5-1-128-128-cuda-float32-inductor]     359.1300 (1.02)       409.6400 (1.02)     368.3896 (1.02)      5.0699 (1.02)     367.1600 (1.02)     4.4725 (1.05)      499;192        2.7145 (0.98)       2655           1
test_aug_2d_auto_contrast[5-1-128-128-cuda-float32-eager]        439.1610 (1.24)       545.4610 (1.36)     449.8343 (1.24)      7.0840 (1.42)     448.2705 (1.24)     5.7000 (1.34)      309;135        2.2230 (0.81)       2210           1
test_aug_2d_auto_contrast[1-1-128-128-cuda-float32-eager]        439.5910 (1.25)     1,310.0810 (3.28)     452.3025 (1.25)     39.0705 (7.82)     448.8055 (1.24)     6.0900 (1.43)       11;110        2.2109 (0.80)       2222           1
test_aug_2d_auto_contrast[1-3-128-128-cuda-float32-eager]        442.4500 (1.25)       505.6910 (1.26)     454.0052 (1.25)      6.2744 (1.26)     452.7355 (1.25)     6.9245 (1.63)       514;69        2.2026 (0.80)       2224           1
test_aug_2d_auto_contrast[5-3-128-128-cuda-float32-eager]        443.8800 (1.26)       600.3010 (1.50)     471.5924 (1.30)     13.4934 (2.70)     473.1105 (1.31)     7.9390 (1.86)      597;498        2.1205 (0.77)       2246           1
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_box_blur
--------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_box_blur': 8 tests ----------------------------------------------------------------------------------------------
Name (time in us)                                                Min                 Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_box_blur[1-1-128-128-cuda-float32-inductor]     470.1600 (1.0)      665.1310 (1.21)     486.5504 (1.0)      13.7950 (2.17)     482.7610 (1.0)       9.3117 (1.46)      248;220        2.0553 (1.0)        2081           1
test_aug_2d_box_blur[1-3-128-128-cuda-float32-inductor]     475.9400 (1.01)     549.5610 (1.0)      488.5753 (1.00)      7.1695 (1.13)     487.4700 (1.01)      8.9550 (1.40)       540;46        2.0468 (1.00)       2036           1
test_aug_2d_box_blur[5-3-128-128-cuda-float32-inductor]     478.4910 (1.02)     569.3600 (1.04)     496.2117 (1.02)      9.7617 (1.53)     496.4910 (1.03)     15.3360 (2.40)        677;8        2.0153 (0.98)       2011           1
test_aug_2d_box_blur[5-1-128-128-cuda-float32-inductor]     480.4910 (1.02)     550.8000 (1.00)     491.4792 (1.01)      6.3695 (1.0)      490.0610 (1.02)      6.4900 (1.02)      411;100        2.0347 (0.99)       2057           1
test_aug_2d_box_blur[5-3-128-128-cuda-float32-eager]        519.6210 (1.11)     757.5110 (1.38)     531.2681 (1.09)      9.5616 (1.50)     529.3910 (1.10)      6.4525 (1.01)       152;90        1.8823 (0.92)       1901           1
test_aug_2d_box_blur[1-3-128-128-cuda-float32-eager]        522.4210 (1.11)     925.9510 (1.68)     533.5152 (1.10)     11.0332 (1.73)     532.1155 (1.10)      6.3905 (1.0)         97;77        1.8744 (0.91)       1892           1
test_aug_2d_box_blur[1-1-128-128-cuda-float32-eager]        547.7210 (1.16)     689.7210 (1.26)     560.3062 (1.15)      7.8394 (1.23)     558.9760 (1.16)      7.1890 (1.12)       289;70        1.7847 (0.87)       1804           1
test_aug_2d_box_blur[5-1-128-128-cuda-float32-eager]        551.1210 (1.17)     931.7510 (1.70)     563.4893 (1.16)     17.0661 (2.68)     560.6410 (1.16)      6.8128 (1.07)        35;75        1.7747 (0.86)       1793           1
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_brightness
--------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_brightness': 8 tests ----------------------------------------------------------------------------------------------
Name (time in us)                                                  Min                 Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_brightness[1-3-128-128-cuda-float32-inductor]     386.3500 (1.0)      523.0100 (1.21)     399.1027 (1.00)      6.6366 (1.47)     397.4900 (1.00)      7.7385 (1.39)       549;73        2.5056 (1.00)       2471           1
test_aug_2d_brightness[1-1-128-128-cuda-float32-inductor]     388.9910 (1.01)     502.3210 (1.16)     398.7394 (1.0)       6.8615 (1.52)     397.0300 (1.0)       5.5690 (1.0)       357;138        2.5079 (1.0)        2474           1
test_aug_2d_brightness[5-1-128-128-cuda-float32-inductor]     394.1910 (1.02)     560.9610 (1.30)     418.8390 (1.05)     16.3468 (3.62)     414.9505 (1.05)     26.8545 (4.82)       1006;4        2.3876 (0.95)       2500           1
test_aug_2d_brightness[5-3-128-128-cuda-float32-inductor]     416.5000 (1.08)     432.3810 (1.0)      422.3284 (1.06)      4.5100 (1.0)      422.0955 (1.06)      6.6600 (1.20)          5;0        2.3678 (0.94)         14           1
test_aug_2d_brightness[1-3-128-128-cuda-float32-eager]        449.2600 (1.16)     888.9410 (2.06)     462.3992 (1.16)     14.1878 (3.15)     460.4510 (1.16)      6.5885 (1.18)       92;120        2.1626 (0.86)       2187           1
test_aug_2d_brightness[1-1-128-128-cuda-float32-eager]        449.2910 (1.16)     823.3210 (1.90)     459.9572 (1.15)     10.1036 (2.24)     458.6105 (1.16)      5.9190 (1.06)        91;76        2.1741 (0.87)       2194           1
test_aug_2d_brightness[5-3-128-128-cuda-float32-eager]        453.8910 (1.17)     695.2910 (1.61)     465.0721 (1.17)      9.2622 (2.05)     463.6655 (1.17)      6.3895 (1.15)       142;79        2.1502 (0.86)       2164           1
test_aug_2d_brightness[5-1-128-128-cuda-float32-eager]        458.6100 (1.19)     709.1310 (1.64)     468.4767 (1.17)      8.5279 (1.89)     466.8355 (1.18)      5.8210 (1.05)      167;102        2.1346 (0.85)       2154           1
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_centercrop
-------------------------------------------------------------------------------------- benchmark 'test_aug_2d_centercrop': 8 tests ---------------------------------------------------------------------------------------
Name (time in ms)                                                Min               Max              Mean            StdDev            Median               IQR            Outliers       OPS            Rounds  Iterations
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_centercrop[1-3-128-128-cuda-float32-inductor]     1.3602 (1.0)      2.3254 (1.62)     1.3976 (1.00)     0.0859 (11.80)    1.3794 (1.0)      0.0130 (1.65)        35;62  715.5029 (1.00)        731           1
test_aug_2d_centercrop[1-1-128-128-cuda-float32-inductor]     1.3739 (1.01)     1.4392 (1.0)      1.3947 (1.0)      0.0157 (2.16)     1.3933 (1.01)     0.0177 (2.24)          2;1  716.9829 (1.0)          14           1
test_aug_2d_centercrop[1-3-128-128-cuda-float32-eager]        1.3889 (1.02)     3.1201 (2.17)     1.4280 (1.02)     0.1490 (20.45)    1.4070 (1.02)     0.0111 (1.41)        18;46  700.3014 (0.98)        718           1
test_aug_2d_centercrop[1-1-128-128-cuda-float32-eager]        1.3961 (1.03)     2.1444 (1.49)     1.4250 (1.02)     0.0696 (9.56)     1.4124 (1.02)     0.0115 (1.46)        24;49  701.7680 (0.98)        714           1
test_aug_2d_centercrop[5-3-128-128-cuda-float32-inductor]     1.5950 (1.17)     1.9788 (1.37)     1.6173 (1.16)     0.0235 (3.23)     1.6146 (1.17)     0.0121 (1.53)        11;13  618.3230 (0.86)        626           1
test_aug_2d_centercrop[5-1-128-128-cuda-float32-eager]        1.6117 (1.18)     2.1328 (1.48)     1.6364 (1.17)     0.0254 (3.49)     1.6331 (1.18)     0.0115 (1.46)        19;23  611.0877 (0.85)        619           1
test_aug_2d_centercrop[5-3-128-128-cuda-float32-eager]        1.6122 (1.19)     2.3063 (1.60)     1.6327 (1.17)     0.0310 (4.25)     1.6292 (1.18)     0.0110 (1.40)        12;24  612.4990 (0.85)        622           1
test_aug_2d_centercrop[5-1-128-128-cuda-float32-inductor]     1.6136 (1.19)     1.6419 (1.14)     1.6250 (1.17)     0.0073 (1.0)      1.6234 (1.18)     0.0079 (1.0)           4;1  615.3990 (0.86)         14           1
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_channel_shuffle
----------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_channel_shuffle': 8 tests ----------------------------------------------------------------------------------------------
Name (time in us)                                                       Min                   Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_channel_shuffle[1-1-128-128-cuda-float32-eager]        437.7110 (1.0)        725.3710 (1.18)     448.9741 (1.0)       8.6717 (1.15)     447.6100 (1.0)       6.7023 (1.01)       194;64        2.2273 (1.0)        2249           1
test_aug_2d_channel_shuffle[1-3-128-128-cuda-float32-inductor]     442.9910 (1.01)       688.0500 (1.12)     472.3563 (1.05)     17.8868 (2.38)     465.8610 (1.04)     30.4350 (4.60)        592;5        2.1170 (0.95)       2203           1
test_aug_2d_channel_shuffle[1-1-128-128-cuda-float32-inductor]     444.6910 (1.02)       654.5510 (1.07)     457.6646 (1.02)      8.9167 (1.18)     456.6500 (1.02)      7.6592 (1.16)       270;67        2.1850 (0.98)       2205           1
test_aug_2d_channel_shuffle[1-3-128-128-cuda-float32-eager]        468.9310 (1.07)       613.6410 (1.0)      482.8886 (1.08)     10.7609 (1.43)     479.3500 (1.07)     15.2525 (2.30)       414;13        2.0709 (0.93)       2233           1
test_aug_2d_channel_shuffle[5-3-128-128-cuda-float32-eager]        553.8210 (1.27)       932.9510 (1.52)     571.0053 (1.27)     17.5246 (2.33)     565.0210 (1.26)     10.7647 (1.63)      295;295        1.7513 (0.79)       1789           1
test_aug_2d_channel_shuffle[5-3-128-128-cuda-float32-inductor]     553.9410 (1.27)     1,557.7920 (2.54)     569.5426 (1.27)     33.0471 (4.39)     566.8610 (1.27)      7.9425 (1.20)        13;51        1.7558 (0.79)       1777           1
test_aug_2d_channel_shuffle[5-1-128-128-cuda-float32-eager]        556.1010 (1.27)       974.2420 (1.59)     567.2180 (1.26)     18.0950 (2.40)     564.8810 (1.26)      6.6227 (1.0)         31;69        1.7630 (0.79)       1777           1
test_aug_2d_channel_shuffle[5-1-128-128-cuda-float32-inductor]     572.4210 (1.31)       752.0420 (1.23)     586.2156 (1.31)      7.5247 (1.0)      584.9055 (1.31)      7.0955 (1.07)       294;62        1.7059 (0.77)       1720           1
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_clahe
------------------------------------------------------------------------------------------------ benchmark 'test_aug_2d_clahe': 8 tests -----------------------------------------------------------------------------------------------
Name (time in ms)                                            Min                   Max                  Mean                StdDev             Median                   IQR            Outliers       OPS            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_clahe[1-1-128-128-cuda-float32-eager]         2.9843 (1.0)          3.0263 (1.0)          3.0065 (1.0)          0.0165 (1.0)       3.0082 (1.0)          0.0255 (1.0)           2;0  332.6076 (1.0)           6           1
test_aug_2d_clahe[1-1-128-128-cuda-float32-inductor]      3.3733 (1.13)     1,907.3932 (630.28)     384.2792 (127.81)     851.4466 (>1000.0)   3.4393 (1.14)       476.3029 (>1000.0)       1;1    2.6023 (0.01)          5           1
test_aug_2d_clahe[1-3-128-128-cuda-float32-eager]         5.8785 (1.97)         5.9296 (1.96)         5.9001 (1.96)         0.0198 (1.20)      5.8922 (1.96)         0.0267 (1.05)          2;0  169.4895 (0.51)          5           1
test_aug_2d_clahe[1-3-128-128-cuda-float32-inductor]      7.3677 (2.47)     2,016.8129 (666.44)     645.1075 (214.57)     921.1404 (>1000.0)   7.7232 (2.57)     1,386.4634 (>1000.0)       1;0    1.5501 (0.00)          5           1
test_aug_2d_clahe[5-1-128-128-cuda-float32-eager]         8.7895 (2.95)         9.2868 (3.07)         8.9167 (2.97)         0.2104 (12.75)     8.8158 (2.93)         0.1891 (7.40)          1;1  112.1496 (0.34)          5           1
test_aug_2d_clahe[5-1-128-128-cuda-float32-inductor]     11.6408 (3.90)     2,076.2323 (686.07)     789.9067 (262.73)   1,068.8487 (>1000.0)  11.9640 (3.98)     1,885.9220 (>1000.0)       1;0    1.2660 (0.00)          5           1
test_aug_2d_clahe[5-3-128-128-cuda-float32-eager]        23.1070 (7.74)       154.5860 (51.08)       49.6179 (16.50)       58.6798 (>1000.0)  23.3320 (7.76)        33.4075 (>1000.0)       1;1   20.1540 (0.06)          5           1
test_aug_2d_clahe[5-3-128-128-cuda-float32-inductor]     31.7154 (10.63)    4,999.1252 (>1000.0)  1,466.4112 (487.74)   2,193.4602 (>1000.0)  32.7907 (10.90)    2,895.4862 (>1000.0)       1;0    0.6819 (0.00)          5           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_contrast
---------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_contrast': 8 tests ----------------------------------------------------------------------------------------------
Name (time in us)                                                Min                   Max                Mean             StdDev              Median               IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_contrast[1-3-128-128-cuda-float32-eager]        512.8410 (1.0)        916.8710 (1.32)     527.1680 (1.0)      11.5040 (1.28)     525.7500 (1.00)     7.1875 (1.05)       104;77        1.8969 (1.0)        1915           1
test_aug_2d_contrast[1-1-128-128-cuda-float32-eager]        513.1510 (1.00)       881.4220 (1.26)     528.7108 (1.00)     13.6235 (1.52)     526.9100 (1.00)     8.3543 (1.22)        81;64        1.8914 (1.00)       1911           1
test_aug_2d_contrast[5-3-128-128-cuda-float32-eager]        515.8710 (1.01)       942.5720 (1.35)     529.5411 (1.00)     27.9574 (3.11)     525.6700 (1.0)      6.9258 (1.01)        25;79        1.8884 (1.00)       1905           1
test_aug_2d_contrast[5-1-128-128-cuda-float32-eager]        517.8200 (1.01)       896.3820 (1.29)     530.8324 (1.01)     15.7877 (1.76)     528.7210 (1.01)     6.8398 (1.0)         48;72        1.8838 (0.99)       1893           1
test_aug_2d_contrast[1-3-128-128-cuda-float32-inductor]     522.2110 (1.02)       854.9110 (1.23)     541.4350 (1.03)     12.0789 (1.34)     539.9905 (1.03)     7.0405 (1.03)       124;92        1.8469 (0.97)       1852           1
test_aug_2d_contrast[5-3-128-128-cuda-float32-inductor]     522.7400 (1.02)       939.5810 (1.35)     538.1123 (1.02)     16.5367 (1.84)     536.7110 (1.02)     9.4450 (1.38)        55;48        1.8583 (0.98)       1875           1
test_aug_2d_contrast[1-1-128-128-cuda-float32-inductor]     525.5110 (1.02)       696.8110 (1.0)      539.9615 (1.02)      8.9889 (1.0)      538.8155 (1.03)     8.0990 (1.18)       234;58        1.8520 (0.98)       1870           1
test_aug_2d_contrast[5-1-128-128-cuda-float32-inductor]     527.2410 (1.03)     1,039.1720 (1.49)     543.9219 (1.03)     27.8476 (3.10)     540.2510 (1.03)     8.0200 (1.17)        27;71        1.8385 (0.97)       1841           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_crop
-------------------------------------------------------------------------------------- benchmark 'test_aug_2d_crop': 8 tests ---------------------------------------------------------------------------------------
Name (time in ms)                                          Min               Max              Mean            StdDev            Median               IQR            Outliers       OPS            Rounds  Iterations
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_crop[1-3-128-128-cuda-float32-inductor]     1.5232 (1.0)      1.9031 (1.0)      1.5517 (1.00)     0.0253 (1.02)     1.5462 (1.00)     0.0145 (1.39)        60;58  644.4629 (1.00)        652           1
test_aug_2d_crop[1-1-128-128-cuda-float32-inductor]     1.5235 (1.00)     1.9754 (1.04)     1.5487 (1.0)      0.0247 (1.0)      1.5459 (1.0)      0.0125 (1.20)        15;19  645.7186 (1.0)         651           1
test_aug_2d_crop[5-3-128-128-cuda-float32-inductor]     1.7197 (1.13)     2.2555 (1.19)     1.7834 (1.15)     0.0337 (1.37)     1.7853 (1.15)     0.0213 (2.04)       130;71  560.7189 (0.87)        576           1
test_aug_2d_crop[5-1-128-128-cuda-float32-inductor]     1.7238 (1.13)     2.6545 (1.39)     1.8044 (1.17)     0.0742 (3.01)     1.7927 (1.16)     0.0237 (2.28)       60;114  554.1877 (0.86)        575           1
test_aug_2d_crop[1-1-128-128-cuda-float32-eager]        1.8427 (1.21)     2.3053 (1.21)     1.8650 (1.20)     0.0289 (1.17)     1.8610 (1.20)     0.0104 (1.0)         16;27  536.1948 (0.83)        542           1
test_aug_2d_crop[1-3-128-128-cuda-float32-eager]        1.8464 (1.21)     2.3588 (1.24)     1.8771 (1.21)     0.0379 (1.54)     1.8688 (1.21)     0.0174 (1.68)        32;50  532.7451 (0.83)        542           1
test_aug_2d_crop[5-3-128-128-cuda-float32-eager]        2.0158 (1.32)     3.1973 (1.68)     2.0822 (1.34)     0.0724 (2.93)     2.0778 (1.34)     0.0222 (2.14)        14;63  480.2582 (0.74)        495           1
test_aug_2d_crop[5-1-128-128-cuda-float32-eager]        2.0191 (1.33)     2.5363 (1.33)     2.0943 (1.35)     0.0487 (1.97)     2.0858 (1.35)     0.0236 (2.27)      128;111  477.4799 (0.74)        493           1
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_denormalize
---------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_denormalize': 8 tests ---------------------------------------------------------------------------------------------
Name (time in us)                                                   Min                 Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_denormalize[1-1-128-128-cuda-float32-inductor]     357.5410 (1.0)      716.9400 (1.58)     375.5946 (1.01)     23.7797 (3.47)     372.3200 (1.01)      6.1445 (1.19)       42;159        2.6624 (0.99)       2660           1
test_aug_2d_denormalize[5-1-128-128-cuda-float32-inductor]     357.5900 (1.00)     668.9210 (1.48)     372.1281 (1.00)      9.3364 (1.36)     369.4100 (1.0)      10.8910 (2.11)       316;31        2.6872 (1.00)       2630           1
test_aug_2d_denormalize[1-3-128-128-cuda-float32-inductor]     360.1110 (1.01)     453.1210 (1.0)      371.6722 (1.0)       6.8511 (1.0)      370.0410 (1.00)      8.1845 (1.58)       547;67        2.6905 (1.0)        2652           1
test_aug_2d_denormalize[5-3-128-128-cuda-float32-inductor]     367.9400 (1.03)     577.1810 (1.27)     378.2698 (1.02)      7.2158 (1.05)     376.9910 (1.02)      5.6540 (1.09)      308;144        2.6436 (0.98)       2653           1
test_aug_2d_denormalize[5-1-128-128-cuda-float32-eager]        420.3400 (1.18)     820.2310 (1.81)     430.1837 (1.16)     10.9101 (1.59)     428.6210 (1.16)      5.5490 (1.07)       98;114        2.3246 (0.86)       2324           1
test_aug_2d_denormalize[5-3-128-128-cuda-float32-eager]        422.1300 (1.18)     728.3810 (1.61)     432.2413 (1.16)     10.9135 (1.59)     430.2100 (1.16)      6.2565 (1.21)      120;112        2.3135 (0.86)       2287           1
test_aug_2d_denormalize[1-1-128-128-cuda-float32-eager]        422.3110 (1.18)     636.1310 (1.40)     432.4913 (1.16)      9.1537 (1.34)     430.8310 (1.17)      5.1650 (1.0)       156;152        2.3122 (0.86)       2325           1
test_aug_2d_denormalize[1-3-128-128-cuda-float32-eager]        429.2600 (1.20)     771.1010 (1.70)     440.6123 (1.19)      8.7597 (1.28)     439.2510 (1.19)      5.6605 (1.10)      173;108        2.2696 (0.84)       2333           1
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_elastic_transform
------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_elastic_transform': 8 tests ------------------------------------------------------------------------------------------------
Name (time in us)                                                           Min                   Max                  Mean             StdDev                Median                IQR            Outliers         OPS            Rounds  Iterations
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_elastic_transform[1-3-128-128-cuda-float32-inductor]       586.1700 (1.0)        742.6510 (1.08)       600.4720 (1.0)      10.4619 (1.0)        599.0010 (1.0)       7.4525 (1.10)       199;76  1,665.3565 (1.0)        1657           1
test_aug_2d_elastic_transform[1-1-128-128-cuda-float32-inductor]       609.1710 (1.04)       689.1410 (1.0)        619.3051 (1.03)     17.9676 (1.72)       613.4705 (1.02)      6.7855 (1.0)           2;2  1,614.7130 (0.97)         20           1
test_aug_2d_elastic_transform[1-1-128-128-cuda-float32-eager]        1,229.9710 (2.10)     1,544.4020 (2.24)     1,245.4349 (2.07)     15.9385 (1.52)     1,242.4410 (2.07)      9.8317 (1.45)        38;35    802.9324 (0.48)        777           1
test_aug_2d_elastic_transform[1-3-128-128-cuda-float32-eager]        1,235.9010 (2.11)     1,624.8020 (2.36)     1,250.3450 (2.08)     17.4454 (1.67)     1,247.7820 (2.08)      8.6540 (1.28)        20;24    799.7792 (0.48)        808           1
test_aug_2d_elastic_transform[5-1-128-128-cuda-float32-inductor]     6,159.4970 (10.51)    6,221.0370 (9.03)     6,204.3944 (10.33)    15.1595 (1.45)     6,208.3820 (10.36)    10.0005 (1.47)          5;3    161.1761 (0.10)         20           1
test_aug_2d_elastic_transform[5-3-128-128-cuda-float32-inductor]     6,211.9480 (10.60)    6,675.2780 (9.69)     6,243.2546 (10.40)    50.0322 (4.78)     6,224.3265 (10.39)    46.5700 (6.86)          4;3    160.1729 (0.10)        162           1
test_aug_2d_elastic_transform[5-3-128-128-cuda-float32-eager]        6,465.7670 (11.03)    7,147.9970 (10.37)    6,583.8177 (10.96)    74.9023 (7.16)     6,560.1870 (10.95)    40.9618 (6.04)          8;9    151.8876 (0.09)        155           1
test_aug_2d_elastic_transform[5-1-128-128-cuda-float32-eager]        6,497.0680 (11.08)    7,168.0080 (10.40)    6,573.3861 (10.95)    75.9877 (7.26)     6,554.4060 (10.94)    49.2692 (7.26)          5;4    152.1286 (0.09)        155           1
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_equalize
------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_equalize': 8 tests --------------------------------------------------------------------------------------------------
Name (time in us)                                                  Min                   Max                  Mean              StdDev                Median                 IQR            Outliers         OPS            Rounds  Iterations
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_equalize[1-1-128-128-cuda-float32-inductor]       738.1610 (1.0)      1,006.0110 (1.0)        754.6666 (1.0)       13.4676 (1.0)        753.0610 (1.0)        8.8300 (1.0)         66;38  1,325.0885 (1.0)        1342           1
test_aug_2d_equalize[1-1-128-128-cuda-float32-eager]          772.4610 (1.05)     1,570.6120 (1.56)       790.9317 (1.05)      24.8325 (1.84)       787.4960 (1.05)       9.6900 (1.10)        26;88  1,264.3316 (0.95)       1282           1
test_aug_2d_equalize[1-3-128-128-cuda-float32-inductor]     1,436.4920 (1.95)     2,300.8830 (2.29)     1,475.3022 (1.95)      67.7736 (5.03)     1,460.9530 (1.94)      37.8220 (4.28)        13;13    677.8272 (0.51)        691           1
test_aug_2d_equalize[1-3-128-128-cuda-float32-eager]        1,509.3630 (2.04)     2,284.6530 (2.27)     1,531.1151 (2.03)      41.0432 (3.05)     1,527.5020 (2.03)      10.9200 (1.24)         6;20    653.1188 (0.49)        660           1
test_aug_2d_equalize[5-1-128-128-cuda-float32-inductor]     2,274.0420 (3.08)     3,855.0150 (3.83)     2,342.0729 (3.10)      97.5111 (7.24)     2,313.0530 (3.07)      73.8975 (8.37)          6;4    426.9722 (0.32)        435           1
test_aug_2d_equalize[5-1-128-128-cuda-float32-eager]        2,347.1730 (3.18)     3,099.6240 (3.08)     2,377.6541 (3.15)      44.7698 (3.32)     2,372.8730 (3.15)      12.9455 (1.47)         9;19    420.5826 (0.32)        424           1
test_aug_2d_equalize[5-3-128-128-cuda-float32-eager]        5,547.0870 (7.51)     6,143.0790 (6.11)     5,584.6632 (7.40)      48.3437 (3.59)     5,577.3170 (7.41)      22.4540 (2.54)          6;8    179.0618 (0.14)        181           1
test_aug_2d_equalize[5-3-128-128-cuda-float32-inductor]     6,135.8880 (8.31)     6,954.8700 (6.91)     6,352.7499 (8.42)     184.6006 (13.71)    6,462.9490 (8.58)     335.2153 (37.96)        51;0    157.4121 (0.12)        163           1
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_erasing
------------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_erasing': 8 tests ------------------------------------------------------------------------------------------------------
Name (time in us)                                                 Min                     Max                   Mean                  StdDev                Median                    IQR            Outliers         OPS            Rounds  Iterations
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_erasing[1-1-128-128-cuda-float32-inductor]       759.3200 (1.0)          786.5410 (1.0)         772.4428 (1.0)           10.1809 (1.0)        771.4710 (1.0)          13.7353 (1.28)          2;0  1,294.5942 (1.0)           5           1
test_aug_2d_erasing[1-3-128-128-cuda-float32-inductor]       800.6810 (1.05)         879.6210 (1.12)        822.2270 (1.06)          32.6752 (3.21)       809.2510 (1.05)         29.3650 (2.74)          1;1  1,216.2092 (0.94)          5           1
test_aug_2d_erasing[5-1-128-128-cuda-float32-inductor]       911.7520 (1.20)       1,037.6010 (1.32)        961.6372 (1.24)          51.1013 (5.02)       970.7610 (1.26)         72.3148 (6.75)          1;0  1,039.8932 (0.80)          5           1
test_aug_2d_erasing[1-1-128-128-cuda-float32-eager]          955.7410 (1.26)       1,406.4210 (1.79)        975.4345 (1.26)          18.6172 (1.83)       972.2610 (1.26)         11.1675 (1.04)        66;56  1,025.1841 (0.79)       1033           1
test_aug_2d_erasing[1-3-128-128-cuda-float32-eager]        1,000.3810 (1.32)       1,236.5410 (1.57)      1,017.5203 (1.32)          11.3568 (1.12)     1,015.1460 (1.32)         10.7200 (1.0)        157;23    982.7814 (0.76)        994           1
test_aug_2d_erasing[5-3-128-128-cuda-float32-inductor]     1,074.6610 (1.42)     330,389.0170 (420.05)   66,963.4804 (86.69)    147,259.3567 (>1000.0)  1,091.0510 (1.41)     82,390.1723 (>1000.0)       1;1     14.9335 (0.01)          5           1
test_aug_2d_erasing[5-1-128-128-cuda-float32-eager]        1,114.1610 (1.47)       1,922.9220 (2.44)      1,139.8752 (1.48)          38.0501 (3.74)     1,133.4610 (1.47)         13.6700 (1.28)        18;38    877.2890 (0.68)        882           1
test_aug_2d_erasing[5-3-128-128-cuda-float32-eager]        1,300.5310 (1.71)       1,624.0810 (2.06)      1,321.2970 (1.71)          27.1493 (2.67)     1,313.9920 (1.70)         14.4825 (1.35)        53;55    756.8321 (0.58)        769           1
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_fisheye
----------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_fisheye': 8 tests ----------------------------------------------------------------------------------------------
Name (time in us)                                               Min                   Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_fisheye[1-3-128-128-cuda-float32-inductor]     605.1510 (1.0)        616.2510 (1.0)      609.2749 (1.0)       4.1853 (1.10)     607.6700 (1.0)       6.4830 (1.62)          2;0        1.6413 (1.0)           7           1
test_aug_2d_fisheye[1-1-128-128-cuda-float32-inductor]     613.2310 (1.01)       674.3110 (1.09)     627.5850 (1.03)     20.9350 (5.48)     621.1610 (1.02)      5.8180 (1.45)          1;1        1.5934 (0.97)          7           1
test_aug_2d_fisheye[5-3-128-128-cuda-float32-inductor]     620.5810 (1.03)       647.5010 (1.05)     631.6907 (1.04)     10.4831 (2.75)     627.7900 (1.03)     16.8142 (4.19)          3;0        1.5831 (0.96)          7           1
test_aug_2d_fisheye[5-1-128-128-cuda-float32-inductor]     620.9710 (1.03)       632.0610 (1.03)     624.6680 (1.03)      3.8188 (1.0)      622.6110 (1.02)      4.0128 (1.0)           1;0        1.6009 (0.98)          7           1
test_aug_2d_fisheye[1-1-128-128-cuda-float32-eager]        722.4900 (1.19)     1,108.8520 (1.80)     739.1629 (1.21)     21.9363 (5.74)     734.0000 (1.21)     11.5550 (2.88)        56;63        1.3529 (0.82)       1371           1
test_aug_2d_fisheye[5-3-128-128-cuda-float32-eager]        724.8300 (1.20)     1,377.4020 (2.24)     753.8985 (1.24)     55.1089 (14.43)    740.4810 (1.22)     27.4600 (6.84)        22;23        1.3264 (0.81)       1363           1
test_aug_2d_fisheye[1-3-128-128-cuda-float32-eager]        728.6610 (1.20)       768.8510 (1.25)     738.8239 (1.21)     12.1554 (3.18)     732.8110 (1.21)     10.2028 (2.54)          3;3        1.3535 (0.82)         19           1
test_aug_2d_fisheye[5-1-128-128-cuda-float32-eager]        729.6010 (1.21)     1,195.0510 (1.94)     746.0804 (1.22)     17.9514 (4.70)     743.0310 (1.22)     10.0350 (2.50)        64;64        1.3403 (0.82)       1360           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_gamma
----------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_gamma': 8 tests ----------------------------------------------------------------------------------------------
Name (time in us)                                             Min                   Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_gamma[1-3-128-128-cuda-float32-inductor]     552.8010 (1.0)        993.0910 (1.51)     567.1205 (1.0)      22.8162 (2.00)     564.2655 (1.0)       8.6050 (1.17)        22;59        1.7633 (1.0)        1756           1
test_aug_2d_gamma[1-1-128-128-cuda-float32-inductor]     567.5810 (1.03)       849.3710 (1.29)     605.5493 (1.07)     12.1193 (1.06)     604.2155 (1.07)     10.9600 (1.49)       206;53        1.6514 (0.94)       1754           1
test_aug_2d_gamma[5-3-128-128-cuda-float32-inductor]     571.1810 (1.03)       859.9310 (1.31)     588.1486 (1.04)     12.8870 (1.13)     586.0255 (1.04)      9.9600 (1.35)        99;36        1.7003 (0.96)       1686           1
test_aug_2d_gamma[5-1-128-128-cuda-float32-inductor]     587.0310 (1.06)       656.7900 (1.0)      602.8720 (1.06)     22.1951 (1.94)     592.2210 (1.05)     17.8975 (2.43)          1;1        1.6587 (0.94)          9           1
test_aug_2d_gamma[1-3-128-128-cuda-float32-eager]        592.5210 (1.07)     1,032.2020 (1.57)     610.2380 (1.08)     14.6361 (1.28)     608.9410 (1.08)      8.6008 (1.17)        54;40        1.6387 (0.93)       1641           1
test_aug_2d_gamma[1-1-128-128-cuda-float32-eager]        598.2110 (1.08)     1,424.3820 (2.17)     614.0076 (1.08)     24.5480 (2.15)     612.1410 (1.08)      8.8350 (1.20)        20;48        1.6286 (0.92)       1643           1
test_aug_2d_gamma[5-3-128-128-cuda-float32-eager]        600.0610 (1.09)       912.6720 (1.39)     614.3268 (1.08)     11.4253 (1.0)      612.9160 (1.09)      7.4300 (1.01)        81;49        1.6278 (0.92)       1638           1
test_aug_2d_gamma[5-1-128-128-cuda-float32-eager]        605.3500 (1.10)     1,429.1820 (2.18)     617.8932 (1.09)     21.5044 (1.88)     616.2010 (1.09)      7.3508 (1.0)         12;48        1.6184 (0.92)       1635           1
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_gaussian_blur
----------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_gaussian_blur': 8 tests ----------------------------------------------------------------------------------------------
Name (time in us)                                                     Min                   Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_gaussian_blur[1-1-128-128-cuda-float32-inductor]     513.4710 (1.0)        771.0610 (1.15)     527.8701 (1.0)      11.4525 (1.55)     526.3210 (1.0)       6.5575 (1.0)         79;59        1.8944 (1.0)        1919           1
test_aug_2d_gaussian_blur[5-1-128-128-cuda-float32-inductor]     516.6500 (1.01)       786.5810 (1.18)     533.0215 (1.01)     13.2308 (1.79)     531.8110 (1.01)      7.5005 (1.14)       106;82        1.8761 (0.99)       1896           1
test_aug_2d_gaussian_blur[1-3-128-128-cuda-float32-inductor]     529.8910 (1.03)       674.5010 (1.01)     543.3119 (1.03)      8.3459 (1.13)     542.5060 (1.03)      8.9505 (1.36)       333;42        1.8406 (0.97)       1840           1
test_aug_2d_gaussian_blur[5-3-128-128-cuda-float32-inductor]     560.0200 (1.09)       668.1810 (1.0)      573.8396 (1.09)      7.4099 (1.0)      572.6705 (1.09)      7.3600 (1.12)       338;69        1.7426 (0.92)       1766           1
test_aug_2d_gaussian_blur[5-1-128-128-cuda-float32-eager]        745.2610 (1.45)     1,211.5020 (1.81)     762.6083 (1.44)     15.9726 (2.16)     759.4510 (1.44)     13.8350 (2.11)        88;14        1.3113 (0.69)       1331           1
test_aug_2d_gaussian_blur[1-1-128-128-cuda-float32-eager]        765.1510 (1.49)     1,184.3710 (1.77)     779.7152 (1.48)     15.9645 (2.15)     777.7510 (1.48)      8.7975 (1.34)        41;39        1.2825 (0.68)       1295           1
test_aug_2d_gaussian_blur[5-3-128-128-cuda-float32-eager]        769.5310 (1.50)     1,064.6610 (1.59)     784.5794 (1.49)     10.6471 (1.44)     783.2620 (1.49)      8.3350 (1.27)       139;35        1.2746 (0.67)       1285           1
test_aug_2d_gaussian_blur[1-3-128-128-cuda-float32-eager]        771.2110 (1.50)     1,243.0020 (1.86)     787.6384 (1.49)     22.6648 (3.06)     784.8810 (1.49)      7.8700 (1.20)        17;53        1.2696 (0.67)       1278           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_gaussian_illumination
-------------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_gaussian_illumination': 8 tests -------------------------------------------------------------------------------------------------------
Name (time in us)                                                             Min                       Max                    Mean                  StdDev                Median                     IQR            Outliers         OPS            Rounds  Iterations
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_gaussian_illumination[1-1-128-128-cuda-float32-inductor]     477.2500 (1.0)            549.1210 (1.0)          496.4226 (1.0)           29.8911 (3.17)       487.9000 (1.0)           25.0027 (3.31)          1;1  2,014.4127 (1.0)           5           1
test_aug_2d_gaussian_illumination[1-3-128-128-cuda-float32-inductor]     508.8300 (1.07)           572.9010 (1.04)         534.2066 (1.08)          25.6628 (2.72)       527.7310 (1.08)          37.9545 (5.03)          1;0  1,871.9349 (0.93)          5           1
test_aug_2d_gaussian_illumination[5-1-128-128-cuda-float32-inductor]     556.3410 (1.17)     1,365,228.3610 (>1000.0)  273,529.3208 (551.00)   610,278.3174 (>1000.0)    640.1110 (1.31)     341,218.4282 (>1000.0)       1;1      3.6559 (0.00)          5           1
test_aug_2d_gaussian_illumination[5-3-128-128-cuda-float32-inductor]     670.3210 (1.40)     1,380,674.0550 (>1000.0)  276,726.3400 (557.44)   617,125.5365 (>1000.0)    743.9420 (1.52)     345,093.8885 (>1000.0)       1;1      3.6137 (0.00)          5           1
test_aug_2d_gaussian_illumination[1-1-128-128-cuda-float32-eager]        711.9010 (1.49)           912.8010 (1.66)         726.5606 (1.46)           9.4233 (1.0)        725.1710 (1.49)           7.5510 (1.0)        140;50  1,376.3476 (0.68)       1366           1
test_aug_2d_gaussian_illumination[1-3-128-128-cuda-float32-eager]        776.6610 (1.63)         1,006.7610 (1.83)         814.2184 (1.64)          26.2463 (2.79)       822.5510 (1.69)          45.2872 (6.00)        420;9  1,228.1717 (0.61)       1275           1
test_aug_2d_gaussian_illumination[5-1-128-128-cuda-float32-eager]        799.6410 (1.68)         1,167.6920 (2.13)         828.3745 (1.67)          43.1097 (4.57)       818.8610 (1.68)           9.9328 (1.32)        58;89  1,207.1835 (0.60)       1221           1
test_aug_2d_gaussian_illumination[5-3-128-128-cuda-float32-eager]        968.6610 (2.03)         1,914.0420 (3.49)       1,043.9126 (2.10)          96.7742 (10.27)    1,013.5165 (2.08)          42.4005 (5.62)       88;120    957.9346 (0.48)        996           1
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_gaussian_noise
-------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_gaussian_noise': 8 tests ---------------------------------------------------------------------------------------------------
Name (time in us)                                                      Min                    Max                  Mean                 StdDev              Median                   IQR            Outliers         OPS            Rounds  Iterations
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_gaussian_noise[1-3-128-128-cuda-float32-inductor]     346.2000 (1.0)         606.1110 (1.18)       359.7249 (1.0)          14.5959 (1.90)     356.6100 (1.0)          7.4292 (1.49)      200;225  2,779.9020 (1.0)        2801           1
test_aug_2d_gaussian_noise[5-1-128-128-cuda-float32-inductor]     353.2200 (1.02)        559.6900 (1.09)       368.0287 (1.02)         12.8778 (1.67)     363.6700 (1.02)        10.4695 (2.10)      236;163  2,717.1798 (0.98)       2732           1
test_aug_2d_gaussian_noise[5-3-128-128-cuda-float32-inductor]     354.5900 (1.02)        512.4710 (1.0)        364.8061 (1.01)          7.6929 (1.0)      362.9310 (1.02)         6.2693 (1.26)      378;195  2,741.1825 (0.99)       2761           1
test_aug_2d_gaussian_noise[1-1-128-128-cuda-float32-eager]        364.5210 (1.05)        523.4410 (1.02)       376.2045 (1.05)         11.5654 (1.50)     373.0055 (1.05)         5.8300 (1.17)      195;234  2,658.1291 (0.96)       2682           1
test_aug_2d_gaussian_noise[5-1-128-128-cuda-float32-eager]        366.6000 (1.06)        587.9310 (1.15)       378.6866 (1.05)         11.8654 (1.54)     375.5300 (1.05)         6.6523 (1.34)      189;205  2,640.7064 (0.95)       2609           1
test_aug_2d_gaussian_noise[5-3-128-128-cuda-float32-eager]        367.1810 (1.06)        639.0610 (1.25)       379.5471 (1.06)         14.5058 (1.89)     376.0500 (1.05)         5.5710 (1.12)      174;255  2,634.7192 (0.95)       2665           1
test_aug_2d_gaussian_noise[1-3-128-128-cuda-float32-eager]        368.2710 (1.06)        563.2110 (1.10)       378.4165 (1.05)         10.6774 (1.39)     375.4610 (1.05)         4.9827 (1.0)       205;266  2,642.5906 (0.95)       2689           1
test_aug_2d_gaussian_noise[1-1-128-128-cuda-float32-inductor]     407.5610 (1.18)     28,138.6000 (54.91)    6,257.6108 (17.40)    12,245.9764 (>1000.0)  480.8200 (1.35)     7,944.5030 (>1000.0)       1;1    159.8054 (0.06)          5           1
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_grayscale
---------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_grayscale': 4 tests ---------------------------------------------------------------------------------------------
Name (time in us)                                                 Min                 Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_grayscale[1-3-128-128-cuda-float32-inductor]     312.6400 (1.0)      534.2100 (1.09)     325.9801 (1.00)      8.0130 (1.0)      324.0500 (1.00)      5.0775 (1.0)       392;272        3.0677 (1.00)       3171           1
test_aug_2d_grayscale[5-3-128-128-cuda-float32-inductor]     313.0200 (1.00)     490.7700 (1.0)      325.2540 (1.0)      10.5598 (1.32)     322.6910 (1.0)      10.1745 (2.00)      277;113        3.0745 (1.0)        3035           1
test_aug_2d_grayscale[1-3-128-128-cuda-float32-eager]        455.6310 (1.46)     858.9610 (1.75)     468.1724 (1.44)     11.8494 (1.48)     465.9160 (1.44)      6.5600 (1.29)      166;163        2.1360 (0.69)       2150           1
test_aug_2d_grayscale[5-3-128-128-cuda-float32-eager]        457.5710 (1.46)     920.9910 (1.88)     479.3781 (1.47)     50.8240 (6.34)     466.8910 (1.45)      9.0200 (1.78)       90;232        2.0860 (0.68)       2156           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_horizontal_flip
--------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_horizontal_flip': 8 tests ---------------------------------------------------------------------------------------------
Name (time in us)                                                       Min                 Max                Mean             StdDev              Median               IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_horizontal_flip[1-3-128-128-cuda-float32-inductor]     344.3200 (1.0)      496.5100 (1.13)     354.0033 (1.0)       6.6290 (1.0)      352.5200 (1.0)      6.6615 (1.25)       446;94        2.8248 (1.0)        2781           1
test_aug_2d_horizontal_flip[1-1-128-128-cuda-float32-inductor]     344.7700 (1.00)     714.0310 (1.63)     359.4656 (1.02)     29.6977 (4.48)     354.9800 (1.01)     7.2868 (1.37)       51;119        2.7819 (0.98)       2745           1
test_aug_2d_horizontal_flip[5-1-128-128-cuda-float32-eager]        353.6600 (1.03)     439.2900 (1.0)      363.2451 (1.03)      7.1893 (1.08)     360.8205 (1.02)     6.6250 (1.24)      415;200        2.7530 (0.97)       2752           1
test_aug_2d_horizontal_flip[1-3-128-128-cuda-float32-eager]        354.1200 (1.03)     596.2310 (1.36)     363.9802 (1.03)     14.5099 (2.19)     361.2810 (1.02)     5.8190 (1.09)       76;163        2.7474 (0.97)       2778           1
test_aug_2d_horizontal_flip[1-1-128-128-cuda-float32-eager]        354.7910 (1.03)     616.7000 (1.40)     363.2222 (1.03)     10.8591 (1.64)     361.0200 (1.02)     5.4410 (1.02)      113;148        2.7531 (0.97)       2698           1
test_aug_2d_horizontal_flip[5-3-128-128-cuda-float32-inductor]     357.8300 (1.04)     688.2110 (1.57)     373.8151 (1.06)     24.7946 (3.74)     370.2100 (1.05)     9.7675 (1.83)        58;90        2.6751 (0.95)       2663           1
test_aug_2d_horizontal_flip[5-3-128-128-cuda-float32-eager]        358.5410 (1.04)     652.8710 (1.49)     368.3079 (1.04)     11.6772 (1.76)     366.0500 (1.04)     5.3380 (1.0)       101;189        2.7151 (0.96)       2739           1
test_aug_2d_horizontal_flip[5-1-128-128-cuda-float32-inductor]     369.7400 (1.07)     586.2410 (1.33)     381.1183 (1.08)      9.0356 (1.36)     378.8055 (1.07)     6.6200 (1.24)      321;228        2.6239 (0.93)       2632           1
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_hue
---------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_hue': 4 tests ----------------------------------------------------------------------------------------------
Name (time in us)                                           Min                   Max                Mean             StdDev              Median               IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_hue[1-3-128-128-cuda-float32-inductor]     524.2900 (1.0)        871.5320 (1.30)     539.5061 (1.0)      17.4368 (2.32)     535.9310 (1.0)      8.7790 (1.15)      123;157        1.8535 (1.0)        1882           1
test_aug_2d_hue[5-3-128-128-cuda-float32-inductor]     532.9300 (1.02)       669.8710 (1.0)      546.4297 (1.01)      7.5075 (1.0)      545.1910 (1.02)     7.6553 (1.0)        352;61        1.8301 (0.99)       1833           1
test_aug_2d_hue[1-3-128-128-cuda-float32-eager]        884.5510 (1.69)     1,122.8920 (1.68)     900.6561 (1.67)     12.3732 (1.65)     897.6560 (1.67)     9.4110 (1.23)       151;84        1.1103 (0.60)       1122           1
test_aug_2d_hue[5-3-128-128-cuda-float32-eager]        886.9720 (1.69)     1,106.7020 (1.65)     904.6531 (1.68)     13.7177 (1.83)     902.3020 (1.68)     9.4400 (1.23)        71;39        1.1054 (0.60)       1119           1
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_invert
--------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_invert': 8 tests ---------------------------------------------------------------------------------------------
Name (time in us)                                              Min                 Max                Mean             StdDev              Median               IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_invert[1-3-128-128-cuda-float32-inductor]     342.4310 (1.0)      633.9110 (1.23)     352.4024 (1.0)      10.1553 (1.41)     350.5005 (1.0)      5.6345 (1.09)      168;177        2.8377 (1.0)        2776           1
test_aug_2d_invert[5-1-128-128-cuda-float32-inductor]     344.4510 (1.01)     514.5300 (1.0)      359.6341 (1.02)      7.2134 (1.0)      358.9105 (1.02)     5.8200 (1.13)      507;159        2.7806 (0.98)       2772           1
test_aug_2d_invert[5-3-128-128-cuda-float32-inductor]     346.2400 (1.01)     544.4400 (1.06)     360.1411 (1.02)      8.2609 (1.15)     358.8855 (1.02)     6.3045 (1.22)      375;129        2.7767 (0.98)       2756           1
test_aug_2d_invert[1-1-128-128-cuda-float32-inductor]     349.6610 (1.02)     532.6410 (1.04)     360.1689 (1.02)      7.4100 (1.03)     358.7110 (1.02)     5.8990 (1.14)      295;119        2.7765 (0.98)       2773           1
test_aug_2d_invert[5-3-128-128-cuda-float32-eager]        378.4310 (1.11)     676.0610 (1.31)     387.8884 (1.10)      9.8134 (1.36)     386.4000 (1.10)     5.1577 (1.0)       106;117        2.5781 (0.91)       2587           1
test_aug_2d_invert[1-3-128-128-cuda-float32-eager]        379.2300 (1.11)     673.4610 (1.31)     390.0718 (1.11)      9.7717 (1.35)     388.5110 (1.11)     5.4603 (1.06)      135;127        2.5636 (0.90)       2573           1
test_aug_2d_invert[1-1-128-128-cuda-float32-eager]        380.1900 (1.11)     653.8110 (1.27)     390.0501 (1.11)      8.0609 (1.12)     388.8355 (1.11)     5.1900 (1.01)      170;114        2.5638 (0.90)       2582           1
test_aug_2d_invert[5-1-128-128-cuda-float32-eager]        388.2000 (1.13)     587.5310 (1.14)     418.5427 (1.19)      8.9037 (1.23)     416.9455 (1.19)     7.0200 (1.36)      309;169        2.3892 (0.84)       2590           1
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_linear_corner_illumination
------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_linear_corner_illumination': 8 tests -------------------------------------------------------------------------------------------------
Name (time in us)                                                                    Min                   Max                  Mean              StdDev                Median                IQR            Outliers         OPS            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_linear_corner_illumination[1-1-128-128-cuda-float32-inductor]       622.9110 (1.0)        722.6110 (1.0)        645.5825 (1.0)       11.1108 (1.0)        644.1910 (1.0)      15.7255 (1.02)       522;11  1,548.9886 (1.0)        1576           1
test_aug_2d_linear_corner_illumination[1-3-128-128-cuda-float32-inductor]       665.7000 (1.07)       910.1710 (1.26)       694.1256 (1.08)      16.7947 (1.51)       691.6610 (1.07)     18.4700 (1.20)       277;14  1,440.6615 (0.93)       1485           1
test_aug_2d_linear_corner_illumination[1-1-128-128-cuda-float32-eager]          676.8410 (1.09)       818.7410 (1.13)       700.7773 (1.09)      12.1951 (1.10)       698.9615 (1.09)     15.3600 (1.0)        394;24  1,426.9869 (0.92)       1434           1
test_aug_2d_linear_corner_illumination[1-3-128-128-cuda-float32-eager]          731.4710 (1.17)     1,087.4620 (1.50)       764.2093 (1.18)      21.9152 (1.97)       761.1510 (1.18)     19.5000 (1.27)       177;51  1,308.5421 (0.84)       1334           1
test_aug_2d_linear_corner_illumination[5-1-128-128-cuda-float32-inductor]       773.2710 (1.24)     1,148.2010 (1.59)       825.5361 (1.28)      27.4477 (2.47)       822.7765 (1.28)     27.5360 (1.79)       227;31  1,211.3341 (0.78)       1280           1
test_aug_2d_linear_corner_illumination[5-1-128-128-cuda-float32-eager]          827.5910 (1.33)     1,342.3220 (1.86)       904.9778 (1.40)      37.7631 (3.40)       905.4710 (1.41)     46.1242 (3.00)        291;8  1,104.9995 (0.71)       1207           1
test_aug_2d_linear_corner_illumination[5-3-128-128-cuda-float32-inductor]       921.9910 (1.48)     1,655.6530 (2.29)     1,017.3216 (1.58)      81.0608 (7.30)       997.1510 (1.55)     56.7150 (3.69)        95;71    982.9733 (0.63)       1072           1
test_aug_2d_linear_corner_illumination[5-3-128-128-cuda-float32-eager]        1,028.1220 (1.65)     1,825.8830 (2.53)     1,166.3271 (1.81)     139.9185 (12.59)    1,119.8920 (1.74)     79.3977 (5.17)        81;90    857.3924 (0.55)        911           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_linear_illumination
----------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_linear_illumination': 8 tests ----------------------------------------------------------------------------------------------
Name (time in us)                                                           Min                   Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_linear_illumination[1-1-128-128-cuda-float32-eager]        569.6010 (1.0)        962.5020 (1.17)     594.1083 (1.0)      24.5077 (1.57)     589.8510 (1.0)      12.4223 (1.03)      138;142        1.6832 (1.0)        1715           1
test_aug_2d_linear_illumination[1-1-128-128-cuda-float32-inductor]     576.8910 (1.01)       876.5510 (1.07)     601.3538 (1.01)     17.2745 (1.11)     598.2610 (1.01)     12.0450 (1.0)       215;137        1.6629 (0.99)       1703           1
test_aug_2d_linear_illumination[1-3-128-128-cuda-float32-eager]        611.2710 (1.07)       884.9710 (1.08)     650.0757 (1.09)     25.7699 (1.65)     639.9410 (1.08)     40.5977 (3.37)       432;13        1.5383 (0.91)       1639           1
test_aug_2d_linear_illumination[1-3-128-128-cuda-float32-inductor]     617.6010 (1.08)       838.3910 (1.02)     662.1805 (1.11)     19.8060 (1.27)     664.9210 (1.13)     19.5350 (1.62)       457;50        1.5102 (0.90)       1608           1
test_aug_2d_linear_illumination[5-1-128-128-cuda-float32-eager]        689.6810 (1.21)       822.9810 (1.0)      730.1645 (1.23)     15.5929 (1.0)      729.0010 (1.24)     20.0150 (1.66)       410;22        1.3696 (0.81)       1405           1
test_aug_2d_linear_illumination[5-1-128-128-cuda-float32-inductor]     723.8010 (1.27)       903.6310 (1.10)     785.0821 (1.32)     26.0323 (1.67)     784.3160 (1.33)     40.9595 (3.40)        473;6        1.2738 (0.76)       1340           1
test_aug_2d_linear_illumination[5-3-128-128-cuda-float32-eager]        823.2210 (1.45)     1,060.6420 (1.29)     891.8123 (1.50)     33.2451 (2.13)     895.0710 (1.52)     56.1232 (4.66)        429;2        1.1213 (0.67)       1185           1
test_aug_2d_linear_illumination[5-3-128-128-cuda-float32-inductor]     857.9810 (1.51)     1,491.5120 (1.81)     935.5589 (1.57)     65.7632 (4.22)     921.6310 (1.56)     53.1877 (4.42)       111;75        1.0689 (0.64)       1159           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_median_blur
------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_median_blur': 8 tests -------------------------------------------------------------------------------------------------
Name (time in us)                                                     Min                   Max                  Mean              StdDev                Median                IQR            Outliers         OPS            Rounds  Iterations
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_median_blur[1-1-128-128-cuda-float32-inductor]       404.3500 (1.0)        515.7310 (1.08)       415.4380 (1.0)        7.9490 (1.04)       413.4100 (1.0)       6.3023 (1.35)      338;180  2,407.0982 (1.0)        2425           1
test_aug_2d_median_blur[1-3-128-128-cuda-float32-inductor]       405.6810 (1.00)       476.1710 (1.0)        424.3720 (1.02)      10.3485 (1.36)       423.2900 (1.02)     11.9350 (2.56)      595;104  2,356.4231 (0.98)       2351           1
test_aug_2d_median_blur[5-1-128-128-cuda-float32-inductor]       412.6910 (1.02)     1,090.8520 (2.29)       618.4094 (1.49)      86.9802 (11.42)      654.0410 (1.58)      4.6550 (1.0)       388;584  1,617.0517 (0.67)       2385           1
test_aug_2d_median_blur[5-3-128-128-cuda-float32-inductor]       416.1510 (1.03)     2,568.8340 (5.39)     1,843.1543 (4.44)     252.7933 (33.18)    1,885.7870 (4.56)     13.8810 (2.98)       87;126    542.5482 (0.23)       2382           1
test_aug_2d_median_blur[1-1-128-128-cuda-float32-eager]          460.4810 (1.14)       540.7110 (1.14)       470.5780 (1.13)       7.6189 (1.0)        468.5400 (1.13)      6.1435 (1.32)      313;178  2,125.0462 (0.88)       2141           1
test_aug_2d_median_blur[5-1-128-128-cuda-float32-eager]          463.5610 (1.15)     1,295.4110 (2.72)       652.5739 (1.57)      56.5235 (7.42)       665.5510 (1.61)      5.3690 (1.15)      167;511  1,532.3936 (0.64)       2150           1
test_aug_2d_median_blur[1-3-128-128-cuda-float32-eager]          487.9500 (1.21)       565.4410 (1.19)       500.5068 (1.20)       7.6959 (1.01)       498.8655 (1.21)      8.6100 (1.85)       513;87  1,997.9747 (0.83)       2114           1
test_aug_2d_median_blur[5-3-128-128-cuda-float32-eager]        1,852.4330 (4.58)     2,495.8440 (5.24)     1,903.3932 (4.58)      33.5020 (4.40)     1,900.7830 (4.60)     15.1800 (3.26)        62;85    525.3775 (0.22)       2145           1
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_motion_blur
--------------------------------------------------------------------------------------- benchmark 'test_aug_2d_motion_blur': 8 tests --------------------------------------------------------------------------------------
Name (time in ms)                                                 Min               Max              Mean            StdDev            Median               IQR            Outliers       OPS            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_motion_blur[1-1-128-128-cuda-float32-inductor]     1.0484 (1.0)      1.0762 (1.0)      1.0641 (1.0)      0.0102 (1.35)     1.0661 (1.0)      0.0158 (1.48)          3;0  939.7221 (1.0)           7           1
test_aug_2d_motion_blur[1-3-128-128-cuda-float32-inductor]     1.1078 (1.06)     1.1367 (1.06)     1.1228 (1.06)     0.0120 (1.59)     1.1213 (1.05)     0.0211 (1.98)          3;0  890.6516 (0.95)          6           1
test_aug_2d_motion_blur[5-1-128-128-cuda-float32-inductor]     1.1220 (1.07)     1.1424 (1.06)     1.1307 (1.06)     0.0076 (1.0)      1.1300 (1.06)     0.0110 (1.03)          2;0  884.4364 (0.94)          6           1
test_aug_2d_motion_blur[5-3-128-128-cuda-float32-inductor]     1.1503 (1.10)     1.1736 (1.09)     1.1614 (1.09)     0.0082 (1.08)     1.1618 (1.09)     0.0107 (1.0)           2;0  861.0001 (0.92)          6           1
test_aug_2d_motion_blur[1-3-128-128-cuda-float32-eager]        1.3552 (1.29)     1.5213 (1.41)     1.3794 (1.30)     0.0175 (2.31)     1.3743 (1.29)     0.0184 (1.73)       149;32  724.9779 (0.77)        732           1
test_aug_2d_motion_blur[1-1-128-128-cuda-float32-eager]        1.3650 (1.30)     1.4105 (1.31)     1.3814 (1.30)     0.0129 (1.71)     1.3775 (1.29)     0.0175 (1.64)          6;0  723.8855 (0.77)         21           1
test_aug_2d_motion_blur[5-3-128-128-cuda-float32-eager]        1.3943 (1.33)     1.5380 (1.43)     1.4126 (1.33)     0.0125 (1.65)     1.4109 (1.32)     0.0121 (1.13)        95;25  707.9244 (0.75)        708           1
test_aug_2d_motion_blur[5-1-128-128-cuda-float32-eager]        1.4057 (1.34)     1.9553 (1.82)     1.4266 (1.34)     0.0337 (4.46)     1.4221 (1.33)     0.0116 (1.09)        15;24  700.9688 (0.75)        710           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_normalize
---------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_normalize': 8 tests ---------------------------------------------------------------------------------------------
Name (time in us)                                                 Min                 Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_normalize[1-1-128-128-cuda-float32-inductor]     356.5400 (1.0)      754.2910 (1.18)     367.9956 (1.0)      13.4915 (1.46)     366.1400 (1.0)       6.0078 (1.0)        62;106        2.7174 (1.0)        2731           1
test_aug_2d_normalize[5-1-128-128-cuda-float32-inductor]     360.7300 (1.01)     654.6120 (1.02)     384.9387 (1.05)     16.5414 (1.79)     388.9260 (1.06)     24.8600 (4.14)        871;9        2.5978 (0.96)       2726           1
test_aug_2d_normalize[5-3-128-128-cuda-float32-inductor]     386.8100 (1.08)     640.5900 (1.0)      401.0184 (1.09)      9.7801 (1.06)     398.5200 (1.09)      8.1332 (1.35)      308;147        2.4937 (0.92)       2469           1
test_aug_2d_normalize[1-3-128-128-cuda-float32-inductor]     388.0610 (1.09)     778.8920 (1.22)     404.9927 (1.10)     12.8983 (1.40)     403.2805 (1.10)      7.0600 (1.18)        89;75        2.4692 (0.91)       2476           1
test_aug_2d_normalize[5-1-128-128-cuda-float32-eager]        423.8210 (1.19)     849.7820 (1.33)     435.3907 (1.18)     12.0110 (1.30)     432.8305 (1.18)      7.1945 (1.20)      189;163        2.2968 (0.85)       2328           1
test_aug_2d_normalize[1-1-128-128-cuda-float32-eager]        426.1900 (1.20)     842.2810 (1.31)     437.2852 (1.19)     15.5611 (1.68)     435.0405 (1.19)      6.5890 (1.10)        47;83        2.2868 (0.84)       2318           1
test_aug_2d_normalize[5-3-128-128-cuda-float32-eager]        431.0810 (1.21)     726.2020 (1.13)     443.0444 (1.20)      9.2377 (1.0)      440.8150 (1.20)      7.3745 (1.23)      277;145        2.2571 (0.83)       2288           1
test_aug_2d_normalize[1-3-128-128-cuda-float32-eager]        435.4610 (1.22)     826.9710 (1.29)     459.4882 (1.25)     39.8533 (4.31)     449.1055 (1.23)     21.8710 (3.64)        79;81        2.1763 (0.80)       2282           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_padto
---------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_padto': 8 tests ---------------------------------------------------------------------------------------------
Name (time in us)                                             Min                 Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_padto[1-1-128-128-cuda-float32-inductor]     305.8300 (1.0)      532.6310 (1.19)     318.7562 (1.0)       9.7073 (1.52)     317.6600 (1.0)       7.9055 (1.59)       259;95        3.1372 (1.0)        3069           1
test_aug_2d_padto[1-3-128-128-cuda-float32-inductor]     311.8700 (1.02)     448.8710 (1.0)      321.6786 (1.01)      6.3856 (1.00)     320.3200 (1.01)      6.7743 (1.36)       536;83        3.1087 (0.99)       3095           1
test_aug_2d_padto[5-3-128-128-cuda-float32-inductor]     312.7400 (1.02)     467.3100 (1.04)     321.6810 (1.01)      7.0549 (1.11)     320.2300 (1.01)      5.1600 (1.04)      264;152        3.1087 (0.99)       3012           1
test_aug_2d_padto[5-1-128-128-cuda-float32-inductor]     315.9800 (1.03)     562.2500 (1.25)     325.1317 (1.02)      8.3469 (1.31)     323.6500 (1.02)      4.9822 (1.0)       185;153        3.0757 (0.98)       3013           1
test_aug_2d_padto[5-1-128-128-cuda-float32-eager]        367.0300 (1.20)     680.1800 (1.52)     377.4891 (1.18)     19.1775 (3.01)     374.6300 (1.18)      5.4850 (1.10)       45;142        2.6491 (0.84)       2669           1
test_aug_2d_padto[1-1-128-128-cuda-float32-eager]        367.7300 (1.20)     572.3710 (1.28)     378.4239 (1.19)      9.4434 (1.48)     376.7900 (1.19)      6.2408 (1.25)      159;106        2.6425 (0.84)       2643           1
test_aug_2d_padto[5-3-128-128-cuda-float32-eager]        368.3100 (1.20)     483.4810 (1.08)     377.7550 (1.19)      6.3711 (1.0)      376.5400 (1.19)      5.6078 (1.13)      374;119        2.6472 (0.84)       2667           1
test_aug_2d_padto[1-3-128-128-cuda-float32-eager]        379.6500 (1.24)     734.1100 (1.64)     404.3470 (1.27)     27.7047 (4.35)     405.8850 (1.28)     25.6800 (5.15)        70;40        2.4731 (0.79)       2594           1
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_perspective
--------------------------------------------------------------------------------------- benchmark 'test_aug_2d_perspective': 8 tests --------------------------------------------------------------------------------------
Name (time in ms)                                                 Min               Max              Mean            StdDev            Median               IQR            Outliers       OPS            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_perspective[1-3-128-128-cuda-float32-inductor]     1.2940 (1.0)      1.3657 (1.03)     1.3236 (1.01)     0.0256 (3.21)     1.3202 (1.01)     0.0410 (3.31)          3;0  755.5246 (0.99)          9           1
test_aug_2d_perspective[5-1-128-128-cuda-float32-inductor]     1.2996 (1.00)     1.3406 (1.01)     1.3200 (1.01)     0.0128 (1.60)     1.3183 (1.01)     0.0167 (1.35)          2;0  757.5626 (0.99)          8           1
test_aug_2d_perspective[1-1-128-128-cuda-float32-inductor]     1.3006 (1.01)     1.3234 (1.0)      1.3097 (1.0)      0.0080 (1.0)      1.3083 (1.0)      0.0124 (1.0)           2;0  763.5204 (1.0)           8           1
test_aug_2d_perspective[5-3-128-128-cuda-float32-inductor]     1.3295 (1.03)     1.3827 (1.04)     1.3455 (1.03)     0.0207 (2.59)     1.3350 (1.02)     0.0264 (2.13)          2;0  743.2455 (0.97)          8           1
test_aug_2d_perspective[5-3-128-128-cuda-float32-eager]        1.6482 (1.27)     1.9946 (1.51)     1.6698 (1.27)     0.0202 (2.52)     1.6665 (1.27)     0.0155 (1.25)        47;12  598.8666 (0.78)        606           1
test_aug_2d_perspective[1-1-128-128-cuda-float32-eager]        1.6528 (1.28)     2.0591 (1.56)     1.6780 (1.28)     0.0258 (3.23)     1.6722 (1.28)     0.0169 (1.37)        60;42  595.9600 (0.78)        604           1
test_aug_2d_perspective[1-3-128-128-cuda-float32-eager]        1.6564 (1.28)     2.3726 (1.79)     1.6833 (1.29)     0.0400 (5.01)     1.6746 (1.28)     0.0174 (1.41)        20;59  594.0802 (0.78)        602           1
test_aug_2d_perspective[5-1-128-128-cuda-float32-eager]        1.6591 (1.28)     2.2853 (1.73)     1.6805 (1.28)     0.0319 (4.00)     1.6752 (1.28)     0.0160 (1.30)        16;21  595.0623 (0.78)        603           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_plackian_jitter
---------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_plackian_jitter': 4 tests ---------------------------------------------------------------------------------------------
Name (time in us)                                                       Min                 Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_plackian_jitter[1-3-128-128-cuda-float32-inductor]     480.2100 (1.0)      508.7000 (1.0)      492.2421 (1.0)      10.2566 (1.94)     487.5010 (1.0)      19.1390 (2.81)          5;0        2.0315 (1.0)          14           1
test_aug_2d_plackian_jitter[5-3-128-128-cuda-float32-inductor]     492.3210 (1.03)     509.4110 (1.00)     500.2400 (1.02)      5.2847 (1.0)      500.5010 (1.03)      6.7990 (1.0)           3;0        1.9990 (0.98)         14           1
test_aug_2d_plackian_jitter[1-3-128-128-cuda-float32-eager]        545.0510 (1.14)     870.7810 (1.71)     557.7628 (1.13)     11.6050 (2.20)     556.0110 (1.14)      8.8700 (1.30)       132;50        1.7929 (0.88)       1814           1
test_aug_2d_plackian_jitter[5-3-128-128-cuda-float32-eager]        549.2200 (1.14)     832.1010 (1.64)     563.0498 (1.14)     10.0842 (1.91)     561.6010 (1.15)      8.0825 (1.19)       130;42        1.7760 (0.87)       1787           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_plasma_briggtness
--------------------------------------------------------------------------------------- benchmark 'test_aug_2d_plasma_briggtness': 8 tests --------------------------------------------------------------------------------------
Name (time in ms)                                                       Min               Max              Mean            StdDev            Median               IQR            Outliers       OPS            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_plasma_briggtness[1-1-128-128-cuda-float32-inductor]     1.2962 (1.0)      1.5730 (1.08)     1.3369 (1.0)      0.0217 (1.69)     1.3396 (1.0)      0.0299 (2.27)        203;6  747.9845 (1.0)         745           1
test_aug_2d_plasma_briggtness[5-1-128-128-cuda-float32-inductor]     1.3396 (1.03)     1.4554 (1.0)      1.3740 (1.03)     0.0204 (1.59)     1.3685 (1.02)     0.0310 (2.36)        236;2  727.7782 (0.97)        736           1
test_aug_2d_plasma_briggtness[1-3-128-128-cuda-float32-inductor]     1.3506 (1.04)     1.4875 (1.02)     1.3744 (1.03)     0.0128 (1.0)      1.3724 (1.02)     0.0153 (1.16)       184;21  727.5682 (0.97)        718           1
test_aug_2d_plasma_briggtness[5-3-128-128-cuda-float32-inductor]     1.3698 (1.06)     1.8928 (1.30)     1.3989 (1.05)     0.0345 (2.68)     1.3926 (1.04)     0.0150 (1.14)        40;58  714.8360 (0.96)        724           1
test_aug_2d_plasma_briggtness[1-1-128-128-cuda-float32-eager]        2.8637 (2.21)     3.1783 (2.18)     2.8895 (2.16)     0.0211 (1.65)     2.8869 (2.16)     0.0132 (1.0)         18;12  346.0818 (0.46)        348           1
test_aug_2d_plasma_briggtness[5-1-128-128-cuda-float32-eager]        2.8791 (2.22)     3.1010 (2.13)     2.9151 (2.18)     0.0241 (1.88)     2.9094 (2.17)     0.0262 (1.99)        72;11  343.0408 (0.46)        346           1
test_aug_2d_plasma_briggtness[1-3-128-128-cuda-float32-eager]        2.8994 (2.24)     3.7677 (2.59)     3.0247 (2.26)     0.1079 (8.40)     2.9638 (2.21)     0.1906 (14.46)        64;2  330.6164 (0.44)        345           1
test_aug_2d_plasma_briggtness[5-3-128-128-cuda-float32-eager]        2.9165 (2.25)     3.3996 (2.34)     2.9560 (2.21)     0.0436 (3.40)     2.9432 (2.20)     0.0241 (1.83)        25;26  338.2946 (0.45)        343           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_plasma_contrast
--------------------------------------------------------------------------------------- benchmark 'test_aug_2d_plasma_contrast': 8 tests --------------------------------------------------------------------------------------
Name (time in ms)                                                     Min               Max              Mean            StdDev            Median               IQR            Outliers       OPS            Rounds  Iterations
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_plasma_contrast[1-3-128-128-cuda-float32-inductor]     1.3082 (1.0)      1.5334 (1.11)     1.3450 (1.0)      0.0206 (1.37)     1.3474 (1.00)     0.0307 (2.39)        235;6  743.4893 (1.0)         742           1
test_aug_2d_plasma_contrast[1-1-128-128-cuda-float32-inductor]     1.3206 (1.01)     1.3875 (1.0)      1.3452 (1.00)     0.0160 (1.06)     1.3417 (1.0)      0.0155 (1.21)          2;1  743.3675 (1.00)         13           1
test_aug_2d_plasma_contrast[5-3-128-128-cuda-float32-inductor]     1.3241 (1.01)     1.5525 (1.12)     1.3500 (1.00)     0.0150 (1.0)      1.3476 (1.00)     0.0128 (1.0)         76;20  740.7316 (1.00)        747           1
test_aug_2d_plasma_contrast[5-1-128-128-cuda-float32-inductor]     1.3328 (1.02)     1.4319 (1.03)     1.3785 (1.02)     0.0183 (1.22)     1.3790 (1.03)     0.0224 (1.75)        244;5  725.4108 (0.98)        730           1
test_aug_2d_plasma_contrast[1-1-128-128-cuda-float32-eager]        2.8638 (2.19)     3.5080 (2.53)     3.0809 (2.29)     0.0573 (3.81)     3.0805 (2.30)     0.0174 (1.35)        25;33  324.5829 (0.44)        351           1
test_aug_2d_plasma_contrast[5-1-128-128-cuda-float32-eager]        2.8689 (2.19)     3.3113 (2.39)     2.9472 (2.19)     0.0862 (5.73)     2.9075 (2.17)     0.0346 (2.69)        72;74  339.3062 (0.46)        349           1
test_aug_2d_plasma_contrast[1-3-128-128-cuda-float32-eager]        2.8768 (2.20)     3.3439 (2.41)     2.9064 (2.16)     0.0295 (1.96)     2.9021 (2.16)     0.0159 (1.24)        18;17  344.0653 (0.46)        347           1
test_aug_2d_plasma_contrast[5-3-128-128-cuda-float32-eager]        2.8927 (2.21)     4.5361 (3.27)     2.9272 (2.18)     0.0993 (6.60)     2.9152 (2.17)     0.0147 (1.14)         6;27  341.6270 (0.46)        346           1
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_plasma_shadow
--------------------------------------------------------------------------------------- benchmark 'test_aug_2d_plasma_shadow': 8 tests --------------------------------------------------------------------------------------
Name (time in ms)                                                   Min               Max              Mean            StdDev            Median               IQR            Outliers       OPS            Rounds  Iterations
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_plasma_shadow[1-3-128-128-cuda-float32-inductor]     1.4005 (1.0)      1.5297 (1.05)     1.4248 (1.0)      0.0465 (4.21)     1.4077 (1.0)      0.0102 (1.0)           1;1  701.8603 (1.0)           7           1
test_aug_2d_plasma_shadow[5-1-128-128-cuda-float32-inductor]     1.4262 (1.02)     1.4532 (1.0)      1.4394 (1.01)     0.0110 (1.0)      1.4363 (1.02)     0.0205 (2.00)          3;0  694.7343 (0.99)          7           1
test_aug_2d_plasma_shadow[5-3-128-128-cuda-float32-inductor]     1.4644 (1.05)     1.5065 (1.04)     1.4849 (1.04)     0.0144 (1.30)     1.4877 (1.06)     0.0194 (1.90)          2;0  673.4528 (0.96)          7           1
test_aug_2d_plasma_shadow[1-1-128-128-cuda-float32-inductor]     1.4894 (1.06)     1.6454 (1.13)     1.5309 (1.07)     0.0644 (5.83)     1.4951 (1.06)     0.0858 (8.39)          2;0  653.2108 (0.93)          7           1
test_aug_2d_plasma_shadow[1-1-128-128-cuda-float32-eager]        2.9117 (2.08)     3.4069 (2.34)     2.9557 (2.07)     0.0470 (4.26)     2.9398 (2.09)     0.0279 (2.73)        47;48  338.3306 (0.48)        344           1
test_aug_2d_plasma_shadow[1-3-128-128-cuda-float32-eager]        2.9123 (2.08)     3.3915 (2.33)     2.9647 (2.08)     0.0651 (5.89)     2.9387 (2.09)     0.0289 (2.83)        67;72  337.3033 (0.48)        343           1
test_aug_2d_plasma_shadow[5-1-128-128-cuda-float32-eager]        2.9137 (2.08)     3.2962 (2.27)     2.9445 (2.07)     0.0250 (2.27)     2.9417 (2.09)     0.0122 (1.19)        14;18  339.6201 (0.48)        342           1
test_aug_2d_plasma_shadow[5-3-128-128-cuda-float32-eager]        2.9192 (2.08)     3.5921 (2.47)     2.9559 (2.07)     0.0423 (3.83)     2.9517 (2.10)     0.0153 (1.50)         7;14  338.3075 (0.48)        340           1
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_posterize
------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_posterize': 8 tests -------------------------------------------------------------------------------------------------
Name (time in us)                                                   Min                   Max                  Mean             StdDev                Median                 IQR            Outliers         OPS            Rounds  Iterations
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_posterize[1-1-128-128-cuda-float32-eager]          544.4510 (1.0)        847.6720 (1.21)       655.9942 (1.0)      35.7982 (1.05)       665.1000 (1.0)       12.0797 (1.23)      207;224  1,524.4037 (1.0)        1787           1
test_aug_2d_posterize[1-3-128-128-cuda-float32-eager]          545.9210 (1.00)       898.0310 (1.28)       659.3928 (1.01)     34.6273 (1.01)       665.4110 (1.00)       9.8400 (1.0)       246;305  1,516.5467 (0.99)       1794           1
test_aug_2d_posterize[1-3-128-128-cuda-float32-inductor]       549.4110 (1.01)       895.7010 (1.28)       670.2056 (1.02)     34.1270 (1.0)        677.2910 (1.02)      13.0250 (1.32)      193;202  1,492.0795 (0.98)       1759           1
test_aug_2d_posterize[1-1-128-128-cuda-float32-inductor]       565.5510 (1.04)       701.7110 (1.0)        670.9233 (1.02)     46.7212 (1.37)       684.5410 (1.03)      17.1300 (1.74)          2;2  1,490.4833 (0.98)         13           1
test_aug_2d_posterize[5-1-128-128-cuda-float32-eager]        1,199.6220 (2.20)     2,017.4230 (2.88)     1,474.1478 (2.25)     85.8354 (2.52)     1,505.3820 (2.26)     106.0395 (10.78)      114;22    678.3580 (0.44)        839           1
test_aug_2d_posterize[5-3-128-128-cuda-float32-eager]        1,202.6410 (2.21)     2,046.2230 (2.92)     1,473.6508 (2.25)     83.9504 (2.46)     1,513.8920 (2.28)     106.4555 (10.82)       92;12    678.5868 (0.45)        760           1
test_aug_2d_posterize[5-3-128-128-cuda-float32-inductor]     1,445.5920 (2.66)     2,505.7530 (3.57)     1,915.3211 (2.92)     93.4505 (2.74)     1,955.2475 (2.94)     118.0505 (12.00)       81;11    522.1057 (0.34)        588           1
test_aug_2d_posterize[5-1-128-128-cuda-float32-inductor]     1,881.3430 (3.46)     2,033.6740 (2.90)     1,977.3479 (3.01)     57.1343 (1.67)     2,003.0130 (3.01)     101.6045 (10.33)         3;0    505.7279 (0.33)         12           1
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_rain
---------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_rain': 8 tests -----------------------------------------------------------------------------------------------------
Name (time in us)                                              Min                    Max                   Mean                 StdDev                Median                    IQR            Outliers       OPS            Rounds  Iterations
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_rain[1-3-128-128-cuda-float32-inductor]       900.9710 (1.0)       2,411.9840 (1.14)      1,220.7531 (1.0)         186.7052 (1.0)      1,218.8415 (1.0)         315.0990 (1.01)        455;2  819.1665 (1.0)        1086           1
test_aug_2d_rain[1-1-128-128-cuda-float32-eager]          921.8510 (1.02)      2,110.3830 (1.0)       1,278.8340 (1.05)        193.7017 (1.04)     1,274.8410 (1.05)        313.3010 (1.0)         424;1  781.9623 (0.95)       1086           1
test_aug_2d_rain[1-3-128-128-cuda-float32-eager]          935.5920 (1.04)      2,277.1030 (1.08)      1,266.1508 (1.04)        189.0465 (1.01)     1,264.1720 (1.04)        323.2445 (1.03)        436;2  789.7954 (0.96)       1045           1
test_aug_2d_rain[1-1-128-128-cuda-float32-inductor]     1,223.7520 (1.36)     50,638.7630 (24.00)    11,317.4164 (9.27)     21,982.5126 (117.74)   1,465.5220 (1.20)     12,693.6370 (40.52)         1;1   88.3594 (0.11)          5           1
test_aug_2d_rain[5-3-128-128-cuda-float32-inductor]     2,610.8750 (2.90)      4,877.3060 (2.31)      3,813.1879 (3.12)        424.0021 (2.27)     3,816.9205 (3.13)        626.4255 (2.00)        120;0  262.2478 (0.32)        352           1
test_aug_2d_rain[5-1-128-128-cuda-float32-eager]        2,652.0040 (2.94)      4,977.0470 (2.36)      3,775.9582 (3.09)        396.5977 (2.12)     3,777.4150 (3.10)        573.2955 (1.83)        122;1  264.8334 (0.32)        377           1
test_aug_2d_rain[5-3-128-128-cuda-float32-eager]        2,657.8340 (2.95)      5,461.6380 (2.59)      3,761.3305 (3.08)        426.4610 (2.28)     3,763.7315 (3.09)        627.2150 (2.00)        128;1  265.8634 (0.32)        388           1
test_aug_2d_rain[5-1-128-128-cuda-float32-inductor]     3,915.9370 (4.35)     54,272.4800 (25.72)    14,241.6508 (11.67)    22,382.1721 (119.88)   4,019.1260 (3.30)     13,318.7365 (42.51)         1;1   70.2166 (0.09)          5           1
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_resize
-------------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_resize': 8 tests ---------------------------------------------------------------------------------------------------------
Name (time in us)                                                Min                       Max                    Mean                  StdDev                Median                     IQR            Outliers         OPS            Rounds  Iterations
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_resize[1-1-128-128-cuda-float32-inductor]       745.0410 (1.0)      1,725,492.0050 (>1000.0)  345,712.5876 (373.76)   771,320.1435 (>1000.0)    750.8210 (1.0)      431,243.6293 (>1000.0)       1;1      2.8926 (0.00)          5           1
test_aug_2d_resize[1-3-128-128-cuda-float32-inductor]       761.8000 (1.02)       905,267.3290 (883.13)   181,686.4244 (196.43)   404,494.0245 (>1000.0)    780.0510 (1.04)     226,199.7773 (>1000.0)       1;1      5.5040 (0.01)          5           1
test_aug_2d_resize[1-1-128-128-cuda-float32-eager]          909.7310 (1.22)         1,025.0710 (1.0)          924.9625 (1.0)            9.2248 (1.12)       923.5110 (1.23)           9.0875 (1.0)        172;29  1,081.1249 (1.0)        1093           1
test_aug_2d_resize[1-3-128-128-cuda-float32-eager]          925.1910 (1.24)         1,027.3310 (1.00)         940.7511 (1.02)           8.2356 (1.0)        939.3610 (1.25)           9.3775 (1.03)       251;29  1,062.9804 (0.98)       1071           1
test_aug_2d_resize[5-1-128-128-cuda-float32-inductor]     1,099.7210 (1.48)     1,747,825.1890 (>1000.0)  350,470.3590 (378.90)   781,145.0981 (>1000.0)  1,121.7010 (1.49)     436,746.2720 (>1000.0)       1;1      2.8533 (0.00)          5           1
test_aug_2d_resize[5-3-128-128-cuda-float32-inductor]     1,122.5420 (1.51)       921,933.6120 (899.39)   185,314.5116 (200.35)   411,782.5982 (>1000.0)  1,148.9310 (1.53)     230,285.3657 (>1000.0)       1;1      5.3962 (0.00)          5           1
test_aug_2d_resize[5-3-128-128-cuda-float32-eager]        1,259.4520 (1.69)         1,470.3120 (1.43)       1,284.5796 (1.39)          13.6232 (1.65)     1,283.4010 (1.71)          11.0500 (1.22)        97;20    778.4648 (0.72)        789           1
test_aug_2d_resize[5-1-128-128-cuda-float32-eager]        1,262.5910 (1.69)         1,456.0620 (1.42)       1,284.6195 (1.39)          14.2223 (1.73)     1,282.5820 (1.71)          11.4485 (1.26)        99;26    778.4406 (0.72)        789           1
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_resized_crop
--------------------------------------------------------------------------------------- benchmark 'test_aug_2d_resized_crop': 8 tests ---------------------------------------------------------------------------------------
Name (time in ms)                                                  Min                Max              Mean            StdDev            Median               IQR            Outliers       OPS            Rounds  Iterations
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_resized_crop[1-1-128-128-cuda-float32-inductor]     1.5426 (1.0)       1.7593 (1.07)     1.5896 (1.00)     0.0448 (2.29)     1.5839 (1.0)      0.0149 (1.0)           2;3  629.1055 (1.00)         18           1
test_aug_2d_resized_crop[1-3-128-128-cuda-float32-inductor]     1.5475 (1.00)      1.6389 (1.0)      1.5890 (1.0)      0.0196 (1.0)      1.5893 (1.00)     0.0204 (1.37)          4;1  629.3395 (1.0)          18           1
test_aug_2d_resized_crop[1-1-128-128-cuda-float32-eager]        1.8047 (1.17)     41.7881 (25.50)    1.9001 (1.20)     1.6966 (86.66)    1.8231 (1.15)     0.0171 (1.15)         1;19  526.3010 (0.84)        555           1
test_aug_2d_resized_crop[1-3-128-128-cuda-float32-eager]        1.8084 (1.17)      2.2460 (1.37)     1.8328 (1.15)     0.0229 (1.17)     1.8284 (1.15)     0.0152 (1.02)        37;27  545.5986 (0.87)        561           1
test_aug_2d_resized_crop[5-3-128-128-cuda-float32-inductor]     2.0081 (1.30)      2.7067 (1.65)     2.1650 (1.36)     0.0656 (3.35)     2.1708 (1.37)     0.0918 (6.18)        159;1  461.8919 (0.73)        485           1
test_aug_2d_resized_crop[5-1-128-128-cuda-float32-inductor]     2.0352 (1.32)      2.4597 (1.50)     2.1698 (1.37)     0.0579 (2.96)     2.1761 (1.37)     0.0805 (5.42)        146;4  460.8682 (0.73)        479           1
test_aug_2d_resized_crop[5-3-128-128-cuda-float32-eager]        2.1734 (1.41)      2.8220 (1.72)     2.2568 (1.42)     0.0599 (3.06)     2.2773 (1.44)     0.0942 (6.34)        125;4  443.1135 (0.70)        461           1
test_aug_2d_resized_crop[5-1-128-128-cuda-float32-eager]        2.1794 (1.41)      2.8406 (1.73)     2.2750 (1.43)     0.0675 (3.45)     2.2875 (1.44)     0.0865 (5.83)         93;8  439.5686 (0.70)        458           1
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_rgb_shift
---------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_rgb_shift': 4 tests ---------------------------------------------------------------------------------------------
Name (time in us)                                                 Min                 Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_rgb_shift[1-3-128-128-cuda-float32-inductor]     455.5410 (1.0)      479.5610 (1.0)      464.0623 (1.0)       8.3643 (1.0)      460.8900 (1.0)      10.3700 (1.54)          2;0        2.1549 (1.0)           7           1
test_aug_2d_rgb_shift[5-3-128-128-cuda-float32-inductor]     457.0300 (1.00)     486.8910 (1.02)     468.7034 (1.01)     10.4091 (1.24)     467.9900 (1.02)     14.4672 (2.14)          2;0        2.1335 (0.99)          7           1
test_aug_2d_rgb_shift[5-3-128-128-cuda-float32-eager]        473.2700 (1.04)     792.9610 (1.65)     485.5908 (1.05)     14.7517 (1.76)     483.0910 (1.05)      7.1523 (1.06)      139;164        2.0593 (0.96)       2085           1
test_aug_2d_rgb_shift[1-3-128-128-cuda-float32-eager]        476.0700 (1.05)     822.4110 (1.71)     489.9706 (1.06)     15.7405 (1.88)     487.3010 (1.06)      6.7500 (1.0)       114;165        2.0409 (0.95)       2058           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_rotation
------------------------------------------------------------------------------------------------ benchmark 'test_aug_2d_rotation': 8 tests -------------------------------------------------------------------------------------------------
Name (time in us)                                                  Min                   Max                  Mean             StdDev                Median                IQR            Outliers         OPS            Rounds  Iterations
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_rotation[5-3-128-128-cuda-float32-inductor]       827.7910 (1.0)        995.5920 (1.06)       848.4416 (1.0)      14.4554 (1.21)       844.6710 (1.0)      11.2100 (1.11)      201;118  1,178.6315 (1.0)        1155           1
test_aug_2d_rotation[1-1-128-128-cuda-float32-inductor]       827.8510 (1.00)     1,086.1510 (1.15)       852.8331 (1.01)     16.2051 (1.36)       848.5710 (1.00)     17.3375 (1.72)       250;41  1,172.5624 (0.99)       1191           1
test_aug_2d_rotation[1-3-128-128-cuda-float32-inductor]       831.9710 (1.01)     1,075.8310 (1.14)       856.0516 (1.01)     17.3208 (1.45)       849.8610 (1.01)     22.7700 (2.25)        290;7  1,168.1539 (0.99)       1187           1
test_aug_2d_rotation[5-1-128-128-cuda-float32-inductor]       833.0010 (1.01)       941.2410 (1.0)        854.6100 (1.01)     11.9350 (1.0)        852.6110 (1.01)     12.4825 (1.24)       296;47  1,170.1244 (0.99)       1179           1
test_aug_2d_rotation[5-3-128-128-cuda-float32-eager]        1,291.3520 (1.56)     1,735.1420 (1.84)     1,310.7807 (1.54)     25.9802 (2.18)     1,305.2720 (1.55)     10.8510 (1.07)        38;52    762.9041 (0.65)        774           1
test_aug_2d_rotation[5-1-128-128-cuda-float32-eager]        1,295.2410 (1.56)     1,753.6530 (1.86)     1,315.2656 (1.55)     21.4543 (1.80)     1,311.6715 (1.55)     14.9000 (1.48)        33;21    760.3027 (0.65)        770           1
test_aug_2d_rotation[1-1-128-128-cuda-float32-eager]        1,314.0310 (1.59)     1,631.5620 (1.73)     1,333.6464 (1.57)     16.1689 (1.35)     1,330.0860 (1.57)     15.9300 (1.58)         77;9    749.8240 (0.64)        758           1
test_aug_2d_rotation[1-3-128-128-cuda-float32-eager]        1,317.3320 (1.59)     1,919.5020 (2.04)     1,335.0880 (1.57)     32.0912 (2.69)     1,330.9620 (1.58)     10.0993 (1.0)         14;22    749.0143 (0.64)        759           1
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_salt_and_peper_noise
------------------------------------------------------------------------------------------------ benchmark 'test_aug_2d_salt_and_peper_noise': 8 tests -------------------------------------------------------------------------------------------------
Name (time in us)                                                              Min                   Max                  Mean             StdDev                Median                IQR            Outliers         OPS            Rounds  Iterations
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_salt_and_peper_noise[1-1-128-128-cuda-float32-eager]          686.1210 (1.0)        968.9910 (1.30)       702.4374 (1.0)      12.8204 (2.23)       699.5860 (1.0)       9.0550 (1.27)       149;96  1,423.6144 (1.0)        1448           1
test_aug_2d_salt_and_peper_noise[5-1-128-128-cuda-float32-inductor]       723.3610 (1.05)       766.5210 (1.03)       742.3932 (1.06)     15.9116 (2.76)       739.9520 (1.06)     18.9275 (2.66)          2;0  1,346.9951 (0.95)          5           1
test_aug_2d_salt_and_peper_noise[1-1-128-128-cuda-float32-inductor]       728.0010 (1.06)       745.0120 (1.0)        736.6992 (1.05)      6.1670 (1.07)       737.0810 (1.05)      7.1027 (1.0)           2;0  1,357.4061 (0.95)          5           1
test_aug_2d_salt_and_peper_noise[1-3-128-128-cuda-float32-eager]          732.2810 (1.07)     1,064.6620 (1.43)       752.6563 (1.07)     29.9299 (5.20)       744.7710 (1.06)     13.3122 (1.87)       73;128  1,328.6277 (0.93)       1351           1
test_aug_2d_salt_and_peper_noise[1-3-128-128-cuda-float32-inductor]       733.9610 (1.07)       764.9510 (1.03)       748.2850 (1.07)     11.4184 (1.98)       749.7510 (1.07)     13.4775 (1.90)          2;0  1,336.3892 (0.94)          5           1
test_aug_2d_salt_and_peper_noise[5-3-128-128-cuda-float32-inductor]       769.1910 (1.12)       782.2510 (1.05)       775.5494 (1.10)      5.7601 (1.0)        775.1810 (1.11)     10.5550 (1.49)          2;0  1,289.4085 (0.91)          5           1
test_aug_2d_salt_and_peper_noise[5-1-128-128-cuda-float32-eager]        1,173.3720 (1.71)     1,760.4620 (2.36)     1,200.9675 (1.71)     35.1100 (6.10)     1,195.1415 (1.71)     17.4150 (2.45)        30;32    832.6620 (0.58)        848           1
test_aug_2d_salt_and_peper_noise[5-3-128-128-cuda-float32-eager]        1,240.4820 (1.81)     1,548.6920 (2.08)     1,268.2790 (1.81)     23.4953 (4.08)     1,264.7120 (1.81)     17.0650 (2.40)        51;28    788.4700 (0.55)        799           1
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_saturation
---------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_saturation': 4 tests -----------------------------------------------------------------------------------------------
Name (time in us)                                                  Min                   Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_saturation[1-3-128-128-cuda-float32-inductor]     496.5600 (1.0)        814.0420 (1.0)      514.0862 (1.0)      14.8142 (1.30)     511.9555 (1.0)      11.5805 (1.44)       168;46        1.9452 (1.0)        1976           1
test_aug_2d_saturation[5-3-128-128-cuda-float32-inductor]     502.2810 (1.01)       964.1520 (1.18)     518.3586 (1.01)     25.9197 (2.28)     513.9210 (1.00)      8.4557 (1.06)       54;170        1.9292 (0.99)       1951           1
test_aug_2d_saturation[1-3-128-128-cuda-float32-eager]        871.0610 (1.75)     1,374.8910 (1.69)     890.6048 (1.73)     22.2389 (1.95)     887.4310 (1.73)     11.8032 (1.47)        22;23        1.1228 (0.58)       1141           1
test_aug_2d_saturation[5-3-128-128-cuda-float32-eager]        876.1810 (1.76)     1,067.2210 (1.31)     891.0892 (1.73)     11.3871 (1.0)      888.8365 (1.74)      8.0145 (1.0)         97;50        1.1222 (0.58)       1128           1
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_sharpness
------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_sharpness': 8 tests ------------------------------------------------------------------------------------------------
Name (time in us)                                                   Min                   Max                  Mean             StdDev                Median                IQR            Outliers         OPS            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_sharpness[1-1-128-128-cuda-float32-inductor]       617.3610 (1.0)        645.8810 (1.0)        631.3156 (1.0)       8.2029 (1.0)        631.9110 (1.0)       9.3200 (1.17)          4;0  1,583.9938 (1.0)          13           1
test_aug_2d_sharpness[1-3-128-128-cuda-float32-inductor]       619.5910 (1.00)     1,075.2020 (1.66)       637.5980 (1.01)     21.2227 (2.59)       635.3410 (1.01)      9.0450 (1.14)        33;44  1,568.3864 (0.99)       1575           1
test_aug_2d_sharpness[1-3-128-128-cuda-float32-eager]          730.8010 (1.18)     1,332.3820 (2.06)       747.6619 (1.18)     27.4684 (3.35)       744.4105 (1.18)      7.9600 (1.0)         26;55  1,337.5029 (0.84)       1356           1
test_aug_2d_sharpness[1-1-128-128-cuda-float32-eager]          739.9110 (1.20)     1,344.2020 (2.08)       760.2948 (1.20)     29.0746 (3.54)       756.8310 (1.20)     10.0265 (1.26)        25;47  1,315.2792 (0.83)       1337           1
test_aug_2d_sharpness[5-3-128-128-cuda-float32-inductor]     1,058.5210 (1.71)     1,445.1020 (2.24)     1,083.8110 (1.72)     19.0335 (2.32)     1,081.9820 (1.71)     14.2623 (1.79)        72;19    922.6701 (0.58)        929           1
test_aug_2d_sharpness[5-1-128-128-cuda-float32-inductor]     1,083.8010 (1.76)     1,121.9820 (1.74)     1,100.0039 (1.74)     10.6281 (1.30)     1,099.6010 (1.74)     11.1883 (1.41)          4;1    909.0877 (0.57)         13           1
test_aug_2d_sharpness[5-3-128-128-cuda-float32-eager]        1,140.3020 (1.85)     1,637.3920 (2.54)     1,173.6445 (1.86)     36.6594 (4.47)     1,169.2715 (1.85)     21.9850 (2.76)        22;19    852.0468 (0.54)        868           1
test_aug_2d_sharpness[5-1-128-128-cuda-float32-eager]        1,148.9320 (1.86)     1,545.3830 (2.39)     1,174.0437 (1.86)     22.1284 (2.70)     1,170.1210 (1.85)     15.8150 (1.99)        40;19    851.7571 (0.54)        860           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_shear
------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_shear': 8 tests ------------------------------------------------------------------------------------------------
Name (time in us)                                               Min                   Max                  Mean             StdDev                Median                IQR            Outliers         OPS            Rounds  Iterations
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_shear[1-3-128-128-cuda-float32-inductor]       808.0110 (1.0)      1,035.7710 (1.23)       852.0454 (1.03)     24.2029 (4.34)       850.9160 (1.03)     36.1600 (9.25)        366;3  1,173.6464 (0.97)       1142           1
test_aug_2d_shear[5-3-128-128-cuda-float32-inductor]       822.8910 (1.02)       969.5720 (1.15)       841.7723 (1.01)      9.3576 (1.68)       840.7110 (1.01)      9.3050 (2.38)       214;31  1,187.9697 (0.99)       1068           1
test_aug_2d_shear[1-1-128-128-cuda-float32-inductor]       825.3810 (1.02)       844.1410 (1.0)        830.5389 (1.0)       5.5784 (1.0)        828.8520 (1.0)       3.9100 (1.0)           1;1  1,204.0375 (1.0)           9           1
test_aug_2d_shear[5-1-128-128-cuda-float32-inductor]       855.4310 (1.06)       930.1110 (1.10)       870.9085 (1.05)     24.5044 (4.39)       862.9610 (1.04)      9.2600 (2.37)          1;1  1,148.2262 (0.95)          8           1
test_aug_2d_shear[1-1-128-128-cuda-float32-eager]        1,146.2820 (1.42)     2,102.9030 (2.49)     1,178.7522 (1.42)     69.1044 (12.39)    1,169.8910 (1.41)      9.3607 (2.39)        15;38    848.3547 (0.70)        863           1
test_aug_2d_shear[5-3-128-128-cuda-float32-eager]        1,154.9310 (1.43)     1,520.5620 (1.80)     1,169.9414 (1.41)     15.2892 (2.74)     1,167.6865 (1.41)      9.1855 (2.35)        37;28    854.7437 (0.71)        864           1
test_aug_2d_shear[5-1-128-128-cuda-float32-eager]        1,155.5820 (1.43)     1,589.6320 (1.88)     1,169.7825 (1.41)     20.6701 (3.71)     1,167.3510 (1.41)      9.5755 (2.45)        21;30    854.8598 (0.71)        864           1
test_aug_2d_shear[1-3-128-128-cuda-float32-eager]        1,165.4210 (1.44)     1,586.1930 (1.88)     1,180.4778 (1.42)     23.2862 (4.17)     1,177.2465 (1.42)      9.9405 (2.54)        23;41    847.1146 (0.70)        856           1
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_snow
------------------------------------------------------------------------------------------------ benchmark 'test_aug_2d_snow': 3 tests -------------------------------------------------------------------------------------------------
Name (time in us)                                              Min                   Max                  Mean             StdDev                Median                IQR            Outliers         OPS            Rounds  Iterations
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_snow[1-3-128-128-cuda-float32-inductor]       492.5700 (1.0)        705.6810 (1.0)        507.2558 (1.0)       9.7536 (1.0)        505.6810 (1.0)       6.7375 (1.0)        183;99  1,971.3918 (1.0)        1987           1
test_aug_2d_snow[1-3-128-128-cuda-float32-eager]        1,019.5810 (2.07)     1,607.4720 (2.28)     1,044.2518 (2.06)     33.6574 (3.45)     1,037.8860 (2.05)     14.5700 (2.16)        29;66    957.6234 (0.49)       1030           1
test_aug_2d_snow[5-3-128-128-cuda-float32-eager]        1,107.0220 (2.25)     1,229.3520 (1.74)     1,126.1409 (2.22)     11.5759 (1.19)     1,124.1115 (2.22)     10.5410 (1.56)       138;47    887.9884 (0.45)        898           1
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_solarize
---------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_solarize': 8 tests -----------------------------------------------------------------------------------------------
Name (time in us)                                                Min                   Max                Mean             StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_solarize[1-1-128-128-cuda-float32-inductor]     447.2410 (1.0)        816.9520 (1.04)     460.8954 (1.0)      13.8522 (1.19)     458.0610 (1.0)       8.4500 (1.0)       194;173        2.1697 (1.0)        2178           1
test_aug_2d_solarize[1-3-128-128-cuda-float32-inductor]     452.5400 (1.01)       782.6110 (1.0)      467.4728 (1.01)     15.8601 (1.36)     464.2300 (1.01)     10.9425 (1.29)       160;97        2.1392 (0.99)       2133           1
test_aug_2d_solarize[5-3-128-128-cuda-float32-inductor]     581.0610 (1.30)       835.0710 (1.07)     599.2533 (1.30)     15.9741 (1.37)     594.7810 (1.30)     11.8200 (1.40)      162;117        1.6687 (0.77)       1658           1
test_aug_2d_solarize[5-1-128-128-cuda-float32-inductor]     581.7010 (1.30)     1,004.7810 (1.28)     599.9223 (1.30)     17.2716 (1.48)     597.7110 (1.30)      8.9547 (1.06)        47;49        1.6669 (0.77)       1649           1
test_aug_2d_solarize[1-1-128-128-cuda-float32-eager]        587.8110 (1.31)       829.7210 (1.06)     604.0290 (1.31)     11.6408 (1.0)      601.7160 (1.31)     10.4100 (1.23)       233;58        1.6555 (0.76)       1662           1
test_aug_2d_solarize[1-3-128-128-cuda-float32-eager]        594.7710 (1.33)       969.8610 (1.24)     610.1335 (1.32)     16.4800 (1.42)     607.2605 (1.33)     10.7200 (1.27)        97;57        1.6390 (0.76)       1652           1
test_aug_2d_solarize[5-3-128-128-cuda-float32-eager]        601.6810 (1.35)     1,005.5110 (1.28)     617.0499 (1.34)     16.5033 (1.42)     614.1905 (1.34)     10.5500 (1.25)        93;54        1.6206 (0.75)       1634           1
test_aug_2d_solarize[5-1-128-128-cuda-float32-eager]        608.8410 (1.36)       915.5910 (1.17)     624.2874 (1.35)     20.2561 (1.74)     620.5460 (1.35)      9.0000 (1.07)       81;111        1.6018 (0.74)       1620           1
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_thin_plate_spline
----------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_thin_plate_spline': 8 tests -----------------------------------------------------------------------------------------------
Name (time in us)                                                         Min                   Max                Mean              StdDev              Median                IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_thin_plate_spline[1-3-128-128-cuda-float32-inductor]     577.3310 (1.0)        815.8610 (1.34)     593.4207 (1.0)       11.7706 (2.22)     590.4710 (1.0)      13.7510 (2.15)       247;24        1.6851 (1.0)        1682           1
test_aug_2d_thin_plate_spline[5-3-128-128-cuda-float32-inductor]     583.6810 (1.01)       830.8710 (1.36)     601.4078 (1.01)      10.7981 (2.03)     599.9400 (1.02)     11.8277 (1.85)       326;18        1.6628 (0.99)       1687           1
test_aug_2d_thin_plate_spline[1-1-128-128-cuda-float32-inductor]     589.4210 (1.02)       608.9710 (1.0)      596.0473 (1.00)       5.3073 (1.0)      596.2810 (1.01)      6.4100 (1.0)           4;1        1.6777 (1.00)         14           1
test_aug_2d_thin_plate_spline[5-1-128-128-cuda-float32-inductor]     599.5400 (1.04)     1,093.2920 (1.80)     654.0885 (1.10)     132.8227 (25.03)    614.5800 (1.04)     14.7120 (2.30)          1;2        1.5288 (0.91)         13           1
test_aug_2d_thin_plate_spline[1-3-128-128-cuda-float32-eager]        919.0910 (1.59)     1,312.1910 (2.15)     932.9390 (1.57)      15.4028 (2.90)     930.3510 (1.58)      8.9303 (1.39)        56;50        1.0719 (0.64)       1081           1
test_aug_2d_thin_plate_spline[1-1-128-128-cuda-float32-eager]        920.5010 (1.59)     1,198.6410 (1.97)     933.6605 (1.57)      14.6685 (2.76)     931.3110 (1.58)      8.3450 (1.30)        43;37        1.0711 (0.64)       1079           1
test_aug_2d_thin_plate_spline[5-1-128-128-cuda-float32-eager]        921.1510 (1.60)     1,516.0620 (2.49)     938.7986 (1.58)      27.8778 (5.25)     935.2310 (1.58)      9.0675 (1.41)        15;32        1.0652 (0.63)       1079           1
test_aug_2d_thin_plate_spline[5-3-128-128-cuda-float32-eager]        925.4610 (1.60)     1,470.3720 (2.41)     940.7696 (1.59)      22.8356 (4.30)     937.9710 (1.59)      8.8200 (1.38)        18;28        1.0630 (0.63)       1073           1
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_translate
------------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_translate': 8 tests ------------------------------------------------------------------------------------------------
Name (time in us)                                                   Min                   Max                  Mean             StdDev                Median                IQR            Outliers         OPS            Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_translate[1-3-128-128-cuda-float32-inductor]       740.0410 (1.0)        988.3420 (1.20)       757.1736 (1.0)      12.6937 (1.52)       755.2710 (1.0)       9.4200 (1.03)       103;38  1,320.7010 (1.0)        1330           1
test_aug_2d_translate[5-3-128-128-cuda-float32-inductor]       741.5710 (1.00)       925.8910 (1.13)       765.6690 (1.01)     15.9175 (1.91)       760.6710 (1.01)     19.5127 (2.14)       357;26  1,306.0474 (0.99)       1281           1
test_aug_2d_translate[1-1-128-128-cuda-float32-inductor]       756.6800 (1.02)       832.3910 (1.01)       774.6398 (1.02)     23.3457 (2.80)       763.7810 (1.01)     18.7625 (2.06)          1;1  1,290.9226 (0.98)          9           1
test_aug_2d_translate[5-1-128-128-cuda-float32-inductor]       796.4710 (1.08)       821.7410 (1.0)        805.9109 (1.06)      8.3309 (1.0)        806.0310 (1.07)     12.0682 (1.32)          2;0  1,240.8320 (0.94)          9           1
test_aug_2d_translate[5-1-128-128-cuda-float32-eager]        1,007.4410 (1.36)     1,503.4520 (1.83)     1,024.6314 (1.35)     22.2071 (2.67)     1,020.9920 (1.35)      9.1125 (1.0)         29;43    975.9607 (0.74)        989           1
test_aug_2d_translate[5-3-128-128-cuda-float32-eager]        1,016.3610 (1.37)     1,756.8020 (2.14)     1,038.9757 (1.37)     47.3812 (5.69)     1,028.4160 (1.36)     12.5250 (1.37)        27;96    962.4864 (0.73)        944           1
test_aug_2d_translate[1-3-128-128-cuda-float32-eager]        1,020.7710 (1.38)     1,530.2920 (1.86)     1,040.3963 (1.37)     26.4376 (3.17)     1,035.3410 (1.37)     10.1500 (1.11)        67;87    961.1722 (0.73)        972           1
test_aug_2d_translate[1-1-128-128-cuda-float32-eager]        1,024.1210 (1.38)     1,292.3420 (1.57)     1,042.2057 (1.38)     20.6832 (2.48)     1,037.4860 (1.37)     10.5500 (1.16)        70;80    959.5035 (0.73)        970           1
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_vertical_flip
--------------------------------------------------------------------------------------------- benchmark 'test_aug_2d_vertical_flip': 8 tests ---------------------------------------------------------------------------------------------
Name (time in us)                                                     Min                 Max                Mean             StdDev              Median               IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_aug_2d_vertical_flip[1-3-128-128-cuda-float32-inductor]     352.8800 (1.0)      458.7700 (1.09)     364.5742 (1.0)       7.3625 (1.41)     362.0000 (1.0)      9.9980 (2.09)       643;40        2.7429 (1.0)        2697           1
test_aug_2d_vertical_flip[1-1-128-128-cuda-float32-inductor]     355.7500 (1.01)     437.9710 (1.04)     370.4122 (1.02)      6.5317 (1.25)     369.3055 (1.02)     6.3310 (1.32)      500;130        2.6997 (0.98)       2698           1
test_aug_2d_vertical_flip[1-1-128-128-cuda-float32-eager]        357.9200 (1.01)     586.3610 (1.40)     368.3578 (1.01)      7.8930 (1.51)     366.8210 (1.01)     4.7810 (1.0)       233;196        2.7148 (0.99)       2734           1
test_aug_2d_vertical_flip[5-3-128-128-cuda-float32-eager]        358.0600 (1.01)     687.9510 (1.64)     372.1532 (1.02)     10.0820 (1.93)     370.7705 (1.02)     5.2010 (1.09)      192;183        2.6871 (0.98)       2702           1
test_aug_2d_vertical_flip[5-1-128-128-cuda-float32-eager]        358.9510 (1.02)     583.8600 (1.39)     367.4583 (1.01)      7.4469 (1.43)     366.0410 (1.01)     5.2190 (1.09)      235;158        2.7214 (0.99)       2722           1
test_aug_2d_vertical_flip[5-3-128-128-cuda-float32-inductor]     364.6300 (1.03)     420.0300 (1.0)      373.9198 (1.03)      5.2179 (1.0)      372.7210 (1.03)     5.0305 (1.05)      589;172        2.6744 (0.98)       2700           1
test_aug_2d_vertical_flip[1-3-128-128-cuda-float32-eager]        365.4000 (1.04)     544.0200 (1.30)     375.7093 (1.03)      8.1574 (1.56)     374.1910 (1.03)     6.0780 (1.27)      234;104        2.6616 (0.97)       2669           1
test_aug_2d_vertical_flip[5-1-128-128-cuda-float32-inductor]     373.3600 (1.06)     443.7900 (1.06)     384.6589 (1.06)      5.5279 (1.06)     383.5000 (1.06)     5.6655 (1.19)      571;113        2.5997 (0.95)       2600           1
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_grayscale_to_rgb
-------------------------------------------------------------------------------------------- benchmark 'test_grayscale_to_rgb': 36 tests --------------------------------------------------------------------------------------------
Name (time in us)                                                Min                   Max               Mean             StdDev             Median               IQR            Outliers  OPS (Kops/s)            Rounds  Iterations
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
test_grayscale_to_rgb[cuda-float32-eager-512-128-1-1]        26.6700 (1.0)        890.1820 (12.80)    28.8678 (1.02)      5.8325 (4.29)     28.2500 (1.00)     0.6100 (1.15)    1508;2828       34.6407 (0.98)      34917           1
test_grayscale_to_rgb[cuda-float32-eager-128-128-1-1]        26.7300 (1.00)     1,438.5120 (20.68)    29.5341 (1.04)     12.8892 (9.49)     28.6800 (1.02)     0.7300 (1.38)     732;1833       33.8592 (0.96)      34365           1
test_grayscale_to_rgb[cuda-float32-eager-237-512-1-1]        26.7700 (1.00)        73.1200 (1.05)     28.4221 (1.00)      1.4965 (1.10)     28.2100 (1.0)      0.6390 (1.21)    1409;1678       35.1839 (1.00)      34989           1
test_grayscale_to_rgb[cuda-float32-eager-237-237-1-1]        26.7800 (1.00)        71.9400 (1.03)     28.4319 (1.00)      1.3948 (1.03)     28.2200 (1.00)     0.6100 (1.15)    1425;1707       35.1717 (1.00)      35224           1
test_grayscale_to_rgb[cuda-float32-eager-237-128-1-1]        26.7900 (1.00)        74.7200 (1.07)     28.4168 (1.0)       1.4885 (1.10)     28.2100 (1.0)      0.5600 (1.06)    1143;1679       35.1905 (1.0)       34271           1
test_grayscale_to_rgb[cuda-float32-eager-512-237-1-1]        26.8500 (1.01)       104.2310 (1.50)     28.5043 (1.00)      1.4939 (1.10)     28.2900 (1.00)     0.6200 (1.17)    1361;1616       35.0824 (1.00)      34687           1
test_grayscale_to_rgb[cuda-float32-eager-237-128-1-5]        26.8600 (1.01)        74.5800 (1.07)     28.4940 (1.00)      1.4594 (1.07)     28.2900 (1.00)     0.5700 (1.08)    1201;1585       35.0951 (1.00)      35175           1
test_grayscale_to_rgb[cuda-float32-eager-512-128-1-5]        26.9900 (1.01)        75.7700 (1.09)     28.7325 (1.01)      1.5720 (1.16)     28.5100 (1.01)     0.6100 (1.15)    1368;1650       34.8038 (0.99)      34844           1
test_grayscale_to_rgb[cuda-float32-eager-237-237-1-5]        27.0000 (1.01)        73.1000 (1.05)     28.7051 (1.01)      1.5484 (1.14)     28.4800 (1.01)     0.5700 (1.08)    1357;1762       34.8370 (0.99)      34711           1
test_grayscale_to_rgb[cuda-float32-eager-128-512-1-1]        27.0300 (1.01)        69.5600 (1.0)      28.7214 (1.01)      1.3588 (1.0)      28.5300 (1.01)     0.5300 (1.0)     1105;1525       34.8173 (0.99)      34663           1
test_grayscale_to_rgb[cuda-float32-eager-128-237-1-1]        27.0700 (1.01)        72.8700 (1.05)     28.7067 (1.01)      1.5061 (1.11)     28.5000 (1.01)     0.5900 (1.11)    1168;1601       34.8351 (0.99)      34783           1
test_grayscale_to_rgb[cuda-float32-eager-128-237-1-5]        27.1010 (1.02)        72.3600 (1.04)     28.7591 (1.01)      1.5649 (1.15)     28.5300 (1.01)     0.5400 (1.02)    1160;1640       34.7717 (0.99)      34759           1
test_grayscale_to_rgb[cuda-float32-eager-512-512-1-1]        27.1200 (1.02)       486.8110 (7.00)     28.8219 (1.01)      3.1752 (2.34)     28.5300 (1.01)     0.5500 (1.04)    1004;1908       34.6958 (0.99)      34978           1
test_grayscale_to_rgb[cuda-float32-eager-128-512-1-5]        27.1200 (1.02)        94.0410 (1.35)     28.8122 (1.01)      1.5692 (1.15)     28.5900 (1.01)     0.5700 (1.08)    1121;1602       34.7075 (0.99)      34567           1
test_grayscale_to_rgb[cuda-float32-eager-128-128-1-5]        27.1600 (1.02)        70.5300 (1.01)     29.0672 (1.02)      1.5515 (1.14)     28.8700 (1.02)     0.5800 (1.09)    1129;1453       34.4031 (0.98)      34783           1
test_grayscale_to_rgb[cuda-float32-eager-512-512-1-5]        27.6300 (1.04)       727.3710 (10.46)    77.1338 (2.71)     12.3444 (9.08)     78.7400 (2.79)     1.5200 (2.87)    1737;2897       12.9645 (0.37)      34771           1
test_grayscale_to_rgb[cuda-float32-eager-237-512-1-5]        27.7800 (1.04)       711.0910 (10.22)    36.3437 (1.28)      5.6190 (4.14)     37.0405 (1.31)     0.9400 (1.77)    4524;5612       27.5151 (0.78)      34868           1
test_grayscale_to_rgb[cuda-float32-eager-512-237-1-5]        27.9600 (1.05)       487.4810 (7.01)     36.3277 (1.28)      5.2587 (3.87)     37.0100 (1.31)     0.9700 (1.83)    4864;5940       27.5272 (0.78)      34471           1
test_grayscale_to_rgb[cuda-float32-inductor-128-128-1-1]     42.3900 (1.59)       103.0300 (1.48)     44.3324 (1.56)      2.5357 (1.87)     43.9000 (1.56)     0.7177 (1.35)     900;1564       22.5569 (0.64)      22387           1
test_grayscale_to_rgb[cuda-float32-inductor-237-237-1-1]     42.4500 (1.59)        85.4100 (1.23)     44.5150 (1.57)      1.8268 (1.34)     44.1700 (1.57)     0.9200 (1.74)     994;1047       22.4643 (0.64)      22599           1
test_grayscale_to_rgb[cuda-float32-inductor-128-512-1-5]     42.5000 (1.59)        70.4200 (1.01)     44.6345 (1.57)      1.9344 (1.42)     44.2800 (1.57)     0.5900 (1.11)    1055;1351       22.4042 (0.64)      22402           1
test_grayscale_to_rgb[cuda-float32-inductor-237-237-1-5]     42.5100 (1.59)        94.3200 (1.36)     44.8466 (1.58)      1.8247 (1.34)     44.5600 (1.58)     0.6600 (1.25)     945;1107       22.2982 (0.63)      22193           1
test_grayscale_to_rgb[cuda-float32-inductor-237-512-1-5]     42.5900 (1.60)       100.1000 (1.44)     44.4649 (1.56)      1.7650 (1.30)     44.2000 (1.57)     0.6300 (1.19)     928;1151       22.4896 (0.64)      22427           1
test_grayscale_to_rgb[cuda-float32-inductor-512-128-1-1]     42.6000 (1.60)        93.2500 (1.34)     44.5633 (1.57)      1.8825 (1.39)     44.2600 (1.57)     0.6300 (1.19)    1015;1213       22.4400 (0.64)      22498           1
test_grayscale_to_rgb[cuda-float32-inductor-128-237-1-1]     42.6510 (1.60)        88.1300 (1.27)     44.8303 (1.58)      3.1369 (2.31)     44.3400 (1.57)     0.6900 (1.30)     827;1382       22.3064 (0.63)      22656           1
test_grayscale_to_rgb[cuda-float32-inductor-512-237-1-5]     42.7500 (1.60)        93.0100 (1.34)     44.7148 (1.57)      1.9445 (1.43)     44.4100 (1.57)     0.5610 (1.06)     971;1192       22.3640 (0.64)      22372           1
test_grayscale_to_rgb[cuda-float32-inductor-128-237-1-5]     42.7600 (1.60)        87.5400 (1.26)     44.7373 (1.57)      1.9262 (1.42)     44.3700 (1.57)     0.6700 (1.26)    1022;1480       22.3527 (0.64)      21964           1
test_grayscale_to_rgb[cuda-float32-inductor-237-128-1-5]     42.9900 (1.61)        91.2500 (1.31)     44.9688 (1.58)      1.6989 (1.25)     44.7100 (1.58)     0.6700 (1.26)     927;1029       22.2376 (0.63)      21232           1
test_grayscale_to_rgb[cuda-float32-inductor-128-128-1-5]     43.0200 (1.61)        90.2600 (1.30)     45.1468 (1.59)      3.1177 (2.29)     44.6700 (1.58)     0.6800 (1.28)     799;1352       22.1500 (0.63)      22247           1
test_grayscale_to_rgb[cuda-float32-inductor-512-512-1-1]     43.0500 (1.61)       112.0600 (1.61)     45.1872 (1.59)      2.4862 (1.83)     44.8600 (1.59)     0.6300 (1.19)     875;1160       22.1302 (0.63)      22523           1
test_grayscale_to_rgb[cuda-float32-inductor-237-512-1-1]     43.1700 (1.62)        91.3500 (1.31)     45.0320 (1.58)      1.8643 (1.37)     44.7400 (1.59)     0.5793 (1.09)     994;1303       22.2064 (0.63)      22717           1
test_grayscale_to_rgb[cuda-float32-inductor-237-128-1-1]     43.3000 (1.62)       103.6600 (1.49)     45.1162 (1.59)      1.8642 (1.37)     44.8300 (1.59)     0.6000 (1.13)     970;1149       22.1650 (0.63)      22599           1
test_grayscale_to_rgb[cuda-float32-inductor-512-512-1-5]     43.3600 (1.63)       528.7400 (7.60)     56.0975 (1.97)      8.2730 (6.09)     58.1000 (2.06)     2.1700 (4.09)    4510;4852       17.8261 (0.51)      21916           1
test_grayscale_to_rgb[cuda-float32-inductor-512-128-1-5]     43.3700 (1.63)        99.9000 (1.44)     45.2174 (1.59)      2.1057 (1.55)     44.9000 (1.59)     0.6100 (1.15)     927;1277       22.1154 (0.63)      21792           1
test_grayscale_to_rgb[cuda-float32-inductor-512-237-1-1]     43.3700 (1.63)        94.4300 (1.36)     45.4151 (1.60)      1.8334 (1.35)     45.1300 (1.60)     0.6200 (1.17)     949;1126       22.0191 (0.63)      22085           1
test_grayscale_to_rgb[cuda-float32-inductor-128-512-1-1]     44.2800 (1.66)        91.4000 (1.31)     47.5148 (1.67)      2.0247 (1.49)     47.1800 (1.67)     0.8900 (1.68)    1397;1512       21.0461 (0.60)      22297           1
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

Legend:
Outliers: 1 Standard Deviation from Mean; 1.5 IQR (InterQuartile Range) from 1st Quartile and 3rd Quartile.
OPS: Operations Per Second, computed as 1 / Mean
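
The legend's OPS definition can be checked against the tables with a few lines of Python. This is only a sketch: the helper names `ops_per_second` and `relative_to_best` are made up for illustration and are not part of pytest-benchmark or kornia. It also assumes the figures in parentheses are ratios against the best value in each column, which is how the tables read.

```python
def ops_per_second(mean_us: float) -> float:
    """OPS as the legend defines it (1 / Mean), with the mean given in microseconds."""
    return 1.0 / (mean_us * 1e-6)


def relative_to_best(value: float, best: float) -> float:
    """Ratio against the best entry of a column (assumed meaning of the parentheses)."""
    return value / best


# Mean times (us) copied from the test_aug_2d_translate table above.
inductor_mean_us = 757.1736  # [1-3-128-128-cuda-float32-inductor]
eager_mean_us = 1042.2057    # [1-1-128-128-cuda-float32-eager]

print(f"inductor OPS: {ops_per_second(inductor_mean_us):.2f}")
print(f"eager OPS:    {ops_per_second(eager_mean_us):.2f}")
print(f"eager mean relative to best: {relative_to_best(eager_mean_us, inductor_mean_us):.2f}")
```

For tables whose OPS column is labelled `Kops/s`, divide the result by 1000: a mean of 364.5742 us gives about 2.74 Kops/s, in line with the `test_aug_2d_vertical_flip` row.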

@johnnv1 johnnv1 merged commit 46a5e40 into kornia:main Mar 27, 2024
27 checks passed
@johnnv1 johnnv1 deleted the feat/aug-benchs branch March 27, 2024 23:57
@johnnv1 johnnv1 mentioned this pull request Mar 29, 2024
cjpurackal pushed a commit to cjpurackal/kornia that referenced this pull request May 18, 2024
* feat: introduce `--optimizer` in the testing