Add ability to compare two different benchmark runs #4519

sadhansood · 2022-09-08T07:00:48Z

We can now publish benchmark results with different quantile latencies like (using a histogram library):

Benchmark Result
+-------------+-----+------------+-----------------+-----------------+-----------------+-----------------+-----------------+-----------------+-------------------+-----------------+
| duration(s) | tps | error_rate | latency_ms(min) | latency_ms(p25) | latency_ms(p50) | latency_ms(p75) | latency_ms(p90) | latency_ms(p99) | latency_ms(p99.9) | latency_ms(max) |
+==================================================================================================================================================================================+
| 300         | 675 | 0          | 0               | 148             | 413             | 1071            | 2111            | 5535            | 7775              | 8959            |
+-------------+-----+------------+-----------------+-----------------+-----------------+-----------------+-----------------+-----------------+-------------------+-----------------+

as well as store the benchmark results on disk. Also, we can compare current run with a previously stored result to test for regression, etc:

Benchmark Comparison Result[/tmp/prev_bench_result]
+--------------+------+------+------+------------+---------+
| name         | old  | new  | diff | diff_ratio | speedup |
+==========================================================+
| tps          | 739  | 675  | -64  | -8.66%     | 0.91x   |
|--------------+------+------+------+------------+---------|
| error_rate   | 0    | 0    | 0    | NaN%       | NaNx    |
|--------------+------+------+------+------------+---------|
| min_latency  | 0    | 0    | 0    | NaN%       | NaNx    |
|--------------+------+------+------+------------+---------|
| p25_latency  | 173  | 148  | -25  | -14.45%    | 1.17x   |
|--------------+------+------+------+------------+---------|
| p50_latency  | 409  | 413  | 4    | 0.98%      | 0.99x   |
|--------------+------+------+------+------------+---------|
| p75_latency  | 1103 | 1071 | -32  | -2.90%     | 1.03x   |
|--------------+------+------+------+------------+---------|
| p90_latency  | 2303 | 2111 | -192 | -8.34%     | 1.09x   |
|--------------+------+------+------+------------+---------|
| p99_latency  | 4799 | 5535 | 736  | 15.34%     | 0.87x   |
|--------------+------+------+------+------------+---------|
| p999_latency | 6239 | 7775 | 1536 | 24.62%     | 0.80x   |
|--------------+------+------+------+------------+---------|
| max_latency  | 7551 | 8959 | 1408 | 18.65%     | 0.84x   |
+--------------+------+------+------+------------+---------+

sadhansood force-pushed the sadhan/compare_benchmarks branch from 09323f1 to f1b590d Compare September 8, 2022 07:05

sadhansood changed the title ~~Sadhan/compare benchmarks~~ Add ability to compare two different benchmark runs Sep 8, 2022

sadhansood requested review from tharbert, bmwill, velvia and longbowlu September 8, 2022 07:09

sadhansood marked this pull request as ready for review September 8, 2022 07:09

sadhansood mentioned this pull request Sep 8, 2022

Add ability to run stress test for a finite duration or count #4518

Merged

tharbert approved these changes Sep 16, 2022

View reviewed changes

sadhansood force-pushed the sadhan/stress_interval branch 3 times, most recently from 60459ea to 1a48cd2 Compare September 16, 2022 20:48

Base automatically changed from sadhan/stress_interval to main September 16, 2022 21:01

sadhansood added 2 commits September 16, 2022 16:33

Add ability to run stress for a finite duration or count

07b28b2

Add ability to run stress for a finite duration or count

0c7f64a

sadhansood force-pushed the sadhan/compare_benchmarks branch from f1b590d to 5a4b1d2 Compare September 17, 2022 14:11

Add ability to store benchmark results and compare with a previous run

798d72a

sadhansood force-pushed the sadhan/compare_benchmarks branch from 5a4b1d2 to 798d72a Compare September 17, 2022 15:14

sadhansood merged commit bee2098 into main Sep 17, 2022

sadhansood deleted the sadhan/compare_benchmarks branch September 17, 2022 20:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ability to compare two different benchmark runs #4519

Add ability to compare two different benchmark runs #4519

sadhansood commented Sep 8, 2022 •

edited

Loading

Add ability to compare two different benchmark runs #4519

Add ability to compare two different benchmark runs #4519

Conversation

sadhansood commented Sep 8, 2022 • edited Loading

sadhansood commented Sep 8, 2022 •

edited

Loading