feat(ci): add analysis from smp to regression workflow #15063

Merged
merged 9 commits into from
Nov 2, 2022

goxberry
Contributor

@goxberry goxberry commented Nov 1, 2022

This pull request updates the Regression workflow to analyze results from regression experiments using smp. To do so, it makes the following changes:

  • Updates the smp crate version from 0.3.0 to 0.5.0.
  • Updates flags as needed in the smp job submit and smp job cancel invocations.
  • Uncomments the analysis job @blt set up, and adds a couple steps to post regression reports to PR comments.

h/t and thanks go to @blt & @GeorgeHahn for making the setup process easy, and many thanks to y'all for your patience and for being early adopters!

All of the `smp` subcommands take a `--team-id` argument between `smp`
and `job`, but this flag is missing in the `smp job cancel` invocation
in the Vector Regression workflow, so this commit adds that flag.

Signed-off-by: Geoffrey M. Oxberry <geoffrey.oxberry@datadoghq.com>
Flags for the `smp job submit` step have changed: as a result of
passing in image tags to specify the baseline and comparison variants
of a target in regression testing, `smp` no longer has access to Git
commit SHA metadata via the `--baseline-image` and
`--comparison-image` flag parameters. To enable `smp` to report this
information during its analysis step, the `--baseline-sha` and
`--comparison-sha` flags were added to `smp job submit`. This commit
adds those flags and the corresponding commit hashes to the job
submission step in the regression workflow.

Signed-off-by: Geoffrey M. Oxberry <geoffrey.oxberry@datadoghq.com>
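
As a rough illustration of the flag layout described in the two commit messages above, the following Python sketch assembles an `smp job submit` argument list. Only the flag names (`--team-id`, `--baseline-image`, `--comparison-image`, `--baseline-sha`, `--comparison-sha`) and the two short SHAs come from this PR. The helper name `build_submit_argv`, the team ID, and the image tags are hypothetical placeholders, and the real invocation may well require further arguments that are not shown here.

```python
from typing import List


def build_submit_argv(team_id: str,
                      baseline_image: str, comparison_image: str,
                      baseline_sha: str, comparison_sha: str) -> List[str]:
    """Assemble a hypothetical `smp job submit` argument list.

    Only flags named in the commit messages are included; any other flags
    the real CLI needs are deliberately left out of this sketch.
    """
    return [
        "smp",
        "--team-id", team_id,                  # sits between `smp` and `job`
        "job", "submit",
        "--baseline-image", baseline_image,
        "--comparison-image", comparison_image,
        "--baseline-sha", baseline_sha,        # SHAs are passed explicitly
        "--comparison-sha", comparison_sha,    # because image tags hide them
    ]


argv = build_submit_argv(
    team_id="example-team",                        # placeholder value
    baseline_image="example/vector:baseline",      # placeholder image tag
    comparison_image="example/vector:comparison",  # placeholder image tag
    baseline_sha="0e68650",
    comparison_sha="c256ed7",
)
print(" ".join(argv))
```

Building the argument list in one place makes it harder to forget `--team-id` on one subcommand, which is the omission the first commit fixes for `smp job cancel`.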
To add regression reports from `smp`, this commit uncomments and
renames the analysis job, making minor tweaks so that the resulting
job:

    * Downloads an `smp` binary with the right version.

    * Uses that binary to download an HTML report analyzing regression
      experiments on a Vector PR.

    * Posts that HTML report into a comment on that PR.

    * Uploads the HTML report to workflow run artifacts.

Hat-tip and thanks to Brian L Troutwine & George Hahn for making these
changes easy.

Signed-off-by: Geoffrey M. Oxberry <geoffrey.oxberry@datadoghq.com>
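
The "post the report into a PR comment" step listed above can be sketched in Python against the GitHub REST API, which creates pull request comments through the issues comments endpoint. This is a minimal sketch and not the workflow's actual implementation, which more likely relies on an existing GitHub Action. The function name `build_comment_request`, the `example-org` owner, the report body, and the token are placeholders.

```python
import json
import urllib.request


def build_comment_request(owner: str, repo: str, pr_number: int,
                          report_html: str, token: str) -> urllib.request.Request:
    """Build (but do not send) a request that posts a comment on a PR."""
    url = (f"https://api.github.com/repos/{owner}/{repo}"
           f"/issues/{pr_number}/comments")
    payload = json.dumps({"body": report_html}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Accept": "application/vnd.github+json",
            "Content-Type": "application/json",
        },
    )


req = build_comment_request(
    owner="example-org",                              # placeholder owner
    repo="vector",
    pr_number=15063,
    report_html="<h2>Regression Test Results</h2>",   # placeholder report
    token="<token>",                                  # placeholder secret
)
print(req.get_method(), req.full_url)
# Sending the comment would be: urllib.request.urlopen(req)
```

Separating request construction from sending keeps the sketch runnable without network access or credentials.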
@netlify

netlify bot commented Nov 1, 2022

Deploy Preview for vector-project canceled.

Name Link
🔨 Latest commit c256ed7
🔍 Latest deploy log https://app.netlify.com/sites/vector-project/deploys/6362b2fbabe7fb000b2de203

@netlify

netlify bot commented Nov 1, 2022

Deploy Preview for vrl-playground canceled.

Name Link
🔨 Latest commit c256ed7
🔍 Latest deploy log https://app.netlify.com/sites/vrl-playground/deploys/6362b2fbe06d0b000961eb4a

Contributor

@blt blt left a comment

Looks great to me. All that remains to be seen is whether the analysis posts, which I have high confidence in.

@spencergilbert
Contributor

It is a bit unclear to me, this doesn't address the secret access issue yet - right?

@blt blt requested a review from jszwedko November 1, 2022 23:03
This commit adds a missing task dependency to the analysis job so that
it downloads the correct version of `smp`.

Signed-off-by: Geoffrey M. Oxberry <geoffrey.oxberry@datadoghq.com>
GitHub Actions CI emits warnings about the
`aws-actions/configure-aws-credentials@v1` action because it uses
Node.js 12, and Node.js 12 actions are deprecated with removal set for
Summer 2023. Migrating to Node.js-16-based GitHub Actions is the
suggested mitigation, and the
`aws-actions/configure-aws-credentials@v1-node16` is suggested as a
replacement, so this commit makes that change.

Signed-off-by: Geoffrey M. Oxberry <geoffrey.oxberry@datadoghq.com>
@goxberry
Contributor Author

goxberry commented Nov 1, 2022

Should be able to fix the Node.js version warning here by changing aws-actions/configure-aws-credentials@v1 to aws-actions/configure-aws-credentials@v1-node16. Still checking out the other warning. h/t to @spencergilbert for the heads up.

@blt
Contributor

blt commented Nov 1, 2022

@jszwedko @spencergilbert once this is merged we'll have two sets of

It is a bit unclear to me, this doesn't address the secret access issue yet - right?

That's correct. To address that, this workflow would need to be split up, with the code-building portion running per PR and the job submission, etc., running in a triggered workflow sourced from the master branch.

In the Regression task, the `build-comparison` job, which builds a
Vector container for the comparison variant of a regression
experiment, uses the display name "Build baseline Vector
container". This name confused me when I first saw the output in
GitHub Actions because I thought I might have introduced a bug into
the CI Regression workflow, but it appears that the `build-comparison`
job is correctly configured aside from having a misleading name.

To avoid this sort of confusion in the future, this commit updates the
display name of the `build-comparison` job in the Regression workflow
to reflect more accurately what that job does.

Signed-off-by: Geoffrey M. Oxberry <geoffrey.oxberry@datadoghq.com>
@github-actions

github-actions bot commented Nov 2, 2022

Soak Test Results

Baseline: f7f4b64
Comparison: 537759f
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests is to determine, quickly, whether vector performance is changed by a pull request and to what degree. Where appropriate, units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparison SHAs, with 90.0% confidence, OR have been detected as newly erratic. Negative values mean that baseline is faster; positive values mean that comparison is faster. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

No interesting changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

Fine details of change detection per experiment.
| experiment | Δ mean | Δ mean % | confidence | baseline mean | baseline stdev | baseline stderr | baseline outlier % | baseline CoV | comparison mean | comparison stdev | comparison stderr | comparison outlier % | comparison CoV | erratic | declared erratic |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| splunk_hec_route_s3 | 142.01KiB | 0.65 | 98.25% | 21.26MiB | 2.05MiB | 42.76KiB | 0 | 0.0965322 | 21.4MiB | 2.0MiB | 41.72KiB | 0 | 0.0932374 | False | False |
| socket_to_socket_blackhole | 129.75KiB | 0.6 | 99.99% | 21.13MiB | 1.21MiB | 25.2KiB | 0 | 0.0570424 | 21.26MiB | 1.09MiB | 22.86KiB | 0 | 0.0514293 | False | False |
| http_pipelines_blackhole_acks | 4.26KiB | 0.36 | 88.90% | 1.17MiB | 99.97KiB | 2.04KiB | 0 | 0.0837552 | 1.17MiB | 84.93KiB | 1.73KiB | 0 | 0.0709047 | False | False |
| datadog_agent_remap_blackhole | 183.18KiB | 0.31 | 95.13% | 56.87MiB | 3.69MiB | 76.84KiB | 0 | 0.0648224 | 57.05MiB | 2.5MiB | 52.19KiB | 0 | 0.0438213 | False | False |
| syslog_splunk_hec_logs | 40.92KiB | 0.26 | 97.12% | 15.52MiB | 721.55KiB | 14.69KiB | 0 | 0.0454016 | 15.56MiB | 567.77KiB | 11.59KiB | 0 | 0.0356338 | False | False |
| http_text_to_http_json | 83.18KiB | 0.22 | 99.94% | 37.6MiB | 849.76KiB | 17.35KiB | 0 | 0.0220643 | 37.68MiB | 819.12KiB | 16.72KiB | 0 | 0.0212229 | False | False |
| syslog_regex_logs2metric_ddmetrics | 16.84KiB | 0.22 | 45.39% | 7.38MiB | 987.03KiB | 20.11KiB | 0 | 0.130497 | 7.4MiB | 947.67KiB | 19.33KiB | 0 | 0.125014 | False | False |
| syslog_humio_logs | 24.37KiB | 0.15 | 99.89% | 16.05MiB | 308.01KiB | 6.28KiB | 0 | 0.0187414 | 16.07MiB | 195.3KiB | 4.0KiB | 0 | 0.0118657 | False | False |
| datadog_agent_remap_blackhole_acks | 41.26KiB | 0.07 | 30.81% | 57.77MiB | 3.88MiB | 80.8KiB | 0 | 0.0671433 | 57.81MiB | 3.14MiB | 65.65KiB | 0 | 0.0543308 | False | False |
| splunk_hec_to_splunk_hec_logs_noack | 15.4KiB | 0.06 | 82.07% | 23.82MiB | 455.37KiB | 9.3KiB | 0 | 0.0186621 | 23.84MiB | 329.0KiB | 6.72KiB | 0 | 0.0134749 | False | False |
| syslog_log2metric_humio_metrics | 6.04KiB | 0.06 | 47.05% | 9.31MiB | 237.85KiB | 4.85KiB | 0 | 0.0249387 | 9.32MiB | 406.9KiB | 8.28KiB | 0 | 0.0426363 | False | False |
| splunk_hec_to_splunk_hec_logs_acks | 8.2KiB | 0.03 | 26.13% | 23.75MiB | 874.35KiB | 17.78KiB | 0 | 0.0359375 | 23.76MiB | 833.0KiB | 16.95KiB | 0 | 0.0342262 | False | False |
| enterprise_http_to_http | 241.02B | 0 | 2.57% | 23.85MiB | 253.41KiB | 5.17KiB | 0 | 0.0103758 | 23.85MiB | 252.85KiB | 5.18KiB | 0 | 0.0103524 | False | False |
| splunk_hec_indexer_ack_blackhole | -2.73KiB | -0.01 | 9.04% | 23.76MiB | 832.29KiB | 16.93KiB | 0 | 0.0342054 | 23.75MiB | 839.8KiB | 17.08KiB | 0 | 0.0345179 | False | False |
| file_to_blackhole | -63.62KiB | -0.07 | 49.48% | 95.34MiB | 3.12MiB | 64.72KiB | 0 | 0.0327402 | 95.28MiB | 3.37MiB | 70.17KiB | 0 | 0.0354012 | False | False |
| http_to_http_json | -40.14KiB | -0.16 | 99.61% | 23.85MiB | 381.34KiB | 7.78KiB | 0 | 0.0156137 | 23.81MiB | 563.8KiB | 11.51KiB | 0 | 0.0231223 | False | False |
| fluent_elasticsearch | -130.0KiB | -0.16 | 100.00% | 79.47MiB | 52.46KiB | 1.06KiB | 0 | 0.0006445 | 79.35MiB | 1.22MiB | 25.12KiB | 0 | 0.0153744 | False | False |
| syslog_log2metric_splunk_hec_metrics | -28.75KiB | -0.18 | 73.81% | 15.64MiB | 857.84KiB | 17.47KiB | 0 | 0.0535557 | 15.61MiB | 920.56KiB | 18.74KiB | 0 | 0.057575 | False | False |
| http_to_http_noack | -52.94KiB | -0.22 | 99.30% | 23.84MiB | 401.31KiB | 8.2KiB | 0 | 0.0164365 | 23.79MiB | 874.69KiB | 17.83KiB | 0 | 0.0359032 | False | False |
| http_pipelines_blackhole | -4.58KiB | -0.27 | 90.93% | 1.68MiB | 46.2KiB | 966.74B | 0 | 0.0268283 | 1.68MiB | 124.37KiB | 2.53KiB | 0 | 0.072409 | False | False |
| syslog_loki | -64.3KiB | -0.41 | 99.96% | 15.39MiB | 393.62KiB | 8.05KiB | 0 | 0.0249754 | 15.32MiB | 802.95KiB | 16.32KiB | 0 | 0.0511569 | False | False |
| datadog_agent_remap_datadog_logs_acks | -242.07KiB | -0.45 | 98.22% | 53.02MiB | 2.9MiB | 60.68KiB | 0 | 0.0547419 | 52.78MiB | 3.95MiB | 82.14KiB | 0 | 0.0747423 | False | False |
| datadog_agent_remap_datadog_logs | -456.24KiB | -0.81 | 100.00% | 54.67MiB | 1.43MiB | 29.95KiB | 0 | 0.0261373 | 54.22MiB | 3.77MiB | 78.55KiB | 0 | 0.0695619 | False | False |
| http_pipelines_no_grok_blackhole | -112.96KiB | -1.05 | 100.00% | 10.46MiB | 256.61KiB | 5.24KiB | 0 | 0.0239571 | 10.35MiB | 1.06MiB | 22.01KiB | 0 | 0.102157 | False | False |
| http_to_http_acks | -279.01KiB | -1.57 | 75.42% | 17.36MiB | 8.11MiB | 169.46KiB | 0 | 0.466748 | 17.09MiB | 8.17MiB | 170.46KiB | 0 | 0.478189 | True | True |
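
The selection rule stated in the explanation above (at least 90% confidence, more than ±8.87% change in mean throughput, or newly erratic with a coefficient of variation above 0.3) can be sketched in Python as follows. This is one reading of the prose, not the soak analysis tool's actual code. In particular, treating "newly erratic" as "erratic in the comparison but not in the baseline" is an assumption, and the names `Experiment`, `is_erratic`, and `is_interesting` are hypothetical. The three sample rows are copied from the table above.

```python
from dataclasses import dataclass


@dataclass
class Experiment:
    name: str
    delta_mean_pct: float    # "Δ mean %" column
    confidence_pct: float    # "confidence" column, in percent
    baseline_cov: float      # "baseline CoV" column
    comparison_cov: float    # "comparison CoV" column


def is_erratic(cov: float, cov_threshold: float = 0.3) -> bool:
    """An experiment is erratic if its coefficient of variation exceeds 0.3."""
    return cov > cov_threshold


def is_interesting(e: Experiment,
                   min_confidence_pct: float = 90.0,
                   min_abs_delta_pct: float = 8.87) -> bool:
    """Return True if the experiment would appear in the abbreviated table."""
    significant = (e.confidence_pct >= min_confidence_pct
                   and abs(e.delta_mean_pct) > min_abs_delta_pct)
    # Assumption: "newly erratic" means erratic now but not in the baseline.
    newly_erratic = (is_erratic(e.comparison_cov)
                     and not is_erratic(e.baseline_cov))
    return significant or newly_erratic


rows = [
    Experiment("splunk_hec_route_s3", 0.65, 98.25, 0.0965322, 0.0932374),
    Experiment("fluent_elasticsearch", -0.16, 100.00, 0.0006445, 0.0153744),
    Experiment("http_to_http_acks", -1.57, 75.42, 0.466748, 0.478189),
]
print([e.name for e in rows if is_interesting(e)])  # prints []
```

Under this reading, `http_to_http_acks` is erratic in both runs, so it is not newly erratic, which is consistent with the report listing no interesting changes.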

@blt
Contributor

blt commented Nov 2, 2022

Hmm, it looks like splunk_hec_route_s3 is failing when run under the regression detector. We'll re-trigger the run to see if it's a persistent problem on our side -- config bungled? -- and will dig in further tomorrow.

@github-actions

github-actions bot commented Nov 2, 2022

Soak Test Results

Baseline: 373e1c8
Comparison: c4cdb67
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests is to determine, quickly, whether vector performance is changed by a pull request and to what degree. Where appropriate, units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparison SHAs, with 90.0% confidence, OR have been detected as newly erratic. Negative values mean that baseline is faster; positive values mean that comparison is faster. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

No interesting changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

Fine details of change detection per experiment.
| experiment | Δ mean | Δ mean % | confidence | baseline mean | baseline stdev | baseline stderr | baseline outlier % | baseline CoV | comparison mean | comparison stdev | comparison stderr | comparison outlier % | comparison CoV | erratic | declared erratic |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| http_pipelines_blackhole_acks | 19.79KiB | 1.75 | 100.00% | 1.1MiB | 166.77KiB | 3.4KiB | 0 | 0.147686 | 1.12MiB | 141.87KiB | 2.89KiB | 0 | 0.123468 | False | False |
| syslog_log2metric_humio_metrics | 90.81KiB | 0.93 | 100.00% | 9.5MiB | 183.51KiB | 3.75KiB | 0 | 0.0188503 | 9.59MiB | 394.97KiB | 8.04KiB | 0 | 0.0401968 | False | False |
| syslog_humio_logs | 110.34KiB | 0.69 | 100.00% | 15.67MiB | 343.01KiB | 7.0KiB | 0 | 0.0213687 | 15.78MiB | 316.79KiB | 6.49KiB | 0 | 0.0196008 | False | False |
| syslog_splunk_hec_logs | 81.18KiB | 0.52 | 100.00% | 15.14MiB | 655.61KiB | 13.35KiB | 0 | 0.042281 | 15.22MiB | 529.78KiB | 10.81KiB | 0 | 0.0339877 | False | False |
| syslog_log2metric_splunk_hec_metrics | 79.4KiB | 0.48 | 100.00% | 16.17MiB | 627.37KiB | 12.77KiB | 0 | 0.037877 | 16.25MiB | 623.12KiB | 12.69KiB | 0 | 0.0374409 | False | False |
| splunk_hec_route_s3 | 89.42KiB | 0.42 | 83.27% | 21.01MiB | 2.25MiB | 46.79KiB | 0 | 0.106905 | 21.1MiB | 2.14MiB | 44.76KiB | 0 | 0.101442 | False | False |
| syslog_regex_logs2metric_ddmetrics | 4.84KiB | 0.06 | 14.74% | 7.66MiB | 918.85KiB | 18.7KiB | 0 | 0.117106 | 7.67MiB | 891.87KiB | 18.17KiB | 0 | 0.113597 | False | False |
| splunk_hec_to_splunk_hec_logs_noack | 13.04KiB | 0.05 | 75.92% | 23.83MiB | 432.42KiB | 8.83KiB | 0 | 0.0177196 | 23.84MiB | 331.0KiB | 6.76KiB | 0 | 0.0135564 | False | False |
| splunk_hec_indexer_ack_blackhole | 8.04KiB | 0.03 | 26.04% | 23.75MiB | 852.52KiB | 17.34KiB | 0 | 0.0350424 | 23.76MiB | 828.39KiB | 16.86KiB | 0 | 0.0340394 | False | False |
| http_to_http_acks | 5.9KiB | 0.03 | 1.93% | 17.29MiB | 8.31MiB | 173.81KiB | 0 | 0.480641 | 17.3MiB | 8.15MiB | 170.06KiB | 0 | 0.471316 | True | True |
| enterprise_http_to_http | 1.29KiB | 0.01 | 13.72% | 23.84MiB | 261.08KiB | 5.33KiB | 0 | 0.0106903 | 23.85MiB | 255.57KiB | 5.23KiB | 0 | 0.0104639 | False | False |
| splunk_hec_to_splunk_hec_logs_acks | 394.56B | 0 | 1.21% | 23.76MiB | 879.93KiB | 17.9KiB | 0 | 0.0361632 | 23.76MiB | 893.6KiB | 18.18KiB | 0 | 0.0367244 | False | False |
| file_to_blackhole | -81.77KiB | -0.08 | 68.19% | 95.35MiB | 2.65MiB | 55.0KiB | 0 | 0.0278196 | 95.27MiB | 2.92MiB | 60.69KiB | 0 | 0.0306532 | False | False |
| http_to_http_json | -30.27KiB | -0.12 | 98.00% | 23.85MiB | 370.79KiB | 7.57KiB | 0 | 0.0151811 | 23.82MiB | 518.12KiB | 10.59KiB | 0 | 0.0212399 | False | False |
| fluent_elasticsearch | -162.94KiB | -0.2 | 100.00% | 79.47MiB | 53.28KiB | 1.08KiB | 0 | 0.00065451 | 79.31MiB | 1.49MiB | 30.75KiB | 0 | 0.0188341 | False | False |
| http_to_http_noack | -92.73KiB | -0.38 | 100.00% | 23.84MiB | 266.98KiB | 5.46KiB | 0 | 0.0109321 | 23.75MiB | 1.03MiB | 21.58KiB | 0 | 0.043549 | False | False |
| http_pipelines_no_grok_blackhole | -60.02KiB | -0.55 | 99.52% | 10.59MiB | 74.55KiB | 1.52KiB | 0 | 0.00687415 | 10.53MiB | 1.02MiB | 21.23KiB | 0 | 0.096803 | False | False |
| datadog_agent_remap_blackhole | -332.33KiB | -0.56 | 99.99% | 57.64MiB | 3.8MiB | 79.16KiB | 0 | 0.0659061 | 57.32MiB | 1.41MiB | 29.37KiB | 0 | 0.0245184 | False | False |
| syslog_loki | -96.88KiB | -0.61 | 100.00% | 15.54MiB | 282.78KiB | 5.79KiB | 0 | 0.0177645 | 15.45MiB | 770.1KiB | 15.65KiB | 0 | 0.0486741 | False | False |
| datadog_agent_remap_blackhole_acks | -430.57KiB | -0.73 | 100.00% | 57.65MiB | 4.18MiB | 87.03KiB | 0 | 0.072503 | 57.23MiB | 2.36MiB | 49.32KiB | 0 | 0.0411774 | False | False |
| datadog_agent_remap_datadog_logs | -487.65KiB | -0.86 | 100.00% | 55.32MiB | 666.81KiB | 13.65KiB | 0 | 0.0117693 | 54.84MiB | 3.64MiB | 75.87KiB | 0 | 0.0664051 | False | False |
| datadog_agent_remap_datadog_logs_acks | -520.66KiB | -0.93 | 100.00% | 54.58MiB | 2.48MiB | 51.95KiB | 0 | 0.0454938 | 54.07MiB | 3.79MiB | 78.96KiB | 0 | 0.0701447 | False | False |
| socket_to_socket_blackhole | -485.13KiB | -2.1 | 100.00% | 22.58MiB | 569.09KiB | 11.62KiB | 0 | 0.0246097 | 22.1MiB | 458.11KiB | 9.36KiB | 0 | 0.020235 | False | False |
| http_pipelines_blackhole | -33.7KiB | -2.12 | 100.00% | 1.55MiB | 146.07KiB | 2.98KiB | 0 | 0.0919419 | 1.52MiB | 201.49KiB | 4.11KiB | 0 | 0.12957 | False | False |
| http_text_to_http_json | -1.55MiB | -3.91 | 100.00% | 39.55MiB | 778.7KiB | 15.9KiB | 0 | 0.0192236 | 38.0MiB | 809.77KiB | 16.53KiB | 0 | 0.0208044 | False | False |

Signed-off-by: Brian L. Troutwine <brian.troutwine@datadoghq.com>
It turns out that lading will crash on the splunk_hec_route_s3 experiment
because it expects acks. We have to intentionally inform Vector to ack.

Signed-off-by: Brian L. Troutwine <brian.troutwine@datadoghq.com>
Member

@jszwedko jszwedko left a comment

One question, but otherwise looks reasonable! I'll wait to see what the results look like from this.

regression/cases/splunk_hec_route_s3/vector/vector.toml (outdated)
@github-actions

github-actions bot commented Nov 2, 2022

Regression Test Results

Baseline: 0e68650
Comparison: e2caa13
Total vector CPUs: 4

Explanation

A regression test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests is to determine, quickly, whether vector performance is changed by a pull request and to what degree. Where appropriate, units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their bytes_written_per_cpu_second performance between baseline and comparison SHAs, with 90.0% confidence, OR have been detected as newly erratic. Negative values mean that baseline is faster; positive values mean that comparison is faster. Results that do not exhibit more than a ±5% change in mean bytes_written_per_cpu_second are discarded. An experiment is erratic if its coefficient of variation is greater than 0.1. The abbreviated table will be omitted if no interesting changes are observed.

No interesting changes in bytes_written_per_cpu_second with confidence ≥ 90.00% and absolute Δ mean >= ±5%:

Fine details of change detection per experiment.
| experiment | Δ mean | Δ mean % | confidence | baseline mean | baseline stdev | baseline stderr | baseline outlier % | baseline CoV | comparison mean | comparison stdev | comparison stderr | comparison outlier % | comparison CoV | erratic | declared erratic |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| syslog_regex_logs2metric_ddmetrics | 89.66KiB/CPU-s | 1.46 | 100.00% | 5.99MiB/CPU-s | 436.17KiB/CPU-s | 5.63KiB/CPU-s | 0.0 | 0.071046 | 6.08MiB/CPU-s | 473.24KiB/CPU-s | 6.11KiB/CPU-s | 0.0 | 0.075974 | False | False |
| http_to_http_acks | 94.27KiB/CPU-s | 0.93 | 60.14% | 9.87MiB/CPU-s | 5.93MiB/CPU-s | 78.44KiB/CPU-s | 0.0 | 0.601291 | 9.96MiB/CPU-s | 6.01MiB/CPU-s | 79.48KiB/CPU-s | 0.0 | 0.603534 | True | False |
| datadog_agent_remap_blackhole_acks | 262.81KiB/CPU-s | 0.60 | 100.00% | 42.56MiB/CPU-s | 856.48KiB/CPU-s | 11.06KiB/CPU-s | 0.0 | 0.019652 | 42.81MiB/CPU-s | 1.09MiB/CPU-s | 14.47KiB/CPU-s | 0.0 | 0.025558 | False | False |
| socket_to_socket_blackhole | 105.01KiB/CPU-s | 0.45 | 100.00% | 22.6MiB/CPU-s | 808.79KiB/CPU-s | 10.45KiB/CPU-s | 0.0 | 0.034944 | 22.7MiB/CPU-s | 372.15KiB/CPU-s | 4.8KiB/CPU-s | 0.0 | 0.016006 | False | False |
| syslog_loki | 59.83KiB/CPU-s | 0.42 | 100.00% | 13.96MiB/CPU-s | 446.57KiB/CPU-s | 5.76KiB/CPU-s | 0.0 | 0.031242 | 14.02MiB/CPU-s | 477.18KiB/CPU-s | 6.16KiB/CPU-s | 0.0 | 0.033245 | False | False |
| syslog_log2metric_humio_metrics | 28.86KiB/CPU-s | 0.37 | 100.00% | 7.67MiB/CPU-s | 165.08KiB/CPU-s | 2.13KiB/CPU-s | 0.0 | 0.021021 | 7.7MiB/CPU-s | 205.33KiB/CPU-s | 2.65KiB/CPU-s | 0.0 | 0.026050 | False | False |
| datadog_agent_remap_datadog_logs | 107.83KiB/CPU-s | 0.35 | 100.00% | 30.09MiB/CPU-s | 920.85KiB/CPU-s | 11.88KiB/CPU-s | 0.0 | 0.029880 | 30.2MiB/CPU-s | 977.91KiB/CPU-s | 12.61KiB/CPU-s | 0.0 | 0.031621 | False | False |
| otlp_grpc_to_blackhole | 4.81KiB/CPU-s | 0.29 | 99.85% | 1.61MiB/CPU-s | 85.01KiB/CPU-s | 1.1KiB/CPU-s | 0.0 | 0.051447 | 1.62MiB/CPU-s | 81.54KiB/CPU-s | 1.05KiB/CPU-s | 0.0 | 0.049199 | False | False |
| http_text_to_http_json | 89.75KiB/CPU-s | 0.24 | 100.00% | 36.95MiB/CPU-s | 952.78KiB/CPU-s | 12.3KiB/CPU-s | 0.0 | 0.025181 | 37.03MiB/CPU-s | 817.5KiB/CPU-s | 10.56KiB/CPU-s | 0.0 | 0.021555 | False | False |
| file_to_blackhole | 165.74KiB/CPU-s | 0.18 | 48.08% | 92.29MiB/CPU-s | 13.98MiB/CPU-s | 184.53KiB/CPU-s | 0.0 | 0.151485 | 92.45MiB/CPU-s | 13.56MiB/CPU-s | 179.07KiB/CPU-s | 0.0 | 0.146696 | True | False |
| syslog_log2metric_splunk_hec_metrics | 12.18KiB/CPU-s | 0.09 | 90.48% | 13.74MiB/CPU-s | 379.19KiB/CPU-s | 4.9KiB/CPU-s | 0.0 | 0.026952 | 13.75MiB/CPU-s | 419.31KiB/CPU-s | 5.41KiB/CPU-s | 0.0 | 0.029778 | False | False |
| splunk_hec_indexer_ack_blackhole | 4.02KiB/CPU-s | 0.02 | 38.54% | 23.83MiB/CPU-s | 448.83KiB/CPU-s | 5.79KiB/CPU-s | 0.0 | 0.018394 | 23.83MiB/CPU-s | 425.6KiB/CPU-s | 5.49KiB/CPU-s | 0.0 | 0.017439 | False | False |
| enterprise_http_to_http | 319.47B/CPU-s | 0.00 | 5.09% | 23.84MiB/CPU-s | 268.07KiB/CPU-s | 3.46KiB/CPU-s | 0.0 | 0.010979 | 23.84MiB/CPU-s | 266.55KiB/CPU-s | 3.44KiB/CPU-s | 0.0 | 0.010917 | False | False |
| otlp_http_to_blackhole | -489.1B/CPU-s | -0.02 | 8.47% | 2.36MiB/CPU-s | 243.77KiB/CPU-s | 3.15KiB/CPU-s | 0.0 | 0.100900 | 2.36MiB/CPU-s | 248.42KiB/CPU-s | 3.21KiB/CPU-s | 0.0 | 0.102846 | True | False |
| http_to_http_noack | -7.68KiB/CPU-s | -0.03 | 64.82% | 23.84MiB/CPU-s | 404.61KiB/CPU-s | 5.23KiB/CPU-s | 0.0 | 0.016576 | 23.83MiB/CPU-s | 494.46KiB/CPU-s | 6.38KiB/CPU-s | 0.0 | 0.020264 | False | False |
| fluent_elasticsearch | -28.16KiB/CPU-s | -0.03 | 98.70% | 79.47MiB/CPU-s | 83.47KiB/CPU-s | 1.07KiB/CPU-s | 0.0 | 0.001026 | 79.45MiB/CPU-s | 883.47KiB/CPU-s | 11.28KiB/CPU-s | 0.0 | 0.010859 | False | False |
| http_to_http_json | -7.67KiB/CPU-s | -0.03 | 43.64% | 23.66MiB/CPU-s | 722.67KiB/CPU-s | 9.33KiB/CPU-s | 0.0 | 0.029824 | 23.65MiB/CPU-s | 731.14KiB/CPU-s | 9.44KiB/CPU-s | 0.0 | 0.030183 | False | False |
| splunk_hec_to_splunk_hec_logs_noack | -14.75KiB/CPU-s | -0.08 | 69.70% | 18.64MiB/CPU-s | 771.75KiB/CPU-s | 9.96KiB/CPU-s | 0.0 | 0.040431 | 18.62MiB/CPU-s | 797.43KiB/CPU-s | 10.29KiB/CPU-s | 0.0 | 0.041809 | False | False |
| http_pipelines_no_grok_blackhole | -11.51KiB/CPU-s | -0.20 | 100.00% | 5.75MiB/CPU-s | 85.39KiB/CPU-s | 1.1KiB/CPU-s | 0.0 | 0.014498 | 5.74MiB/CPU-s | 173.58KiB/CPU-s | 2.24KiB/CPU-s | 0.0 | 0.029529 | False | False |
| syslog_humio_logs | -33.03KiB/CPU-s | -0.23 | 100.00% | 13.93MiB/CPU-s | 529.84KiB/CPU-s | 6.84KiB/CPU-s | 0.0 | 0.037137 | 13.9MiB/CPU-s | 291.38KiB/CPU-s | 3.76KiB/CPU-s | 0.0 | 0.020470 | False | False |
| syslog_splunk_hec_logs | -34.86KiB/CPU-s | -0.24 | 100.00% | 14.01MiB/CPU-s | 318.29KiB/CPU-s | 4.11KiB/CPU-s | 0.0 | 0.022177 | 13.98MiB/CPU-s | 313.83KiB/CPU-s | 4.05KiB/CPU-s | 0.0 | 0.021920 | False | False |
| splunk_hec_to_splunk_hec_logs_acks | -70.76KiB/CPU-s | -0.38 | 100.00% | 18.15MiB/CPU-s | 866.25KiB/CPU-s | 11.17KiB/CPU-s | 0.0 | 0.046618 | 18.08MiB/CPU-s | 859.36KiB/CPU-s | 11.09KiB/CPU-s | 0.0 | 0.046424 | False | False |
| datadog_agent_remap_datadog_logs_acks | -151.06KiB/CPU-s | -0.44 | 100.00% | 33.71MiB/CPU-s | 1.21MiB/CPU-s | 15.97KiB/CPU-s | 0.0 | 0.035871 | 33.56MiB/CPU-s | 692.5KiB/CPU-s | 8.94KiB/CPU-s | 0.0 | 0.020148 | False | False |
| http_pipelines_blackhole_acks | -4.03KiB/CPU-s | -0.53 | 100.00% | 754.13KiB/CPU-s | 18.2KiB/CPU-s | 240.61B/CPU-s | 0.0 | 0.024127 | 750.11KiB/CPU-s | 28.43KiB/CPU-s | 375.65B/CPU-s | 0.0 | 0.037895 | False | False |
| http_pipelines_blackhole | -6.88KiB/CPU-s | -0.71 | 100.00% | 965.77KiB/CPU-s | 20.51KiB/CPU-s | 271.34B/CPU-s | 0.0 | 0.021233 | 958.9KiB/CPU-s | 40.36KiB/CPU-s | 533.34B/CPU-s | 0.0 | 0.042084 | False | False |
| splunk_hec_route_s3 | -100.91KiB/CPU-s | -0.75 | 100.00% | 13.1MiB/CPU-s | 922.95KiB/CPU-s | 11.91KiB/CPU-s | 0.0 | 0.068806 | 13.0MiB/CPU-s | 908.54KiB/CPU-s | 11.72KiB/CPU-s | 0.0 | 0.068245 | False | False |
| datadog_agent_remap_blackhole | -1.02MiB/CPU-s | -2.44 | 100.00% | 41.88MiB/CPU-s | 1018.72KiB/CPU-s | 13.15KiB/CPU-s | 0.0 | 0.023753 | 40.86MiB/CPU-s | 2.03MiB/CPU-s | 26.88KiB/CPU-s | 0.0 | 0.049784 | False | False |
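
To make the columns of the table above easier to read, here is a small Python sketch that recomputes two of them for the `http_to_http_acks` row from its displayed values. The coefficient of variation is the standard deviation divided by the mean. Treating "Δ mean %" as Δ mean divided by the baseline mean is an assumption that happens to match this row, not something the report states. The helper `parse_rate` is hypothetical, and because the inputs are the rounded values shown in the table, the results agree with the reported figures only to the printed precision.

```python
import re

# Binary unit multipliers used in the report's per-CPU-second rates.
_UNITS = {"B": 1, "KiB": 1024, "MiB": 1024 ** 2, "GiB": 1024 ** 3}


def parse_rate(text: str) -> float:
    """Convert a value such as '9.87MiB/CPU-s' to bytes per CPU-second."""
    match = re.fullmatch(r"(-?\d+(?:\.\d+)?)(B|KiB|MiB|GiB)/CPU-s", text)
    if match is None:
        raise ValueError(f"unrecognized rate: {text!r}")
    return float(match.group(1)) * _UNITS[match.group(2)]


# Values copied from the http_to_http_acks row.
delta = parse_rate("94.27KiB/CPU-s")
baseline_mean = parse_rate("9.87MiB/CPU-s")
baseline_stdev = parse_rate("5.93MiB/CPU-s")

delta_pct = 100 * delta / baseline_mean   # assumed definition of "Δ mean %"
cov = baseline_stdev / baseline_mean      # coefficient of variation

print(f"{delta_pct:.2f}")  # prints 0.93
print(f"{cov:.3f}")        # prints 0.601
print(cov > 0.1)           # prints True: erratic under the 0.1 threshold
```

The recomputed coefficient of variation is well above the 0.1 threshold, which agrees with the `erratic` column reading `True` for this experiment.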

Signed-off-by: Brian L. Troutwine <brian.troutwine@datadoghq.com>
@github-actions

github-actions bot commented Nov 2, 2022

Regression Test Results

Baseline: 0e68650
Comparison: c256ed7
Total vector CPUs: 4

Explanation

A regression test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests is to determine, quickly, whether vector performance is changed by a pull request and to what degree. Where appropriate, units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their bytes_written_per_cpu_second performance between baseline and comparison SHAs, with 90.0% confidence, OR have been detected as newly erratic. Negative values mean that baseline is faster; positive values mean that comparison is faster. Results that do not exhibit more than a ±5% change in mean bytes_written_per_cpu_second are discarded. An experiment is erratic if its coefficient of variation is greater than 0.1. The abbreviated table will be omitted if no interesting changes are observed.

No interesting changes in bytes_written_per_cpu_second with confidence ≥ 90.00% and absolute Δ mean >= ±5%:

Fine details of change detection per experiment.
| experiment | Δ mean | Δ mean % | confidence | baseline mean | baseline stdev | baseline stderr | baseline outlier % | baseline CoV | comparison mean | comparison stdev | comparison stderr | comparison outlier % | comparison CoV | erratic | declared erratic |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| syslog_regex_logs2metric_ddmetrics | 165.58KiB/CPU-s | 2.74 | 100.00% | 5.9MiB/CPU-s | 611.57KiB/CPU-s | 7.89KiB/CPU-s | 0.0 | 0.101174 | 6.06MiB/CPU-s | 493.48KiB/CPU-s | 6.37KiB/CPU-s | 0.0 | 0.079461 | True | False |
| syslog_log2metric_splunk_hec_metrics | 154.32KiB/CPU-s | 1.10 | 100.00% | 13.72MiB/CPU-s | 325.84KiB/CPU-s | 4.21KiB/CPU-s | 0.0 | 0.023192 | 13.87MiB/CPU-s | 306.85KiB/CPU-s | 3.96KiB/CPU-s | 0.0 | 0.021603 | False | False |
| syslog_loki | 137.6KiB/CPU-s | 0.97 | 100.00% | 13.91MiB/CPU-s | 476.08KiB/CPU-s | 6.14KiB/CPU-s | 0.0 | 0.033418 | 14.05MiB/CPU-s | 352.07KiB/CPU-s | 4.55KiB/CPU-s | 0.0 | 0.024477 | False | False |
| datadog_agent_remap_blackhole | 376.36KiB/CPU-s | 0.88 | 100.00% | 41.61MiB/CPU-s | 951.48KiB/CPU-s | 12.28KiB/CPU-s | 0.0 | 0.022327 | 41.98MiB/CPU-s | 1.2MiB/CPU-s | 15.86KiB/CPU-s | 0.0 | 0.028599 | False | False |
| syslog_log2metric_humio_metrics | 61.77KiB/CPU-s | 0.80 | 100.00% | 7.57MiB/CPU-s | 241.19KiB/CPU-s | 3.11KiB/CPU-s | 0.0 | 0.031127 | 7.63MiB/CPU-s | 219.23KiB/CPU-s | 2.83KiB/CPU-s | 0.0 | 0.028070 | False | False |
| splunk_hec_to_splunk_hec_logs_acks | 146.9KiB/CPU-s | 0.80 | 100.00% | 18.03MiB/CPU-s | 917.06KiB/CPU-s | 11.83KiB/CPU-s | 0.0 | 0.049653 | 18.18MiB/CPU-s | 849.17KiB/CPU-s | 10.95KiB/CPU-s | 0.0 | 0.045615 | False | False |
| otlp_grpc_to_blackhole | 13.02KiB/CPU-s | 0.79 | 100.00% | 1.61MiB/CPU-s | 79.0KiB/CPU-s | 1.02KiB/CPU-s | 0.0 | 0.047944 | 1.62MiB/CPU-s | 76.77KiB/CPU-s | 1014.6B/CPU-s | 0.0 | 0.046227 | False | False |
| splunk_hec_route_s3 | 90.09KiB/CPU-s | 0.66 | 100.00% | 13.23MiB/CPU-s | 967.85KiB/CPU-s | 12.49KiB/CPU-s | 0.0 | 0.071418 | 13.32MiB/CPU-s | 893.7KiB/CPU-s | 11.53KiB/CPU-s | 0.0 | 0.065511 | False | False |
| http_to_http_json | 115.53KiB/CPU-s | 0.48 | 100.00% | 23.47MiB/CPU-s | 896.61KiB/CPU-s | 11.57KiB/CPU-s | 0.0 | 0.037305 | 23.58MiB/CPU-s | 800.2KiB/CPU-s | 10.33KiB/CPU-s | 0.0 | 0.033135 | False | False |
| otlp_http_to_blackhole | 7.19KiB/CPU-s | 0.30 | 87.38% | 2.33MiB/CPU-s | 253.2KiB/CPU-s | 3.27KiB/CPU-s | 0.0 | 0.106306 | 2.33MiB/CPU-s | 261.86KiB/CPU-s | 3.38KiB/CPU-s | 0.0 | 0.109614 | True | False |
| datadog_agent_remap_blackhole_acks | 81.72KiB/CPU-s | 0.19 | 99.19% | 42.42MiB/CPU-s | 1.79MiB/CPU-s | 23.73KiB/CPU-s | 0.0 | 0.042301 | 42.5MiB/CPU-s | 1.49MiB/CPU-s | 19.75KiB/CPU-s | 0.0 | 0.035165 | False | False |
| http_to_http_acks | 18.01KiB/CPU-s | 0.18 | 12.94% | 9.97MiB/CPU-s | 5.9MiB/CPU-s | 77.95KiB/CPU-s | 0.0 | 0.591399 | 9.99MiB/CPU-s | 5.93MiB/CPU-s | 78.38KiB/CPU-s | 0.0 | 0.593331 | True | False |
| syslog_humio_logs | 19.53KiB/CPU-s | 0.14 | 100.00% | 14.03MiB/CPU-s | 268.88KiB/CPU-s | 3.47KiB/CPU-s | 0.0 | 0.018710 | 14.05MiB/CPU-s | 245.72KiB/CPU-s | 3.17KiB/CPU-s | 0.0 | 0.017075 | False | False |
| fluent_elasticsearch | 477.39B/CPU-s | 0.00 | 37.58% | 79.47MiB/CPU-s | 52.77KiB/CPU-s | 690.28B/CPU-s | 0.0 | 0.000648 | 79.47MiB/CPU-s | 52.59KiB/CPU-s | 687.88B/CPU-s | 0.0 | 0.000646 | False | False |
| enterprise_http_to_http | -237.61B/CPU-s | -0.00 | 3.80% | 23.84MiB/CPU-s | 266.76KiB/CPU-s | 3.44KiB/CPU-s | 0.0 | 0.010925 | 23.84MiB/CPU-s | 265.79KiB/CPU-s | 3.43KiB/CPU-s | 0.0 | 0.010886 | False | False |
| splunk_hec_indexer_ack_blackhole | -1.88KiB/CPU-s | -0.01 | 18.93% | 23.83MiB/CPU-s | 424.71KiB/CPU-s | 5.48KiB/CPU-s | 0.0 | 0.017403 | 23.83MiB/CPU-s | 438.01KiB/CPU-s | 5.65KiB/CPU-s | 0.0 | 0.017949 | False | False |
| http_to_http_noack | -3.44KiB/CPU-s | -0.01 | 38.74% | 23.84MiB/CPU-s | 342.71KiB/CPU-s | 4.43KiB/CPU-s | 0.0 | 0.014039 | 23.83MiB/CPU-s | 399.66KiB/CPU-s | 5.16KiB/CPU-s | 0.0 | 0.016373 | False | False |
| datadog_agent_remap_datadog_logs | -9.59KiB/CPU-s | -0.03 | 40.18% | 29.97MiB/CPU-s | 743.78KiB/CPU-s | 9.6KiB/CPU-s | 0.0 | 0.024231 | 29.96MiB/CPU-s | 1.17MiB/CPU-s | 15.45KiB/CPU-s | 0.0 | 0.039037 | False | False |
| http_pipelines_blackhole_acks | -272.0B/CPU-s | -0.04 | 43.23% | 754.79KiB/CPU-s | 18.68KiB/CPU-s | 247.01B/CPU-s | 0.0 | 0.024749 | 754.53KiB/CPU-s | 30.79KiB/CPU-s | 406.8B/CPU-s | 0.0 | 0.040800 | False | False |
| file_to_blackhole | -69.42KiB/CPU-s | -0.07 | 21.38% | 92.47MiB/CPU-s | 13.66MiB/CPU-s | 180.36KiB/CPU-s | 0.0 | 0.147717 | 92.41MiB/CPU-s | 13.76MiB/CPU-s | 181.56KiB/CPU-s | 0.0 | 0.148897 | True | False |
| splunk_hec_to_splunk_hec_logs_noack | -21.68KiB/CPU-s | -0.11 | 86.97% | 18.68MiB/CPU-s | 780.99KiB/CPU-s | 10.08KiB/CPU-s | 0.0 | 0.040823 | 18.66MiB/CPU-s | 789.8KiB/CPU-s | 10.19KiB/CPU-s | 0.0 | 0.041331 | False | False |
| http_pipelines_no_grok_blackhole | -14.3KiB/CPU-s | -0.24 | 100.00% | 5.75MiB/CPU-s | 53.02KiB/CPU-s | 700.95B/CPU-s | 0.0 | 0.009001 | 5.74MiB/CPU-s | 127.47KiB/CPU-s | 1.65KiB/CPU-s | 0.0 | 0.021691 | False | False |
| socket_to_socket_blackhole | -57.88KiB/CPU-s | -0.25 | 100.00% | 22.87MiB/CPU-s | 374.36KiB/CPU-s | 4.83KiB/CPU-s | 0.0 | 0.015987 | 22.81MiB/CPU-s | 269.06KiB/CPU-s | 3.47KiB/CPU-s | 0.0 | 0.011519 | False | False |
| http_text_to_http_json | -151.89KiB/CPU-s | -0.40 | 100.00% | 36.94MiB/CPU-s | 856.31KiB/CPU-s | 11.06KiB/CPU-s | 0.0 | 0.022634 | 36.79MiB/CPU-s | 920.99KiB/CPU-s | 11.9KiB/CPU-s | 0.0 | 0.024442 | False | False |
| datadog_agent_remap_datadog_logs_acks | -169.09KiB/CPU-s | -0.49 | 100.00% | 33.9MiB/CPU-s | 1.11MiB/CPU-s | 14.62KiB/CPU-s | 0.0 | 0.032650 | 33.74MiB/CPU-s | 1.02MiB/CPU-s | 13.46KiB/CPU-s | 0.0 | 0.030189 | False | False |
| syslog_splunk_hec_logs | -289.97KiB/CPU-s | -2.02 | 100.00% | 14.03MiB/CPU-s | 315.89KiB/CPU-s | 4.08KiB/CPU-s | 0.0 | 0.021985 | 13.75MiB/CPU-s | 485.5KiB/CPU-s | 6.27KiB/CPU-s | 0.0 | 0.034485 | False | False |
| http_pipelines_blackhole | -26.41KiB/CPU-s | -2.73 | 100.00% | 966.04KiB/CPU-s | 19.87KiB/CPU-s | 262.84B/CPU-s | 0.0 | 0.020562 | 939.63KiB/CPU-s | 53.9KiB/CPU-s | 712.34B/CPU-s | 0.0 | 0.057356 | False | False |

@github-actions

github-actions bot commented Nov 2, 2022

Soak Test Results

Baseline: 0e68650
Comparison: c256ed7
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests is to determine, quickly, whether vector performance is changed by a pull request and to what degree. Where appropriate, units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparison SHAs, with 90.0% confidence, OR have been detected as newly erratic. Negative values mean that baseline is faster; positive values mean that comparison is faster. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

No interesting changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

Fine details of change detection per experiment.
| experiment | Δ mean | Δ mean % | confidence | baseline mean | baseline stdev | baseline stderr | baseline outlier % | baseline CoV | comparison mean | comparison stdev | comparison stderr | comparison outlier % | comparison CoV | erratic | declared erratic |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| syslog_log2metric_humio_metrics | 148.4KiB | 1.57 | 100.00% | 9.25MiB | 217.87KiB | 4.45KiB | 0 | 0.0230023 | 9.39MiB | 326.74KiB | 6.66KiB | 0 | 0.0339647 | False | False |
| http_pipelines_blackhole_acks | 9.67KiB | 0.78 | 99.92% | 1.22MiB | 111.63KiB | 2.27KiB | 0 | 0.0894801 | 1.23MiB | 86.78KiB | 1.77KiB | 0 | 0.0690221 | False | False |
| datadog_agent_remap_blackhole | 357.16KiB | 0.63 | 99.98% | 55.76MiB | 3.56MiB | 74.2KiB | 0 | 0.0638393 | 56.11MiB | 2.98MiB | 62.22KiB | 0 | 0.0531692 | False | False |
| syslog_splunk_hec_logs | 33.09KiB | 0.21 | 88.70% | 15.52MiB | 800.12KiB | 16.27KiB | 0 | 0.0503234 | 15.56MiB | 640.88KiB | 13.08KiB | 0 | 0.0402241 | False | False |
| splunk_hec_indexer_ack_blackhole | 19.43KiB | 0.08 | 57.99% | 23.75MiB | 877.24KiB | 17.84KiB | 0 | 0.0360618 | 23.77MiB | 795.93KiB | 16.2KiB | 0 | 0.0326929 | False | False |
| splunk_hec_to_splunk_hec_logs_noack | 17.34KiB | 0.07 | 86.96% | 23.82MiB | 455.07KiB | 9.3KiB | 0 | 0.0186518 | 23.84MiB | 328.49KiB | 6.71KiB | 0 | 0.013454 | False | False |
| enterprise_http_to_http | 1.25KiB | 0.01 | 13.27% | 23.84MiB | 257.35KiB | 5.25KiB | 0 | 0.0105377 | 23.85MiB | 260.33KiB | 5.33KiB | 0 | 0.0106591 | False | False |
| splunk_hec_to_splunk_hec_logs_acks | -5.89KiB | -0.02 | 18.26% | 23.76MiB | 867.76KiB | 17.65KiB | 0 | 0.0356625 | 23.75MiB | 906.02KiB | 18.43KiB | 0 | 0.0372437 | False | False |
| file_to_blackhole | -46.34KiB | -0.05 | 37.01% | 95.34MiB | 2.93MiB | 60.79KiB | 0 | 0.0307511 | 95.3MiB | 3.58MiB | 74.53KiB | 0 | 0.0376103 | False | False |
| http_to_http_json | -24.3KiB | -0.1 | 94.54% | 23.85MiB | 376.67KiB | 7.69KiB | 0 | 0.0154218 | 23.82MiB | 490.64KiB | 10.03KiB | 0 | 0.0201079 | False | False |
| syslog_regex_logs2metric_ddmetrics | -9.43KiB | -0.13 | 26.60% | 7.35MiB | 971.07KiB | 19.8KiB | 0 | 0.128985 | 7.34MiB | 954.84KiB | 19.46KiB | 0 | 0.126989 | False | False |
| fluent_elasticsearch | -161.77KiB | -0.2 | 100.00% | 79.47MiB | 53.29KiB | 1.08KiB | 0 | 0.000654679 | 79.31MiB | 1.47MiB | 30.29KiB | 0 | 0.0185477 | False | False |
| splunk_hec_route_s3 | -57.23KiB | -0.26 | 66.64% | 21.58MiB | 2.06MiB | 43.0KiB | 0 | 0.0956333 | 21.53MiB | 1.95MiB | 40.67KiB | 0 | 0.0904237 | False | False |
| http_pipelines_blackhole | -5.16KiB | -0.3 | 98.04% | 1.7MiB | 10.87KiB | 227.59B | 0 | 0.00623939 | 1.7MiB | 107.93KiB | 2.2KiB | 0 | 0.0621097 | False | False |
| syslog_humio_logs | -54.77KiB | -0.33 | 100.00% | 16.09MiB | 227.97KiB | 4.65KiB | 0 | 0.0138298 | 16.04MiB | 140.17KiB | 2.87KiB | 0 | 0.00853209 | False | False |
| http_to_http_acks | -58.35KiB | -0.33 | 18.95% | 17.31MiB | 8.31MiB | 173.8KiB | 0 | 0.480145 | 17.26MiB | 8.16MiB | 170.25KiB | 0 | 0.472582 | True | True |
| http_to_http_noack | -91.05KiB | -0.37 | 99.99% | 23.84MiB | 408.13KiB | 8.35KiB | 0 | 0.0167166 | 23.75MiB | 1.07MiB | 22.21KiB | 0 | 0.0448396 | False | False |
| http_pipelines_no_grok_blackhole | -58.42KiB | -0.53 | 99.80% | 10.67MiB | 38.02KiB | 794.89B | 0 | 0.00348051 | 10.61MiB | 926.0KiB | 18.85KiB | 0 | 0.0852234 | False | False |
| syslog_log2metric_splunk_hec_metrics | -114.88KiB | -0.7 | 100.00% | 16.06MiB | 513.42KiB | 10.47KiB | 0 | 0.0312212 | 15.94MiB | 721.32KiB | 14.7KiB | 0 | 0.0441722 | False | False |
| datadog_agent_remap_blackhole_acks | -404.41KiB | -0.71 | 99.91% | 55.95MiB | 4.32MiB | 89.86KiB | 0 | 0.0771295 | 55.55MiB | 3.93MiB | 82.18KiB | 0 | 0.0708043 | False | False |
| datadog_agent_remap_datadog_logs_acks | -443.1KiB | -0.8 | 100.00% | 54.18MiB | 2.98MiB | 62.32KiB | 0 | 0.0550172 | 53.75MiB | 3.98MiB | 82.91KiB | 0 | 0.0740881 | False | False |
| datadog_agent_remap_datadog_logs | -622.61KiB | -1.11 | 100.00% | 54.79MiB | 632.59KiB | 12.95KiB | 0 | 0.0112735 | 54.18MiB | 3.56MiB | 74.08KiB | 0 | 0.0656321 | False | False |
| syslog_loki | -279.76KiB | -1.79 | 100.00% | 15.28MiB | 322.86KiB | 6.61KiB | 0 | 0.0206251 | 15.01MiB | 783.47KiB | 15.93KiB | 0 | 0.050961 | False | False |
| http_text_to_http_json | -710.14KiB | -1.81 | 100.00% | 38.41MiB | 851.87KiB | 17.39KiB | 0 | 0.0216551 | 37.71MiB | 858.29KiB | 17.52KiB | 0 | 0.0222193 | False | False |
| socket_to_socket_blackhole | -644.77KiB | -2.79 | 100.00% | 22.59MiB | 602.71KiB | 12.3KiB | 0 | 0.0260524 | 21.96MiB | 520.69KiB | 10.64KiB | 0 | 0.0231525 | False | False |

@@ -36,6 +36,7 @@ address = "0.0.0.0:9090"
type = "aws_s3"
inputs = ["container_type.sidecar"]

region = "us-west-2"
Member

Did you find this to be needed too? Just trying to avoid the test cases falling out of sync. I think ideally we'd just have one set rather than this set in addition to soaks/tests. Maybe we could just symlink the test case directory for now?

Contributor

@blt blt Nov 2, 2022

Yes, we run in a more restricted environment in the new setup with no AWS data for Vector to pick up. With regard to things falling out of sync, my hope is that the soak tests won't last much longer, TBH. Also if we did want to symlink we could do that but lading would need to be updated on the soak side.

@goxberry goxberry merged commit 976687f into master Nov 2, 2022
@goxberry goxberry deleted the goxberry/add_analysis_from_smp branch November 2, 2022 22:14
4 participants