Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

enhancement(elasticsearch sink): Multiple hosts #13236

Closed
wants to merge 62 commits into from

Conversation

ktff
Copy link
Contributor

@ktff ktff commented Jun 20, 2022

Ref #3649

Implements multiple host feature for elasticsearch sink with failover. Where the amount of inflight requests for each endpoint is an estimate of its load.

Possible extensions

Todo

  • Internal metrics
  • Internal logs
  • Documentation

ktff added 12 commits June 16, 2022 19:26
Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
@netlify
Copy link

netlify bot commented Jun 20, 2022

Deploy Preview for vector-project canceled.

Name Link
🔨 Latest commit 02d9add
🔍 Latest deploy log https://app.netlify.com/sites/vector-project/deploys/62f2515d5fa8050009761c5a

@github-actions github-actions bot added the domain: sinks Anything related to the Vector's sinks label Jun 20, 2022
ktff added 3 commits June 20, 2022 16:10
Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
@github-actions
Copy link

Soak Test Results

Baseline: f296e01
Comparison: a480172
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests are to determine, quickly, if vector performance is changed and to what degree by a pull request. Where appropriate units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparision SHAs, with 90.0% confidence OR have been detected as newly erratic. Negative values mean that baseline is faster, positive comparison. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

No interesting changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

Fine details of change detection per experiment.
experiment Δ mean Δ mean % confidence baseline mean baseline stdev baseline stderr baseline outlier % baseline CoV comparison mean comparison stdev comparison stderr comparison outlier % comparison CoV erratic declared erratic
syslog_log2metric_splunk_hec_metrics 413.75KiB 2.3 100.00% 17.53MiB 770.48KiB 15.68KiB 0 0.0429051 17.94MiB 638.6KiB 13.01KiB 0 0.0347604 False False
splunk_hec_route_s3 431.44KiB 2.22 100.00% 18.99MiB 2.36MiB 49.4KiB 0 0.124408 19.42MiB 2.2MiB 46.01KiB 0 0.11332 False False
syslog_log2metric_humio_metrics 302.58KiB 2.14 100.00% 13.81MiB 410.0KiB 8.36KiB 0 0.0289956 14.1MiB 154.39KiB 3.16KiB 0 0.0106897 False False
syslog_loki 327.22KiB 2.1 100.00% 15.19MiB 561.32KiB 11.44KiB 0 0.0360821 15.51MiB 326.85KiB 6.67KiB 0 0.0205772 False False
syslog_splunk_hec_logs 269.41KiB 1.64 100.00% 16.0MiB 140.72KiB 2.87KiB 0 0.00858443 16.27MiB 141.44KiB 2.89KiB 0 0.00848889 False False
http_pipelines_blackhole_acks 52.87KiB 1.2 99.87% 4.29MiB 574.67KiB 11.75KiB 0 0.130645 4.35MiB 560.63KiB 11.45KiB 0 0.125939 False False
syslog_humio_logs 189.09KiB 1.11 100.00% 16.7MiB 134.53KiB 2.76KiB 0 0.00786623 16.88MiB 141.45KiB 2.89KiB 0 0.0081804 False False
syslog_regex_logs2metric_ddmetrics 122.6KiB 0.96 100.00% 12.49MiB 588.01KiB 11.98KiB 0 0.0459779 12.61MiB 699.98KiB 14.24KiB 0 0.0542133 False False
http_pipelines_no_grok_blackhole 160.75KiB 0.94 99.98% 16.63MiB 1.3MiB 27.3KiB 0 0.0783276 16.78MiB 1.64MiB 34.15KiB 0 0.0976064 False False
http_to_http_noack 96.25KiB 0.4 100.00% 23.75MiB 1.05MiB 21.89KiB 0 0.0441933 23.85MiB 240.86KiB 4.92KiB 0 0.00986198 False False
datadog_agent_remap_datadog_logs 89.16KiB 0.13 77.26% 67.64MiB 3.01MiB 62.73KiB 0 0.0444299 67.73MiB 1.86MiB 38.98KiB 0 0.0274449 False False
fluent_elasticsearch 32.58KiB 0.04 97.75% 79.44MiB 707.39KiB 14.22KiB 0 0.00869404 79.47MiB 54.7KiB 1.1KiB 0 0.000671959 False False
http_to_http_json 758.99B 0 6.14% 23.84MiB 333.64KiB 6.82KiB 0 0.0136615 23.85MiB 332.11KiB 6.78KiB 0 0.0135983 False False
splunk_hec_to_splunk_hec_logs_noack 477.97B 0 3.98% 23.84MiB 321.61KiB 6.58KiB 0 0.0131732 23.84MiB 323.82KiB 6.63KiB 0 0.0132635 False False
splunk_hec_indexer_ack_blackhole -10.46KiB -0.04 36.08% 23.77MiB 751.05KiB 15.28KiB 0 0.0308522 23.76MiB 798.67KiB 16.25KiB 0 0.0328223 False False
splunk_hec_to_splunk_hec_logs_acks -15.04KiB -0.06 49.73% 23.77MiB 750.54KiB 15.27KiB 0 0.0308319 23.75MiB 808.4KiB 16.44KiB 0 0.0332295 False False
socket_to_socket_blackhole -16.39KiB -0.11 73.66% 14.04MiB 548.38KiB 11.17KiB 0 0.0381477 14.02MiB 464.63KiB 9.48KiB 0 0.0323591 False False
file_to_blackhole -196.73KiB -0.2 95.26% 95.29MiB 3.3MiB 68.49KiB 0 0.0346013 95.1MiB 3.48MiB 71.77KiB 0 0.0365442 False False
http_pipelines_blackhole -16.24KiB -0.36 65.43% 4.44MiB 580.42KiB 11.9KiB 0 0.127748 4.42MiB 610.59KiB 12.45KiB 0 0.134869 False False
datadog_agent_remap_blackhole_acks -273.2KiB -0.39 99.80% 68.1MiB 3.29MiB 68.59KiB 0 0.0482674 67.83MiB 2.66MiB 55.51KiB 0 0.039171 False False
datadog_agent_remap_datadog_logs_acks -423.31KiB -0.61 100.00% 67.94MiB 1.02MiB 21.44KiB 0 0.0150723 67.53MiB 3.08MiB 64.33KiB 0 0.0455558 False False
datadog_agent_remap_blackhole -563.22KiB -0.84 100.00% 65.5MiB 844.08KiB 17.28KiB 0 0.0125828 64.95MiB 951.54KiB 19.47KiB 0 0.0143049 False False
http_to_http_acks -478.9KiB -2.57 96.59% 18.23MiB 7.3MiB 152.6KiB 0 0.40025 17.77MiB 7.97MiB 166.63KiB 0 0.448641 True True

Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
@github-actions
Copy link

Soak Test Results

Baseline: f296e01
Comparison: 0309d8a
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests are to determine, quickly, if vector performance is changed and to what degree by a pull request. Where appropriate units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparision SHAs, with 90.0% confidence OR have been detected as newly erratic. Negative values mean that baseline is faster, positive comparison. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

Changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

experiment Δ mean Δ mean % confidence
datadog_agent_remap_blackhole -6.4MiB -9.94 100.00%
Fine details of change detection per experiment.
experiment Δ mean Δ mean % confidence baseline mean baseline stdev baseline stderr baseline outlier % baseline CoV comparison mean comparison stdev comparison stderr comparison outlier % comparison CoV erratic declared erratic
syslog_log2metric_humio_metrics 499.72KiB 3.54 100.00% 13.77MiB 653.76KiB 13.33KiB 0 0.0463627 14.26MiB 126.05KiB 2.58KiB 0 0.00863294 False False
syslog_loki 351.03KiB 2.38 100.00% 14.38MiB 997.42KiB 20.34KiB 0 0.067711 14.73MiB 1.06MiB 22.11KiB 0 0.0717791 False False
http_pipelines_no_grok_blackhole 258.85KiB 1.46 100.00% 17.27MiB 1.08MiB 22.53KiB 0 0.0622676 17.52MiB 1.15MiB 23.97KiB 0 0.0655232 False False
syslog_log2metric_splunk_hec_metrics 250.0KiB 1.39 100.00% 17.55MiB 858.46KiB 17.45KiB 0 0.0477458 17.8MiB 447.47KiB 9.12KiB 0 0.0245461 False False
http_pipelines_blackhole_acks 40.6KiB 0.96 96.71% 4.13MiB 670.74KiB 13.71KiB 0 0.158401 4.17MiB 645.96KiB 13.19KiB 0 0.151101 False False
datadog_agent_remap_blackhole_acks 618.35KiB 0.9 100.00% 67.12MiB 3.71MiB 77.39KiB 0 0.0552892 67.72MiB 2.24MiB 46.74KiB 0 0.0330411 False False
syslog_splunk_hec_logs 106.82KiB 0.64 100.00% 16.4MiB 412.61KiB 8.42KiB 0 0.0245587 16.51MiB 385.16KiB 7.87KiB 0 0.02278 False False
syslog_humio_logs 109.83KiB 0.64 100.00% 16.66MiB 121.64KiB 2.49KiB 0 0.00712825 16.77MiB 132.89KiB 2.72KiB 0 0.00773753 False False
syslog_regex_logs2metric_ddmetrics 78.78KiB 0.62 100.00% 12.44MiB 624.91KiB 12.72KiB 0 0.0490622 12.51MiB 651.93KiB 13.27KiB 0 0.0508686 False False
http_to_http_noack 95.38KiB 0.39 100.00% 23.75MiB 1.05MiB 21.88KiB 0 0.0441676 23.85MiB 240.06KiB 4.9KiB 0 0.00982921 False False
http_to_http_acks 65.57KiB 0.36 21.40% 17.62MiB 8.14MiB 170.22KiB 0 0.462077 17.68MiB 8.19MiB 171.28KiB 0 0.463271 True True
socket_to_socket_blackhole 49.74KiB 0.35 98.16% 13.86MiB 667.81KiB 13.61KiB 0 0.0470294 13.91MiB 790.29KiB 16.11KiB 0 0.0554601 False False
http_pipelines_blackhole 9.84KiB 0.22 42.88% 4.46MiB 591.73KiB 12.11KiB 0 0.129444 4.47MiB 610.8KiB 12.46KiB 0 0.133328 False False
splunk_hec_indexer_ack_blackhole 13.66KiB 0.06 46.42% 23.76MiB 790.24KiB 16.07KiB 0 0.0324755 23.77MiB 742.87KiB 15.12KiB 0 0.0305116 False False
splunk_hec_to_splunk_hec_logs_acks 9.41KiB 0.04 32.87% 23.76MiB 789.5KiB 16.06KiB 0 0.0324449 23.77MiB 751.15KiB 15.29KiB 0 0.0308571 False False
fluent_elasticsearch 18.89KiB 0.02 92.72% 79.45MiB 519.84KiB 10.47KiB 0 0.00638798 79.47MiB 53.33KiB 1.07KiB 0 0.000655168 False False
splunk_hec_route_s3 2.8KiB 0.01 3.47% 19.62MiB 2.17MiB 45.45KiB 0 0.110816 19.62MiB 2.18MiB 45.62KiB 0 0.111202 False False
splunk_hec_to_splunk_hec_logs_noack 400.99B 0 3.32% 23.83MiB 324.21KiB 6.64KiB 0 0.0132807 23.84MiB 325.46KiB 6.67KiB 0 0.0133316 False False
http_to_http_json 937.66B 0 7.77% 23.85MiB 328.69KiB 6.71KiB 0 0.0134584 23.85MiB 321.67KiB 6.57KiB 0 0.0131703 False False
datadog_agent_remap_datadog_logs_acks -132.19KiB -0.2 81.72% 65.96MiB 2.94MiB 61.42KiB 0 0.0445195 65.84MiB 3.73MiB 77.9KiB 0 0.0566113 False False
file_to_blackhole -315.92KiB -0.32 98.81% 95.28MiB 3.42MiB 70.98KiB 0 0.0358594 94.98MiB 5.02MiB 103.56KiB 0 0.0528547 False False
datadog_agent_remap_datadog_logs -364.72KiB -0.52 99.99% 68.22MiB 2.41MiB 50.35KiB 0 0.0353403 67.86MiB 3.85MiB 80.37KiB 0 0.0567189 False False
datadog_agent_remap_blackhole -6.4MiB -9.94 100.00% 64.39MiB 6.97MiB 146.02KiB 0 0.108239 57.99MiB 9.76MiB 204.44KiB 0 0.168248 False False

ktff added 2 commits June 21, 2022 13:39
Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
@github-actions github-actions bot added the domain: external docs Anything related to Vector's external, public documentation label Jun 21, 2022
@ktff ktff marked this pull request as ready for review June 21, 2022 12:42
Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
@ktff
Copy link
Contributor Author

ktff commented Jun 21, 2022

@jszwedko this is ready for review.

cc. @sim0nx

@github-actions
Copy link

Soak Test Results

Baseline: f296e01
Comparison: 383b28d
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests are to determine, quickly, if vector performance is changed and to what degree by a pull request. Where appropriate units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparision SHAs, with 90.0% confidence OR have been detected as newly erratic. Negative values mean that baseline is faster, positive comparison. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

No interesting changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

Fine details of change detection per experiment.
experiment Δ mean Δ mean % confidence baseline mean baseline stdev baseline stderr baseline outlier % baseline CoV comparison mean comparison stdev comparison stderr comparison outlier % comparison CoV erratic declared erratic
syslog_log2metric_humio_metrics 272.24KiB 1.96 100.00% 13.54MiB 548.05KiB 11.18KiB 0 0.0395222 13.8MiB 512.84KiB 10.51KiB 0 0.0362711 False False
http_pipelines_no_grok_blackhole 239.79KiB 1.37 100.00% 17.07MiB 1.14MiB 23.87KiB 0 0.0667839 17.3MiB 1.18MiB 24.63KiB 0 0.0681743 False False
socket_to_socket_blackhole 146.08KiB 1.03 100.00% 13.89MiB 639.55KiB 13.02KiB 0 0.0449666 14.03MiB 486.45KiB 9.92KiB 0 0.0338546 False False
splunk_hec_route_s3 147.97KiB 0.78 93.76% 18.52MiB 2.69MiB 56.32KiB 0 0.145458 18.66MiB 2.68MiB 55.96KiB 0 0.143413 False False
syslog_log2metric_splunk_hec_metrics 76.24KiB 0.42 99.93% 17.6MiB 842.76KiB 17.14KiB 0 0.0467461 17.68MiB 705.8KiB 14.37KiB 0 0.0389846 False False
datadog_agent_remap_blackhole 252.13KiB 0.37 100.00% 67.14MiB 719.26KiB 14.72KiB 0 0.0104589 67.39MiB 650.52KiB 13.31KiB 0 0.00942465 False False
syslog_loki 56.25KiB 0.36 100.00% 15.11MiB 566.73KiB 11.55KiB 0 0.0366223 15.16MiB 226.7KiB 4.63KiB 0 0.0145965 False False
http_to_http_noack 69.1KiB 0.28 99.96% 23.78MiB 931.55KiB 18.98KiB 0 0.0382507 23.85MiB 237.77KiB 4.85KiB 0 0.00973564 False False
http_pipelines_blackhole 10.82KiB 0.25 45.99% 4.31MiB 608.09KiB 12.46KiB 0 0.137836 4.32MiB 613.29KiB 12.51KiB 0 0.138674 False False
datadog_agent_remap_datadog_logs_acks 140.65KiB 0.22 60.55% 62.02MiB 5.46MiB 114.27KiB 0 0.0879946 62.16MiB 5.7MiB 119.24KiB 0 0.0916 False False
fluent_elasticsearch 40.56KiB 0.05 95.70% 79.43MiB 996.95KiB 20.0KiB 0 0.0122541 79.47MiB 56.12KiB 1.13KiB 0 0.000689499 False False
splunk_hec_indexer_ack_blackhole 4.71KiB 0.02 16.98% 23.76MiB 772.43KiB 15.71KiB 0 0.0317377 23.77MiB 753.14KiB 15.33KiB 0 0.030939 False False
http_to_http_json 53.31B 0 0.42% 23.84MiB 345.82KiB 7.06KiB 0 0.0141615 23.84MiB 340.89KiB 6.96KiB 0 0.0139594 False False
splunk_hec_to_splunk_hec_logs_noack -787.83B -0 6.52% 23.84MiB 323.24KiB 6.62KiB 0 0.0132404 23.84MiB 326.19KiB 6.68KiB 0 0.0133616 False False
syslog_regex_logs2metric_ddmetrics -7.86KiB -0.06 32.35% 12.28MiB 637.08KiB 12.97KiB 0 0.0506475 12.27MiB 672.39KiB 13.67KiB 0 0.0534877 False False
splunk_hec_to_splunk_hec_logs_acks -18.74KiB -0.08 60.08% 23.77MiB 731.97KiB 14.9KiB 0 0.0300631 23.75MiB 810.95KiB 16.49KiB 0 0.0333325 False False
file_to_blackhole -233.21KiB -0.24 98.81% 95.38MiB 2.95MiB 61.36KiB 0 0.030886 95.16MiB 3.37MiB 69.5KiB 0 0.0353701 False False
syslog_humio_logs -42.21KiB -0.25 100.00% 16.44MiB 218.44KiB 4.48KiB 0 0.0129743 16.4MiB 242.74KiB 4.97KiB 0 0.0144539 False False
syslog_splunk_hec_logs -45.91KiB -0.27 99.99% 16.4MiB 388.77KiB 7.94KiB 0 0.0231486 16.35MiB 396.8KiB 8.11KiB 0 0.0236914 False False
datadog_agent_remap_datadog_logs -502.93KiB -0.72 100.00% 67.96MiB 2.63MiB 54.91KiB 0 0.0386912 67.47MiB 4.1MiB 85.54KiB 0 0.0607635 False False
http_pipelines_blackhole_acks -58.58KiB -1.39 99.71% 4.11MiB 659.61KiB 13.49KiB 0 0.156642 4.05MiB 701.08KiB 14.32KiB 0 0.168838 False False
http_to_http_acks -315.43KiB -1.71 83.18% 17.99MiB 7.56MiB 158.01KiB 0 0.420028 17.68MiB 7.92MiB 165.58KiB 0 0.447908 True True
datadog_agent_remap_blackhole_acks -1.84MiB -2.71 100.00% 67.67MiB 3.38MiB 70.46KiB 0 0.0498698 65.84MiB 4.96MiB 103.66KiB 0 0.0753092 False False

@github-actions
Copy link

Soak Test Results

Baseline: f296e01
Comparison: 9c791f8
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests are to determine, quickly, if vector performance is changed and to what degree by a pull request. Where appropriate units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparision SHAs, with 90.0% confidence OR have been detected as newly erratic. Negative values mean that baseline is faster, positive comparison. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

No interesting changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

Fine details of change detection per experiment.
experiment Δ mean Δ mean % confidence baseline mean baseline stdev baseline stderr baseline outlier % baseline CoV comparison mean comparison stdev comparison stderr comparison outlier % comparison CoV erratic declared erratic
syslog_log2metric_humio_metrics 350.26KiB 2.47 100.00% 13.82MiB 551.88KiB 11.25KiB 0 0.0389872 14.16MiB 138.2KiB 2.83KiB 0 0.00952712 False False
socket_to_socket_blackhole 164.97KiB 1.17 100.00% 13.81MiB 662.66KiB 13.5KiB 0 0.0468431 13.97MiB 551.35KiB 11.25KiB 0 0.0385258 False False
http_pipelines_blackhole_acks 36.97KiB 0.82 97.82% 4.38MiB 564.04KiB 11.53KiB 0 0.125601 4.42MiB 550.89KiB 11.25KiB 0 0.121671 False False
http_pipelines_blackhole 28.73KiB 0.7 79.75% 4.03MiB 777.41KiB 15.94KiB 0 0.188568 4.05MiB 781.88KiB 15.94KiB 0 0.18834 False False
datadog_agent_remap_blackhole_acks 354.11KiB 0.53 99.98% 65.34MiB 3.64MiB 75.87KiB 0 0.0556307 65.69MiB 2.84MiB 59.32KiB 0 0.0432176 False False
splunk_hec_route_s3 102.21KiB 0.51 90.14% 19.65MiB 2.1MiB 43.87KiB 0 0.106783 19.75MiB 2.09MiB 43.64KiB 0 0.10567 False False
syslog_log2metric_splunk_hec_metrics 81.61KiB 0.45 99.99% 17.61MiB 777.56KiB 15.82KiB 0 0.0431019 17.69MiB 618.03KiB 12.59KiB 0 0.0341045 False False
http_to_http_noack 78.87KiB 0.32 99.99% 23.77MiB 969.58KiB 19.75KiB 0 0.0398272 23.85MiB 249.97KiB 5.1KiB 0 0.0102348 False False
http_to_http_acks 48.02KiB 0.26 15.76% 17.8MiB 8.09MiB 169.07KiB 0 0.454177 17.84MiB 8.25MiB 172.37KiB 0 0.462018 True True
fluent_elasticsearch 84.74KiB 0.1 97.42% 79.39MiB 1.84MiB 37.98KiB 0 0.0232086 79.47MiB 52.16KiB 1.05KiB 0 0.000640825 False False
datadog_agent_remap_blackhole 63.01KiB 0.1 55.86% 63.41MiB 2.89MiB 60.49KiB 0 0.0455268 63.47MiB 2.63MiB 55.11KiB 0 0.0414788 False False
syslog_loki 11.71KiB 0.08 56.92% 14.71MiB 595.04KiB 12.13KiB 0 0.0395029 14.72MiB 419.77KiB 8.58KiB 0 0.0278456 False False
syslog_regex_logs2metric_ddmetrics 4.03KiB 0.03 20.60% 12.0MiB 486.1KiB 9.9KiB 0 0.039558 12.0MiB 581.64KiB 11.83KiB 0 0.0473168 False False
http_to_http_json 2.45KiB 0.01 19.85% 23.84MiB 338.04KiB 6.91KiB 0 0.0138414 23.85MiB 336.37KiB 6.87KiB 0 0.0137718 False False
splunk_hec_to_splunk_hec_logs_noack 1017.71B 0 8.44% 23.84MiB 322.12KiB 6.6KiB 0 0.0131949 23.84MiB 325.77KiB 6.67KiB 0 0.0133439 False False
syslog_splunk_hec_logs -11.67KiB -0.07 56.69% 16.12MiB 535.89KiB 10.94KiB 0 0.0324538 16.11MiB 494.71KiB 10.1KiB 0 0.0299812 False False
splunk_hec_indexer_ack_blackhole -19.8KiB -0.08 63.88% 23.78MiB 713.33KiB 14.52KiB 0 0.0292916 23.76MiB 791.54KiB 16.1KiB 0 0.0325298 False False
splunk_hec_to_splunk_hec_logs_acks -28.82KiB -0.12 81.71% 23.78MiB 687.81KiB 14.0KiB 0 0.0282371 23.75MiB 810.81KiB 16.49KiB 0 0.0333258 False False
syslog_humio_logs -29.7KiB -0.17 100.00% 16.6MiB 124.29KiB 2.55KiB 0 0.0073086 16.57MiB 125.07KiB 2.56KiB 0 0.00736754 False False
file_to_blackhole -232.67KiB -0.24 97.86% 95.37MiB 3.29MiB 68.37KiB 0 0.0344482 95.14MiB 3.61MiB 74.45KiB 0 0.0378957 False False
datadog_agent_remap_datadog_logs -185.18KiB -0.27 97.88% 66.71MiB 2.83MiB 59.01KiB 0 0.042359 66.53MiB 2.6MiB 54.5KiB 0 0.0390996 False False
datadog_agent_remap_datadog_logs_acks -465.67KiB -0.67 100.00% 67.87MiB 643.61KiB 13.15KiB 0 0.0092584 67.42MiB 2.69MiB 56.22KiB 0 0.0398442 False False
http_pipelines_no_grok_blackhole -176.75KiB -1 100.00% 17.2MiB 1.22MiB 25.49KiB 0 0.070737 17.03MiB 1.63MiB 34.02KiB 0 0.0957281 False False

ktff added 2 commits June 21, 2022 17:09
Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
@tobz
Copy link
Contributor

tobz commented Aug 2, 2022

@ktff I'm not sure I see why anything needs to be split up.

The thesis of my argument is that the idea that we should mark an endpoint as unhealthy after a mere single error (when it's temporary like a 429) is wrong.

ARC already generally handles this by multiplicatively decreasing the concurrency limit under backpressure/RTT deviations, which will quickly ramp down the limit for an endpoint that is under sustained backpressure. We want ARC to have a working history of the host to keep the limit low and increase it again slowly, rather than just having the endpoint become "healthy" again (when the reactivate delay expires) and gets immediately slammed with full traffic again.

There's a possible future where any sort of health/circuit breaker/etc service can also take backpressure into account as a slower growing counter of which the health status is derived from.

@ktff
Copy link
Contributor Author

ktff commented Aug 3, 2022

@tobz I agree with the benefits, but there is a tradeoff that needs to be made. See #3649 (comment). In short I would say that it only boils down to how much times we would want to retry with the same endpoint.

Regarding splitting, I was thinking of resolving that tradeoff by splitting some parts of ARC but its not doable.

@tobz
Copy link
Contributor

tobz commented Aug 3, 2022

I think right now, I'm OK with possibly having a retry land on the same endpoint that it just tried to use (and got an error with) to simplify the logic overall.

To sort of say it all another way:

  • HealthService should:

    • only pay attention to non-client/non-backpressure errors (5xx)
    • should switch to using an accumulator approach (i.e. 3-5 errors in a row before it trips, but a single successful response, when it lets through the next request in the half-open state, should reset the accumulator entirely)
    • use an exponential backoff for the amount of time it sleeps/marks itself as not ready (must also reset to zero when successful response occurs)
  • DistributedService should:

    • have retries before the balance layer
    • wrap each service in the balance layer as health and then ARC

If you think the comments in #3649 need addressing, can you copy/paste them here? It doesn't make a lot of sense to me to split up the conversation about how to approach the design from the actual PR making the design changes..

@ktff
Copy link
Contributor Author

ktff commented Aug 4, 2022

If you think the comments in #3649 need addressing, can you copy/paste them here?

Related to ARC segment in #13236 (review) it kinda became a > requirement to put ARC bellow distribution so that each endpoint is managed by its own ARC. To do it, dependency circle needs > to be broken, unfortunately there are two features that are practically mutually exclusive under such design:

  • Failover
  • ARC sensitivity to retries
    • We want retries to increase the effective RTT and back-off concurrency

They conflict in a way that for ARC to be sensitive to retries they need to happen with the same endpoint, while failover wants for > retries to happen with different endpoints.

Between the two, ARC sensitivity to retries seems less important and could maybe me managed in some other way in ARC so > > I'm for prioritizing failover. That would leave us with two designs:

  1. Always failover on retries.
  2. Retry some amount of times on the same endpoint, if that fails then failover.

Going by

I'm OK with possibly having a retry land on the same endpoint

and

have retries before the balance layer

you are suggesting 1., which I'm also fine with, but as mentioned will disable ARC ability to be sensitive to retries. If we've reached consensus on this, then I'll start addressing your other comments. Although I would like to get @bruceg blessing on this if possible so that we don't bounce from design to design.

@ktff
Copy link
Contributor Author

ktff commented Aug 8, 2022

@tobz ready for review.

Besides issues that you raised, one notable change is that distribution healthchecks now depend on cluster status. This can be configured through an option.

@github-actions
Copy link

github-actions bot commented Aug 8, 2022

Soak Test Results

Baseline: 4611307
Comparison: 1b08156
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests are to determine, quickly, if vector performance is changed and to what degree by a pull request. Where appropriate units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparision SHAs, with 90.0% confidence OR have been detected as newly erratic. Negative values mean that baseline is faster, positive comparison. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

Changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

experiment Δ mean Δ mean % confidence
socket_to_socket_blackhole -10.18MiB -43.62 100.00%
Fine details of change detection per experiment.
experiment Δ mean Δ mean % confidence baseline mean baseline stdev baseline stderr baseline outlier % baseline CoV comparison mean comparison stdev comparison stderr comparison outlier % comparison CoV erratic declared erratic
syslog_log2metric_splunk_hec_metrics 428.72KiB 2.34 100.00% 17.88MiB 470.27KiB 9.59KiB 0 0.0256851 18.29MiB 845.63KiB 17.2KiB 0 0.0451294 False False
syslog_humio_logs 380.01KiB 2.3 100.00% 16.11MiB 302.33KiB 6.17KiB 0 0.0183271 16.48MiB 321.55KiB 6.58KiB 0 0.019053 False False
http_to_http_acks 386.71KiB 2.17 88.37% 17.43MiB 7.92MiB 165.65KiB 0 0.454572 17.8MiB 8.73MiB 182.11KiB 0 0.48997 True True
syslog_splunk_hec_logs 277.23KiB 1.65 100.00% 16.42MiB 774.81KiB 15.76KiB 0 0.0460592 16.7MiB 732.78KiB 14.92KiB 0 0.0428542 False False
syslog_regex_logs2metric_ddmetrics 160.55KiB 1.26 100.00% 12.41MiB 571.55KiB 11.65KiB 0 0.0449564 12.57MiB 564.66KiB 11.51KiB 0 0.0438608 False False
splunk_hec_to_splunk_hec_logs_noack 4.94KiB 0.02 36.51% 23.83MiB 382.63KiB 7.81KiB 0 0.0156758 23.84MiB 335.92KiB 6.86KiB 0 0.0137592 False False
splunk_hec_to_splunk_hec_logs_acks 6.0KiB 0.02 19.65% 23.75MiB 846.73KiB 17.22KiB 0 0.0348069 23.76MiB 829.11KiB 16.87KiB 0 0.0340741 False False
splunk_hec_indexer_ack_blackhole 4.71KiB 0.02 14.24% 23.75MiB 924.1KiB 18.8KiB 0 0.0379947 23.75MiB 901.06KiB 18.33KiB 0 0.0370403 False False
enterprise_http_to_http -1.99KiB -0.01 21.15% 23.85MiB 257.78KiB 5.26KiB 0 0.0105539 23.85MiB 254.96KiB 5.22KiB 0 0.0104394 False False
file_to_blackhole -51.02KiB -0.05 41.40% 95.34MiB 2.84MiB 58.78KiB 0 0.0297352 95.29MiB 3.51MiB 72.94KiB 0 0.0367867 False False
datadog_agent_remap_blackhole -77.05KiB -0.12 53.49% 64.34MiB 4.36MiB 90.82KiB 0 0.0677555 64.27MiB 2.57MiB 53.64KiB 0 0.0399759 False False
http_to_http_json -31.41KiB -0.13 98.83% 23.85MiB 350.04KiB 7.15KiB 0 0.0143313 23.82MiB 499.42KiB 10.2KiB 0 0.0204734 False False
http_pipelines_no_grok_blackhole -37.5KiB -0.33 87.16% 11.1MiB 320.27KiB 6.54KiB 0 0.0281701 11.06MiB 1.14MiB 23.78KiB 0 0.103217 False False
http_to_http_noack -118.43KiB -0.49 100.00% 23.84MiB 404.37KiB 8.27KiB 0 0.0165626 23.72MiB 1.19MiB 24.86KiB 0 0.0502919 False False
fluent_elasticsearch -444.93KiB -0.55 100.00% 79.47MiB 53.35KiB 1.08KiB 0 0.000655471 79.04MiB 4.36MiB 89.54KiB 0 0.0551493 False False
http_pipelines_blackhole_acks -6.83KiB -0.57 97.50% 1.17MiB 114.39KiB 2.33KiB 0 0.0951401 1.17MiB 96.56KiB 1.97KiB 0 0.0807717 False False
datadog_agent_remap_blackhole_acks -751.24KiB -1.12 100.00% 65.28MiB 4.34MiB 90.47KiB 0 0.0665201 64.55MiB 2.63MiB 54.98KiB 0 0.0406966 False False
http_pipelines_blackhole -22.82KiB -1.44 100.00% 1.55MiB 40.47KiB 846.69B 0 0.0255585 1.52MiB 104.14KiB 2.12KiB 0 0.0667229 False False
splunk_hec_route_s3 -315.24KiB -1.62 100.00% 19.0MiB 2.2MiB 45.9KiB 0 0.116007 18.69MiB 2.18MiB 45.7KiB 0 0.116809 False False
syslog_log2metric_humio_metrics -339.64KiB -2.54 100.00% 13.04MiB 252.33KiB 5.15KiB 0 0.0188911 12.71MiB 426.93KiB 8.7KiB 0 0.0327966 False False
datadog_agent_remap_datadog_logs_acks -1.83MiB -2.88 100.00% 63.48MiB 3.51MiB 73.41KiB 0 0.0553587 61.65MiB 4.38MiB 91.18KiB 0 0.0710312 False False
datadog_agent_remap_datadog_logs -2.22MiB -3.52 100.00% 63.17MiB 1.08MiB 22.6KiB 0 0.017064 60.95MiB 4.22MiB 87.83KiB 0 0.0691891 False False
syslog_loki -797.64KiB -5.38 100.00% 14.48MiB 502.43KiB 10.29KiB 0 0.03388 13.7MiB 770.19KiB 15.66KiB 0 0.0548886 False False
http_text_to_http_json -2.51MiB -6.46 100.00% 38.9MiB 843.01KiB 17.21KiB 0 0.0211571 36.39MiB 651.27KiB 13.29KiB 0 0.017473 False False
socket_to_socket_blackhole -10.18MiB -43.62 100.00% 23.33MiB 226.54KiB 4.62KiB 0 0.00948222 13.15MiB 136.59KiB 2.79KiB 0 0.0101409 False False

@github-actions
Copy link

github-actions bot commented Aug 8, 2022

Soak Test Results

Baseline: 4611307
Comparison: 9fd7e23
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests are to determine, quickly, if vector performance is changed and to what degree by a pull request. Where appropriate units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparision SHAs, with 90.0% confidence OR have been detected as newly erratic. Negative values mean that baseline is faster, positive comparison. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

Changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

experiment Δ mean Δ mean % confidence
socket_to_socket_blackhole -10.25MiB -43.57 100.00%
Fine details of change detection per experiment.
experiment Δ mean Δ mean % confidence baseline mean baseline stdev baseline stderr baseline outlier % baseline CoV comparison mean comparison stdev comparison stderr comparison outlier % comparison CoV erratic declared erratic
syslog_humio_logs 383.55KiB 2.32 100.00% 16.14MiB 490.99KiB 10.02KiB 0 0.0296959 16.52MiB 513.29KiB 10.51KiB 0 0.0303402 False False
syslog_log2metric_splunk_hec_metrics 359.93KiB 2.01 100.00% 17.49MiB 692.12KiB 14.1KiB 0 0.0386397 17.84MiB 810.09KiB 16.49KiB 0 0.0443347 False False
syslog_splunk_hec_logs 285.12KiB 1.72 100.00% 16.2MiB 830.31KiB 16.89KiB 0 0.0500476 16.48MiB 749.81KiB 15.27KiB 0 0.0444315 False False
syslog_regex_logs2metric_ddmetrics 164.59KiB 1.3 100.00% 12.38MiB 580.46KiB 11.82KiB 0 0.0457909 12.54MiB 582.88KiB 11.88KiB 0 0.0453922 False False
http_to_http_acks 75.76KiB 0.42 23.92% 17.51MiB 8.13MiB 169.9KiB 0 0.464066 17.59MiB 8.71MiB 181.81KiB 0 0.495326 True True
http_pipelines_blackhole_acks 1.16KiB 0.1 32.98% 1.13MiB 114.62KiB 2.33KiB 0 0.0990538 1.13MiB 68.08KiB 1.39KiB 0 0.058773 False False
splunk_hec_to_splunk_hec_logs_noack 12.45KiB 0.05 73.34% 23.83MiB 433.39KiB 8.85KiB 0 0.0177586 23.84MiB 336.23KiB 6.86KiB 0 0.0137704 False False
enterprise_http_to_http -28.11B -0 0.31% 23.85MiB 252.63KiB 5.16KiB 0 0.0103438 23.85MiB 241.47KiB 4.94KiB 0 0.00988672 False False
splunk_hec_indexer_ack_blackhole -1.94KiB -0.01 5.79% 23.75MiB 916.25KiB 18.64KiB 0 0.0376707 23.75MiB 939.51KiB 19.11KiB 0 0.03863 False False
splunk_hec_to_splunk_hec_logs_acks -5.97KiB -0.02 18.69% 23.75MiB 862.17KiB 17.53KiB 0 0.0354451 23.74MiB 892.77KiB 18.15KiB 0 0.0367122 False False
file_to_blackhole -62.32KiB -0.06 47.27% 95.34MiB 3.14MiB 65.14KiB 0 0.0329528 95.28MiB 3.56MiB 73.99KiB 0 0.0373525 False False
http_to_http_json -33.34KiB -0.14 99.15% 23.85MiB 352.99KiB 7.21KiB 0 0.0144527 23.81MiB 510.24KiB 10.42KiB 0 0.0209198 False False
fluent_elasticsearch -179.15KiB -0.22 100.00% 79.47MiB 54.26KiB 1.1KiB 0 0.000666608 79.3MiB 1.67MiB 34.36KiB 0 0.021033 False False
datadog_agent_remap_blackhole -171.67KiB -0.26 88.64% 63.79MiB 4.33MiB 90.13KiB 0 0.067817 63.63MiB 2.89MiB 60.35KiB 0 0.0454432 False False
http_to_http_noack -96.51KiB -0.4 100.00% 23.85MiB 245.37KiB 5.01KiB 0 0.0100464 23.75MiB 1.06MiB 22.2KiB 0 0.0448015 False False
http_pipelines_no_grok_blackhole -49.37KiB -0.43 98.32% 11.24MiB 62.86KiB 1.28KiB 0 0.00545866 11.2MiB 1012.13KiB 20.6KiB 0 0.0882654 False False
datadog_agent_remap_blackhole_acks -871.28KiB -1.31 100.00% 65.18MiB 4.31MiB 89.86KiB 0 0.066179 64.33MiB 3.31MiB 69.12KiB 0 0.0514039 False False
splunk_hec_route_s3 -260.33KiB -1.35 99.99% 18.84MiB 2.3MiB 47.93KiB 0 0.122215 18.58MiB 2.22MiB 46.38KiB 0 0.119379 False False
http_pipelines_blackhole -34.85KiB -2.08 100.00% 1.63MiB 10.09KiB 211.25B 0 0.00603617 1.6MiB 132.57KiB 2.7KiB 0 0.0809668 False False
datadog_agent_remap_datadog_logs_acks -1.82MiB -2.87 100.00% 63.37MiB 3.63MiB 75.85KiB 0 0.0573287 61.55MiB 4.38MiB 91.07KiB 0 0.0710696 False False
syslog_log2metric_humio_metrics -389.59KiB -2.89 100.00% 13.16MiB 206.36KiB 4.21KiB 0 0.015316 12.77MiB 523.9KiB 10.67KiB 0 0.0400413 False False
datadog_agent_remap_datadog_logs -2.33MiB -3.69 100.00% 63.28MiB 655.63KiB 13.43KiB 0 0.0101152 60.95MiB 4.12MiB 85.77KiB 0 0.0675644 False False
syslog_loki -817.45KiB -5.35 100.00% 14.92MiB 310.35KiB 6.35KiB 0 0.0203134 14.12MiB 699.44KiB 14.22KiB 0 0.0483697 False False
http_text_to_http_json -2.65MiB -6.72 100.00% 39.39MiB 740.45KiB 15.11KiB 0 0.0183535 36.74MiB 688.26KiB 14.05KiB 0 0.0182889 False False
socket_to_socket_blackhole -10.25MiB -43.57 100.00% 23.52MiB 140.61KiB 2.87KiB 0 0.00583764 13.27MiB 99.05KiB 2.02KiB 0 0.00728733 False False

@github-actions
Copy link

github-actions bot commented Aug 8, 2022

Soak Test Results

Baseline: 4611307
Comparison: 29de90e
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests are to determine, quickly, if vector performance is changed and to what degree by a pull request. Where appropriate units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparision SHAs, with 90.0% confidence OR have been detected as newly erratic. Negative values mean that baseline is faster, positive comparison. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

Changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

experiment Δ mean Δ mean % confidence
socket_to_socket_blackhole -9.89MiB -43.01 100.00%
Fine details of change detection per experiment.
experiment Δ mean Δ mean % confidence baseline mean baseline stdev baseline stderr baseline outlier % baseline CoV comparison mean comparison stdev comparison stderr comparison outlier % comparison CoV erratic declared erratic
syslog_log2metric_splunk_hec_metrics 421.91KiB 2.38 100.00% 17.29MiB 854.17KiB 17.41KiB 0 0.048229 17.7MiB 992.0KiB 20.22KiB 0 0.0547077 False False
http_to_http_acks 416.51KiB 2.35 90.54% 17.33MiB 8.1MiB 169.36KiB 0 0.467418 17.74MiB 8.75MiB 182.68KiB 0 0.493349 True True
syslog_humio_logs 357.14KiB 2.15 100.00% 16.21MiB 368.26KiB 7.52KiB 0 0.022187 16.55MiB 399.93KiB 8.19KiB 0 0.023587 False False
syslog_splunk_hec_logs 278.01KiB 1.65 100.00% 16.42MiB 731.48KiB 14.88KiB 0 0.0434922 16.69MiB 617.86KiB 12.6KiB 0 0.0361392 False False
syslog_regex_logs2metric_ddmetrics 194.84KiB 1.55 100.00% 12.24MiB 566.8KiB 11.55KiB 0 0.0451957 12.43MiB 565.6KiB 11.53KiB 0 0.0444098 False False
splunk_hec_to_splunk_hec_logs_noack 22.62KiB 0.09 93.11% 23.82MiB 513.37KiB 10.47KiB 0 0.0210462 23.84MiB 327.86KiB 6.69KiB 0 0.0134285 False False
splunk_hec_indexer_ack_blackhole 8.09KiB 0.03 25.50% 23.75MiB 890.33KiB 18.11KiB 0 0.036597 23.76MiB 838.31KiB 17.06KiB 0 0.034447 False False
enterprise_http_to_http -3.62KiB -0.01 37.78% 23.85MiB 255.18KiB 5.21KiB 0 0.0104474 23.84MiB 253.52KiB 5.19KiB 0 0.0103813 False False
splunk_hec_to_splunk_hec_logs_acks -1.85KiB -0.01 6.13% 23.76MiB 825.96KiB 16.8KiB 0 0.0339424 23.76MiB 846.52KiB 17.22KiB 0 0.0347902 False False
file_to_blackhole -66.9KiB -0.07 56.65% 95.36MiB 2.77MiB 57.48KiB 0 0.0290762 95.29MiB 3.04MiB 63.19KiB 0 0.0318903 False False
datadog_agent_remap_blackhole -68.27KiB -0.11 47.45% 62.87MiB 4.38MiB 91.14KiB 0 0.0695958 62.8MiB 2.73MiB 57.07KiB 0 0.0435273 False False
http_to_http_json -41.43KiB -0.17 99.81% 23.85MiB 343.08KiB 7.0KiB 0 0.0140468 23.81MiB 553.94KiB 11.31KiB 0 0.0227187 False False
http_pipelines_no_grok_blackhole -27.59KiB -0.24 79.13% 11.23MiB 45.91KiB 959.77B 0 0.00399206 11.2MiB 1.05MiB 21.92KiB 0 0.0939455 False False
fluent_elasticsearch -261.78KiB -0.32 99.96% 79.47MiB 54.08KiB 1.09KiB 0 0.000664425 79.22MiB 3.58MiB 73.65KiB 0 0.0451601 False False
http_to_http_noack -88.16KiB -0.36 99.95% 23.83MiB 518.72KiB 10.6KiB 0 0.021254 23.74MiB 1.11MiB 23.14KiB 0 0.0467365 False False
http_pipelines_blackhole_acks -8.49KiB -0.7 99.90% 1.19MiB 97.53KiB 1.99KiB 0 0.0802425 1.18MiB 80.4KiB 1.64KiB 0 0.0666109 False False
datadog_agent_remap_blackhole_acks -726.66KiB -1.15 100.00% 61.81MiB 4.94MiB 102.94KiB 0 0.0799727 61.1MiB 4.13MiB 86.29KiB 0 0.0675305 False False
splunk_hec_route_s3 -267.22KiB -1.37 99.99% 19.01MiB 2.38MiB 49.51KiB 0 0.125069 18.75MiB 2.19MiB 45.79KiB 0 0.116661 False False
http_pipelines_blackhole -29.97KiB -1.79 100.00% 1.64MiB 9.06KiB 189.6B 0 0.00540983 1.61MiB 114.79KiB 2.34KiB 0 0.0697811 False False
syslog_log2metric_humio_metrics -319.49KiB -2.39 100.00% 13.05MiB 351.93KiB 7.18KiB 0 0.0263235 12.74MiB 573.44KiB 11.67KiB 0 0.0439419 False False
datadog_agent_remap_datadog_logs_acks -2.08MiB -3.32 100.00% 62.71MiB 2.68MiB 56.1KiB 0 0.0427369 60.63MiB 4.29MiB 89.21KiB 0 0.0706782 False False
datadog_agent_remap_datadog_logs -2.27MiB -3.55 100.00% 63.89MiB 456.66KiB 9.34KiB 0 0.00697823 61.63MiB 4.27MiB 88.92KiB 0 0.0692858 False False
syslog_loki -721.33KiB -4.93 100.00% 14.29MiB 634.21KiB 12.97KiB 0 0.0433177 13.59MiB 806.12KiB 16.39KiB 0 0.0579138 False False
http_text_to_http_json -2.64MiB -6.8 100.00% 38.8MiB 1.06MiB 22.18KiB 0 0.0273472 36.16MiB 963.99KiB 19.69KiB 0 0.0260289 False False
socket_to_socket_blackhole -9.89MiB -43.01 100.00% 23.01MiB 435.12KiB 8.88KiB 0 0.0184653 13.11MiB 167.58KiB 3.42KiB 0 0.0124775 False False

@github-actions
Copy link

github-actions bot commented Aug 8, 2022

Soak Test Results

Baseline: 4611307
Comparison: 5dcf97f
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests are to determine, quickly, if vector performance is changed and to what degree by a pull request. Where appropriate units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparision SHAs, with 90.0% confidence OR have been detected as newly erratic. Negative values mean that baseline is faster, positive comparison. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

Changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

experiment Δ mean Δ mean % confidence
socket_to_socket_blackhole -9.78MiB -43.02 100.00%
Fine details of change detection per experiment.
experiment Δ mean Δ mean % confidence baseline mean baseline stdev baseline stderr baseline outlier % baseline CoV comparison mean comparison stdev comparison stderr comparison outlier % comparison CoV erratic declared erratic
http_to_http_acks 448.52KiB 2.54 93.19% 17.21MiB 8.01MiB 167.4KiB 0 0.465152 17.65MiB 8.62MiB 179.93KiB 0 0.488467 True True
syslog_humio_logs 343.34KiB 2.09 100.00% 16.05MiB 310.95KiB 6.35KiB 0 0.0189216 16.38MiB 339.7KiB 6.95KiB 0 0.0202479 False False
syslog_log2metric_splunk_hec_metrics 315.36KiB 1.75 100.00% 17.62MiB 697.54KiB 14.22KiB 0 0.0386527 17.93MiB 975.57KiB 19.84KiB 0 0.0531302 False False
syslog_splunk_hec_logs 279.27KiB 1.65 100.00% 16.57MiB 780.49KiB 15.88KiB 0 0.0459793 16.85MiB 726.69KiB 14.8KiB 0 0.0421169 False False
syslog_regex_logs2metric_ddmetrics 153.94KiB 1.23 100.00% 12.22MiB 591.56KiB 12.05KiB 0 0.0472669 12.37MiB 586.63KiB 11.96KiB 0 0.046303 False False
splunk_hec_to_splunk_hec_logs_noack 10.23KiB 0.04 64.81% 23.83MiB 424.01KiB 8.66KiB 0 0.0173744 23.84MiB 330.89KiB 6.76KiB 0 0.0135531 False False
splunk_hec_indexer_ack_blackhole 7.5KiB 0.03 23.85% 23.75MiB 869.99KiB 17.7KiB 0 0.0357663 23.76MiB 847.12KiB 17.23KiB 0 0.0348154 False False
enterprise_http_to_http -2.07KiB -0.01 22.58% 23.85MiB 251.95KiB 5.14KiB 0 0.0103154 23.85MiB 246.3KiB 5.04KiB 0 0.0100852 False False
http_pipelines_no_grok_blackhole -6.28KiB -0.06 21.19% 10.92MiB 282.34KiB 5.76KiB 0 0.0252536 10.91MiB 1.09MiB 22.65KiB 0 0.0997083 False False
file_to_blackhole -70.23KiB -0.07 65.19% 95.37MiB 2.46MiB 51.02KiB 0 0.0257989 95.31MiB 2.63MiB 54.76KiB 0 0.0276261 False False
http_to_http_json -29.51KiB -0.12 98.09% 23.84MiB 357.07KiB 7.29KiB 0 0.0146218 23.81MiB 502.58KiB 10.27KiB 0 0.0206054 False False
splunk_hec_to_splunk_hec_logs_acks -31.13KiB -0.13 81.73% 23.78MiB 745.47KiB 15.18KiB 0 0.0306119 23.75MiB 873.06KiB 17.75KiB 0 0.0358973 False False
http_to_http_noack -61.48KiB -0.25 99.28% 23.83MiB 519.14KiB 10.61KiB 0 0.0212712 23.77MiB 993.53KiB 20.24KiB 0 0.0408123 False False
datadog_agent_remap_blackhole -287.3KiB -0.46 98.13% 61.44MiB 4.36MiB 90.96KiB 0 0.070989 61.16MiB 3.91MiB 81.54KiB 0 0.0639141 False False
fluent_elasticsearch -389.07KiB -0.48 100.00% 79.47MiB 52.28KiB 1.06KiB 0 0.000642282 79.09MiB 4.31MiB 88.55KiB 0 0.0545035 False False
datadog_agent_remap_blackhole_acks -580.0KiB -0.95 100.00% 59.87MiB 4.15MiB 86.35KiB 0 0.0692338 59.3MiB 2.83MiB 59.14KiB 0 0.0476499 False False
http_pipelines_blackhole_acks -12.44KiB -1.02 100.00% 1.19MiB 100.6KiB 2.05KiB 0 0.0828503 1.17MiB 85.42KiB 1.74KiB 0 0.0710787 False False
splunk_hec_route_s3 -296.65KiB -1.55 100.00% 18.64MiB 2.34MiB 48.71KiB 0 0.1254 18.35MiB 2.25MiB 47.14KiB 0 0.122684 False False
http_pipelines_blackhole -27.92KiB -1.8 100.00% 1.52MiB 23.38KiB 489.16B 0 0.0150641 1.49MiB 67.67KiB 1.38KiB 0 0.0443945 False False
syslog_log2metric_humio_metrics -375.37KiB -2.98 100.00% 12.31MiB 317.76KiB 6.48KiB 0 0.0252013 11.94MiB 509.59KiB 10.38KiB 0 0.0416564 False False
datadog_agent_remap_datadog_logs_acks -2.05MiB -3.23 100.00% 63.62MiB 3.22MiB 67.31KiB 0 0.0506005 61.57MiB 4.36MiB 90.82KiB 0 0.0708477 False False
datadog_agent_remap_datadog_logs -2.17MiB -3.45 100.00% 63.03MiB 1.23MiB 25.88KiB 0 0.01959 60.86MiB 4.15MiB 86.32KiB 0 0.0681015 False False
syslog_loki -759.54KiB -5.1 100.00% 14.55MiB 383.36KiB 7.85KiB 0 0.0257316 13.8MiB 717.84KiB 14.59KiB 0 0.0507719 False False
http_text_to_http_json -2.68MiB -6.83 100.00% 39.26MiB 768.98KiB 15.7KiB 0 0.0191234 36.58MiB 842.13KiB 17.2KiB 0 0.0224782 False False
socket_to_socket_blackhole -9.78MiB -43.02 100.00% 22.72MiB 597.86KiB 12.2KiB 0 0.0256867 12.95MiB 306.66KiB 6.26KiB 0 0.0231211 False False

Signed-off-by: Kruno Tomola Fabro <krunotf@gmail.com>
@ktff
Copy link
Contributor Author

ktff commented Aug 9, 2022

Regarding failing checks, they are complaining about line 344 in a file of 334 lines so I'm not sure what's that about.

This is the file https://github.com/vectordotdev/vector/blob/02d9add57437ddcb20bbf25e3c2b3a44f1b128bb/src/sinks/elasticsearch/tests.rs

@tobz if it will help with review I can split this into two PRs. One for distributed service and one for elasticsearch.

@github-actions
Copy link

github-actions bot commented Aug 9, 2022

Soak Test Results

Baseline: afcee9b
Comparison: 3f821f3
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests are to determine, quickly, if vector performance is changed and to what degree by a pull request. Where appropriate units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparision SHAs, with 90.0% confidence OR have been detected as newly erratic. Negative values mean that baseline is faster, positive comparison. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

Changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

experiment Δ mean Δ mean % confidence
socket_to_socket_blackhole -10.45MiB -44.19 100.00%
Fine details of change detection per experiment.
experiment Δ mean Δ mean % confidence baseline mean baseline stdev baseline stderr baseline outlier % baseline CoV comparison mean comparison stdev comparison stderr comparison outlier % comparison CoV erratic declared erratic
syslog_log2metric_splunk_hec_metrics 798.69KiB 4.4 100.00% 17.71MiB 562.56KiB 11.47KiB 0 0.0310204 18.49MiB 748.41KiB 15.24KiB 0 0.039527 False False
syslog_humio_logs 707.28KiB 4.32 100.00% 16.0MiB 512.1KiB 10.46KiB 0 0.0312421 16.69MiB 568.5KiB 11.64KiB 0 0.0332478 False False
http_to_http_acks 658.93KiB 3.69 99.20% 17.45MiB 8.18MiB 170.98KiB 0 0.468579 18.1MiB 8.62MiB 179.89KiB 0 0.476172 True True
syslog_splunk_hec_logs 596.68KiB 3.58 100.00% 16.29MiB 765.48KiB 15.57KiB 0 0.0458793 16.87MiB 652.21KiB 13.3KiB 0 0.0377402 False False
syslog_regex_logs2metric_ddmetrics 377.43KiB 2.98 100.00% 12.38MiB 548.83KiB 11.19KiB 0 0.0432991 12.74MiB 518.57KiB 10.58KiB 0 0.0397284 False False
datadog_agent_remap_blackhole 301.94KiB 0.46 99.56% 63.84MiB 4.38MiB 91.2KiB 0 0.0685717 64.14MiB 2.58MiB 53.86KiB 0 0.0402188 False False
splunk_hec_route_s3 67.8KiB 0.35 70.43% 18.98MiB 2.22MiB 46.15KiB 0 0.116744 19.04MiB 2.18MiB 45.54KiB 0 0.114292 False False
http_pipelines_blackhole_acks 2.55KiB 0.21 66.40% 1.16MiB 105.02KiB 2.14KiB 0 0.0882516 1.16MiB 76.85KiB 1.57KiB 0 0.0644473 False False
splunk_hec_to_splunk_hec_logs_noack 11.13KiB 0.05 68.43% 23.83MiB 432.61KiB 8.83KiB 0 0.0177273 23.84MiB 328.92KiB 6.71KiB 0 0.0134721 False False
enterprise_http_to_http -1.13KiB -0 12.80% 23.85MiB 245.64KiB 5.01KiB 0 0.0100573 23.85MiB 241.03KiB 4.93KiB 0 0.00986908 False False
splunk_hec_to_splunk_hec_logs_acks -2.56KiB -0.01 9.19% 23.77MiB 771.6KiB 15.71KiB 0 0.03169 23.77MiB 769.32KiB 15.66KiB 0 0.0315997 False False
splunk_hec_indexer_ack_blackhole -6.84KiB -0.03 20.51% 23.75MiB 899.52KiB 18.3KiB 0 0.0369758 23.75MiB 928.37KiB 18.88KiB 0 0.0381723 False False
file_to_blackhole -53.69KiB -0.05 42.65% 95.34MiB 2.93MiB 60.79KiB 0 0.0307485 95.29MiB 3.53MiB 73.49KiB 0 0.0370649 False False
http_to_http_json -26.19KiB -0.11 96.82% 23.85MiB 354.53KiB 7.24KiB 0 0.014516 23.82MiB 479.95KiB 9.81KiB 0 0.0196723 False False
http_to_http_noack -67.03KiB -0.27 99.63% 23.83MiB 507.93KiB 10.39KiB 0 0.0208125 23.76MiB 1012.72KiB 20.63KiB 0 0.0416112 False False
syslog_log2metric_humio_metrics -53.21KiB -0.41 100.00% 12.6MiB 270.19KiB 5.51KiB 0 0.0209428 12.54MiB 563.51KiB 11.47KiB 0 0.0438591 False False
http_pipelines_no_grok_blackhole -59.14KiB -0.51 98.93% 11.25MiB 43.8KiB 915.62B 0 0.00380082 11.19MiB 1.11MiB 23.15KiB 0 0.0993178 False False
fluent_elasticsearch -439.91KiB -0.54 100.00% 79.47MiB 53.28KiB 1.08KiB 0 0.000654572 79.04MiB 4.82MiB 99.11KiB 0 0.0610175 False False
datadog_agent_remap_blackhole_acks -556.6KiB -0.86 100.00% 63.24MiB 4.94MiB 102.82KiB 0 0.0780973 62.69MiB 3.55MiB 74.29KiB 0 0.0566567 False False
http_pipelines_blackhole -18.41KiB -1.11 100.00% 1.63MiB 28.73KiB 601.38B 0 0.0172482 1.61MiB 98.03KiB 2.0KiB 0 0.0595149 False False
datadog_agent_remap_datadog_logs_acks -1.68MiB -2.65 100.00% 63.46MiB 2.86MiB 59.84KiB 0 0.045067 61.77MiB 4.39MiB 91.3KiB 0 0.0709906 False False
datadog_agent_remap_datadog_logs -1.72MiB -2.71 100.00% 63.49MiB 513.23KiB 10.51KiB 0 0.00789191 61.78MiB 4.04MiB 84.18KiB 0 0.0654095 False False
syslog_loki -856.56KiB -5.84 100.00% 14.32MiB 476.35KiB 9.75KiB 0 0.0324678 13.49MiB 731.83KiB 14.88KiB 0 0.0529743 False False
http_text_to_http_json -2.57MiB -6.57 100.00% 39.12MiB 819.36KiB 16.73KiB 0 0.0204475 36.55MiB 645.92KiB 13.19KiB 0 0.0172536 False False
socket_to_socket_blackhole -10.45MiB -44.19 100.00% 23.66MiB 308.81KiB 6.3KiB 0 0.0127436 13.2MiB 98.01KiB 2.0KiB 0 0.00724681 False False

@github-actions
Copy link

github-actions bot commented Aug 9, 2022

Soak Test Results

Baseline: afcee9b
Comparison: 02d9add
Total Vector CPUs: 4

Explanation

A soak test is an integrated performance test for vector in a repeatable rig, with varying configuration for vector. What follows is a statistical summary of a brief vector run for each configuration across SHAs given above. The goal of these tests are to determine, quickly, if vector performance is changed and to what degree by a pull request. Where appropriate units are scaled per-core.

The table below, if present, lists those experiments that have experienced a statistically significant change in their throughput performance between baseline and comparision SHAs, with 90.0% confidence OR have been detected as newly erratic. Negative values mean that baseline is faster, positive comparison. Results that do not exhibit more than a ±8.87% change in mean throughput are discarded. An experiment is erratic if its coefficient of variation is greater than 0.3. The abbreviated table will be omitted if no interesting changes are observed.

Changes in throughput with confidence ≥ 90.00% and absolute Δ mean >= ±8.87%:

experiment Δ mean Δ mean % confidence
socket_to_socket_blackhole -10.22MiB -43.27 100.00%
Fine details of change detection per experiment.
experiment Δ mean Δ mean % confidence baseline mean baseline stdev baseline stderr baseline outlier % baseline CoV comparison mean comparison stdev comparison stderr comparison outlier % comparison CoV erratic declared erratic
syslog_humio_logs 587.87KiB 3.58 100.00% 16.05MiB 446.6KiB 9.12KiB 0 0.0271713 16.62MiB 457.33KiB 9.36KiB 0 0.0268632 False False
syslog_log2metric_splunk_hec_metrics 618.73KiB 3.47 100.00% 17.42MiB 670.32KiB 13.66KiB 0 0.0375747 18.02MiB 894.18KiB 18.19KiB 0 0.0484429 False False
syslog_splunk_hec_logs 532.57KiB 3.27 100.00% 15.89MiB 856.14KiB 17.43KiB 0 0.0526218 16.41MiB 741.7KiB 15.12KiB 0 0.0441426 False False
http_to_http_acks 475.53KiB 2.66 94.56% 17.49MiB 8.11MiB 169.43KiB 0 0.463395 17.95MiB 8.62MiB 179.85KiB 0 0.47987 True True
syslog_regex_logs2metric_ddmetrics 222.22KiB 1.74 100.00% 12.47MiB 555.53KiB 11.32KiB 0 0.0434924 12.69MiB 654.83KiB 13.34KiB 0 0.0503904 False False
datadog_agent_remap_blackhole 824.03KiB 1.3 100.00% 61.94MiB 5.34MiB 111.35KiB 0 0.0862764 62.74MiB 3.79MiB 79.12KiB 0 0.0604457 False False
splunk_hec_route_s3 27.15KiB 0.14 31.73% 18.8MiB 2.28MiB 47.48KiB 0 0.121154 18.83MiB 2.22MiB 46.45KiB 0 0.11793 False False
splunk_hec_to_splunk_hec_logs_noack 13.1KiB 0.05 76.56% 23.83MiB 423.21KiB 8.64KiB 0 0.0173425 23.84MiB 334.19KiB 6.82KiB 0 0.0136872 False False
splunk_hec_indexer_ack_blackhole 9.37KiB 0.04 27.09% 23.74MiB 965.72KiB 19.64KiB 0 0.0397204 23.75MiB 914.39KiB 18.6KiB 0 0.0375949 False False
enterprise_http_to_http -2.17KiB -0.01 22.99% 23.85MiB 255.79KiB 5.22KiB 0 0.0104733 23.84MiB 257.25KiB 5.26KiB 0 0.0105341 False False
file_to_blackhole -69.17KiB -0.07 63.11% 95.36MiB 2.5MiB 51.73KiB 0 0.0261635 95.3MiB 2.74MiB 57.0KiB 0 0.0287546 False False
splunk_hec_to_splunk_hec_logs_acks -17.24KiB -0.07 52.09% 23.76MiB 809.39KiB 16.47KiB 0 0.0332588 23.74MiB 882.0KiB 17.94KiB 0 0.036268 False False
http_to_http_json -38.68KiB -0.16 99.68% 23.85MiB 349.58KiB 7.14KiB 0 0.0143134 23.81MiB 539.76KiB 11.02KiB 0 0.0221352 False False
fluent_elasticsearch -157.64KiB -0.19 100.00% 79.47MiB 53.06KiB 1.07KiB 0 0.000651891 79.32MiB 1.73MiB 35.58KiB 0 0.0218076 False False
http_pipelines_blackhole_acks -4.54KiB -0.37 92.84% 1.18MiB 104.26KiB 2.12KiB 0 0.0860222 1.18MiB 66.55KiB 1.36KiB 0 0.0551173 False False
syslog_log2metric_humio_metrics -77.94KiB -0.59 100.00% 12.98MiB 262.38KiB 5.36KiB 0 0.0197368 12.9MiB 563.08KiB 11.46KiB 0 0.0426065 False False
http_to_http_noack -165.97KiB -0.68 100.00% 23.85MiB 251.22KiB 5.14KiB 0 0.0102861 23.68MiB 1.37MiB 28.58KiB 0 0.0579381 False False
http_pipelines_no_grok_blackhole -78.54KiB -0.69 99.98% 11.18MiB 86.43KiB 1.76KiB 0 0.00754929 11.1MiB 1.01MiB 21.14KiB 0 0.0913799 False False
datadog_agent_remap_blackhole_acks -880.0KiB -1.37 100.00% 62.7MiB 4.73MiB 98.54KiB 0 0.0754807 61.84MiB 3.53MiB 73.84KiB 0 0.0570988 False False
datadog_agent_remap_datadog_logs -1.06MiB -1.73 100.00% 61.37MiB 3.32MiB 69.59KiB 0 0.0540638 60.31MiB 4.85MiB 101.04KiB 0 0.0804303 False False
http_pipelines_blackhole -29.31KiB -1.78 100.00% 1.61MiB 21.51KiB 449.99B 0 0.0130601 1.58MiB 97.29KiB 1.98KiB 0 0.0601293 False False
datadog_agent_remap_datadog_logs_acks -1.38MiB -2.2 100.00% 62.91MiB 3.6MiB 75.23KiB 0 0.057269 61.53MiB 4.35MiB 90.52KiB 0 0.0706577 False False
syslog_loki -583.98KiB -3.91 100.00% 14.59MiB 358.71KiB 7.34KiB 0 0.024008 14.02MiB 694.7KiB 14.12KiB 0 0.0483864 False False
http_text_to_http_json -2.51MiB -6.4 100.00% 39.26MiB 775.31KiB 15.83KiB 0 0.0192823 36.75MiB 768.59KiB 15.69KiB 0 0.0204219 False False
socket_to_socket_blackhole -10.22MiB -43.27 100.00% 23.62MiB 269.74KiB 5.51KiB 0 0.0111479 13.4MiB 106.55KiB 2.18KiB 0 0.00776287 False False

@tobz
Copy link
Contributor

tobz commented Aug 9, 2022

Regarding failing checks, they are complaining about line 344 in a file of 334 lines so I'm not sure what's that about.

This is the file https://github.com/vectordotdev/vector/blob/02d9add57437ddcb20bbf25e3c2b3a44f1b128bb/src/sinks/elasticsearch/tests.rs

@tobz if it will help with review I can split this into two PRs. One for distributed service and one for elasticsearch.

If it's not too much effort to split it, that would be helpful. I really do apologize for the long back-and-forth here, but unfortunately there's a lot of detail here to cover. 😓

@ktff
Copy link
Contributor Author

ktff commented Aug 10, 2022

Closing this PR in favor of splitting it into two PRs.

First one #13918.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
domain: external docs Anything related to Vector's external, public documentation domain: sinks Anything related to the Vector's sinks
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants