perf: Use RTT variance instead of a fixed threshold #3671

bruceg · 2020-09-01T21:55:41Z

Instead of using a fixed threshold, this calculates the standard deviation of the EWMA of the past RTT measurements, and uses a multiple of that as the threshold range.

Instead of using a fixed threshold, this calculates the standard deviation of the EWMA of the past RTT measurements, and uses a multiple of that as the threshold range. Signed-off-by: Bruce Guenter <bruce@timber.io>

This also adds an internal option for a custom EWMA alpha. Signed-off-by: Bruce Guenter <bruce@timber.io>

jszwedko

🎉

I'll try this out with the tests we have so far.

jszwedko · 2020-09-02T13:44:09Z

src/sinks/util/auto_concurrency/controller.rs

+                let variance = self.variance.unwrap_or(0.0);
+                (
+                    point * self.alpha + avg * (1.0 - self.alpha),
+                    Some((1.0 - self.alpha) * (variance + self.alpha * delta * delta)),


Would you be able to explain a bit what this is doing?

(specifically the variance bit of the calculation)

jszwedko · 2020-09-02T17:49:03Z

I note that I see:

warning: method is never used: `new`
   --> src/sinks/util/auto_concurrency/controller.rs:266:5
    |
266 |     fn new(alpha: f64) -> Self {
    |     ^^^^^^^^^^^^^^^^^^^^^^^^^^
    |
    = note: `#[warn(dead_code)]` on by default

warning: 1 warning emitted

When building.

I reran the tests (including the added "knee behavior" tests).

Some observations:

It seems like it does better in the small variance case, but actually appears worse in the medium/large variance cases
It seems to do better for the knee case with a higher slope

Before (424ea5e):

After (6670a0e):

I'm realizing it'd probably be useful to plot before/after on the same graphs (specifically the throughput) to make it easier to see differences. I opened vectordotdev/http_test_server#4 to that end.

bruceg · 2020-10-19T14:50:17Z

Since this doesn't seem to have helped any, at least with our simulations, and needs some work to be usable, I will close this for now and see what happens in real deployment situations.

Bruce Guenter added 2 commits September 1, 2020 15:46

Use the variance of the past RTT measurements as a threshold indicator

b40ca86

Instead of using a fixed threshold, this calculates the standard deviation of the EWMA of the past RTT measurements, and uses a multiple of that as the threshold range. Signed-off-by: Bruce Guenter <bruce@timber.io>

Add self-test for the EWMA standard deviation calculation

6670a0e

This also adds an internal option for a custom EWMA alpha. Signed-off-by: Bruce Guenter <bruce@timber.io>

bruceg added type: enhancement A value-adding code change that enhances its existing functionality. domain: networking Anything related to Vector's networking domain: sinks Anything related to the Vector's sinks domain: performance Anything related to Vector's performance labels Sep 1, 2020

bruceg requested a review from jszwedko September 1, 2020 21:55

bruceg self-assigned this Sep 1, 2020

jszwedko reviewed Sep 2, 2020

View reviewed changes

bruceg mentioned this pull request Sep 2, 2020

enhancement: Add a new options to control the auto concurrency limiter #3690

Merged

bruceg added this to the 2020-09-14 - The Grid milestone Sep 14, 2020

jamtur01 modified the milestones: 2020-09-14 - The Grid, 2020-09-28 - Derezzed Sep 28, 2020

bruceg mentioned this pull request Oct 9, 2020

fix(networking): Adjust auto concurrency tuning defaults #4476

Merged

bruceg closed this Oct 19, 2020

binarylogic deleted the auto-concurrency-handle-variance branch January 19, 2021 17:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: Use RTT variance instead of a fixed threshold #3671

perf: Use RTT variance instead of a fixed threshold #3671

bruceg commented Sep 1, 2020

jszwedko left a comment

jszwedko Sep 2, 2020

jszwedko Sep 2, 2020

jszwedko commented Sep 2, 2020

bruceg commented Oct 19, 2020

perf: Use RTT variance instead of a fixed threshold #3671

perf: Use RTT variance instead of a fixed threshold #3671

Conversation

bruceg commented Sep 1, 2020

jszwedko left a comment

Choose a reason for hiding this comment

jszwedko Sep 2, 2020

Choose a reason for hiding this comment

jszwedko Sep 2, 2020

Choose a reason for hiding this comment

jszwedko commented Sep 2, 2020

bruceg commented Oct 19, 2020