
fix: Rate limiter TokenBucket::auto_replenish() #3370

Merged

Conversation

@JonathanWoollett-Light (Contributor) commented Jan 16, 2023

Changes

Fixes bug in TokenBucket::auto_replenish.

Reason

When a TokenBucket becomes empty, the implementation sets a 100ms timer to replenish it.

The bucket replenishing function calculates the new tokens to add to the budget using the following
formula:

let tokens = (time_delta * self.processed_capacity) / self.processed_refill_time;

However, this formula can return 0 depending on the values of self.processed_capacity and self.processed_refill_time.
For a TokenBucket with total capacity size and a refill period of complete_refill_time_ns nanoseconds, these values are calculated as follows:

// Get the greatest common factor between `size` and `complete_refill_time_ns`.
let common_factor = gcd(size, complete_refill_time_ns);
let processed_capacity: u64 = size / common_factor;
let processed_refill_time: u64 = complete_refill_time_ns / common_factor;

So, for example, if size == 1 and complete_refill_time_ns == 1_000_000_000 (equivalent to 1 token per second), the number of replenished tokens will be

let tokens = (100_000_000 * 1) / 1_000_000_000; // which gives 0 in integer division

As a result, the bucket will never get replenished.
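For illustration, a minimal, self-contained sketch of the failure mode (hypothetical stand-alone code using the values from the example above, not the actual TokenBucket implementation):

fn main() {
    // Bucket configured for 1 token per second, as in the example above.
    let size: u64 = 1;
    let complete_refill_time_ns: u64 = 1_000_000_000;

    // gcd(1, 1_000_000_000) == 1, so the "processed" values are unchanged.
    let processed_capacity: u64 = size;
    let processed_refill_time: u64 = complete_refill_time_ns;

    // The replenish timer fires after 100ms.
    let time_delta: u64 = 100_000_000;

    let tokens = (time_delta * processed_capacity) / processed_refill_time;
    assert_eq!(tokens, 0); // 0.1 truncates to 0 in integer division

    // If last_update is also reset to Instant::now() here, the elapsed 100ms
    // is discarded and every subsequent call starts from zero again, so the
    // bucket is never replenished.
}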

License Acceptance

By submitting this pull request, I confirm that my contribution is made under
the terms of the Apache 2.0 license.

PR Checklist

  • All commits in this PR are signed (git commit -s).
  • If a specific issue led to this PR, this PR closes the issue.
  • The description of changes is clear and encompassing.
  • Any required documentation changes (code and docs) are included in this PR.
  • New unsafe code is documented.
  • API changes follow the Runbook for Firecracker API changes.
  • User-facing changes are mentioned in CHANGELOG.md.
  • All added/changed functionality is tested.
  • New TODOs link to an issue.

  • This functionality can be added in rust-vmm.

@JonathanWoollett-Light added the Type: Bug label on Jan 16, 2023
acatangiu previously approved these changes Jan 16, 2023

@acatangiu (Contributor) left a comment:

since all internals are u64 (and 2^64 nanoseconds is > 580 years) the gcd is definitely not needed and this simplification makes total sense 👍

pb8o previously approved these changes Jan 17, 2023
@JonathanWoollett-Light (Contributor, Author) commented:
@roypat @bchalios Could you please re-review?

@roypat (Contributor) left a comment:

As @JonathanWoollett-Light noted, the implementation still "leaks" tokens (but doesn't hang anymore). This is because if auto_replenish is called after enough time has passed for, say, 1.9 tokens to regenerate, only one token will be added, and last_update will be fast-forwarded to Instant::now(). The remaining 0.9 tokens' worth of time is effectively discarded.

We can fix this by incrementing last_update by the actual time it took to generate the tokens that have been added to the budget, instead of setting it to Instant::now(). I think my proposed suggestion implements this correctly, but I'd love to have someone else trace out the logic to verify it.

Technically, this does not even need the "if tokens > 0" conditional anymore.
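A minimal sketch of that idea (field names follow the snippets above; this is an illustration of the approach rather than the exact merged code, and capping the budget at the bucket size is omitted for brevity):

use std::time::{Duration, Instant};

struct TokenBucket {
    budget: u64,
    processed_capacity: u64,
    processed_refill_time: u64,
    last_update: Instant,
}

impl TokenBucket {
    fn auto_replenish(&mut self) {
        // Nanoseconds elapsed since tokens were last accounted for.
        let time_delta = self.last_update.elapsed().as_nanos() as u64;
        let tokens = (time_delta * self.processed_capacity) / self.processed_refill_time;

        // Advance last_update only by the time actually "consumed" to generate
        // `tokens`, rather than jumping to Instant::now(); the fractional
        // remainder of time_delta is carried over to the next call.
        self.last_update +=
            Duration::from_nanos((tokens * self.processed_refill_time) / self.processed_capacity);
        self.budget += tokens;
    }
}

fn main() {
    let mut bucket = TokenBucket {
        budget: 0,
        processed_capacity: 1,                 // 1 token
        processed_refill_time: 1_000_000_000,  // per second
        last_update: Instant::now(),
    };
    std::thread::sleep(Duration::from_millis(100));
    bucket.auto_replenish(); // adds 0 tokens, but keeps ~100ms of credit
    assert!(bucket.last_update.elapsed() >= Duration::from_millis(100));
}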

acatangiu previously approved these changes Jan 17, 2023
bchalios previously approved these changes Jan 17, 2023
Frequently calling `auto_replenish` will reset `self.last_update` each
time, while `tokens` may be a fractional value (truncated to 0 since it
is a `u64`); in this case no tokens will be replenished.

To avoid this, we increment `self.last_update` by the minimum time
required to generate `tokens`. In the case where enough time has passed
to generate 1.8 tokens but only 1 token is generated due to integer
arithmetic, this carries the time required to generate the remaining
0.8 of a token over to the next call, so that a next call which would
otherwise generate 2.3 tokens instead generates 3.1 tokens. This
minimizes dropping tokens at high call frequencies.

Signed-off-by: Jonathan Woollett-Light <jcawl@amazon.co.uk>
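To make the carry described above concrete, a small stand-alone example with hypothetical numbers (1 token per 1,000,000 ns):

fn main() {
    let (capacity, refill_time): (u64, u64) = (1, 1_000_000); // 1 token per 1ms

    // First call: 1.8 tokens' worth of time has elapsed.
    let time_delta_1: u64 = 1_800_000;
    let tokens_1 = (time_delta_1 * capacity) / refill_time; // 1 token
    let consumed_1 = (tokens_1 * refill_time) / capacity;   // 1_000_000 ns accounted for
    let carry = time_delta_1 - consumed_1;                  // 800_000 ns carried over

    // Second call: another 2.3 tokens' worth of time elapses; together with
    // the carry the effective delta is 3.1 tokens, so 3 tokens are generated.
    let time_delta_2: u64 = 2_300_000 + carry;
    let tokens_2 = (time_delta_2 * capacity) / refill_time;
    assert_eq!((tokens_1, tokens_2), (1, 3));
}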
@roypat (Contributor) left a comment:

I share @JonathanWoollett-Light's concern regarding last_update falling behind Instant::now(), so here’s a proof (I hope I got everything right) that it doesn’t (for simplicity, I use the non-reduced versions of size and refill time):

  1. Note that integer division is just normal division, followed by rounding down (floor function)
  2. For positive y, we have floor(x / y) * y >= x - y, as the expression on the LHS “rounds down” to the closest multiple of y
  3. For integer z, we have floor(x + z) = floor(x) + z
  4. Multiplication by (-1) flips inequalities, e.g. if x <= y, then z - x >= z - y.
  5. We refer to the system time at the start of auto_replenish as now_start, and to the system time at the end of auto_replenish as now_end. With this, we have time_delta = now_start - last_update_old

With this in place, note how at the end of auto_replenish we set last_update to

last_update_new = last_update_old + floor(floor((now_start - last_update_old) * size / refill_time) * refill_time / size)

To prove that last_update does not deviate from now_end = Instant::now() unboundedly, we want to find an upper bound for now_end - last_update_new. We have

now_end - last_update_new = now_end - last_update_old - floor(floor((now_start - last_update_old) * size / refill_time) * refill_time / size)
<= now_end - last_update_old - floor(((now_start - last_update_old) * size - refill_time) / size)
= now_end - last_update_old - floor(now_start - last_update_old - refill_time / size)
= now_end - last_update_old - (now_start - last_update_old) - floor(-refill_time / size)
= (now_end - now_start) + ceil(refill_time / size)

where the last step uses -floor(-x) = ceil(x).

This means that last_update indeed does not fall behind by more than the execution time of auto_replenish plus a constant, ceil(refill_time / size), which depends only on the bucket configuration.
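As a quick numerical sanity check of this bound (a stand-alone sketch with hypothetical values, not part of the PR), one can evaluate the update formula directly and compare the resulting lag against the constant from the proof:

fn main() {
    // A non-reduced bucket configuration, as in the proof above.
    let size: u64 = 7;
    let refill_time: u64 = 1_000_000_000;

    // Sample a range of elapsed times and check the claimed bound.
    for time_delta in (0..10_000_000_000u64).step_by(999_983) {
        let tokens = (time_delta * size) / refill_time;
        let consumed = (tokens * refill_time) / size;
        // Ignoring the execution time of auto_replenish itself, last_update
        // falls behind "now" by exactly time_delta - consumed.
        let lag = time_delta - consumed;
        // floor(refill_time / size) + 1 >= ceil(refill_time / size), the constant from the proof.
        assert!(lag <= refill_time / size + 1);
    }
}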

@JonathanWoollett-Light (Contributor, Author) commented:
@bchalios @pb8o Could you please re-review? I need another approval.

roypat merged commit 79ed3c4 into firecracker-microvm:main on Jan 18, 2023
roypat added a commit that referenced this pull request Jul 4, 2023
Adds proofs that ensure various properties we expect of the ratelimiter
are upheld, and that no panics can occur.

Partially based on #3370

Signed-off-by: Patrick Roy <roypat@amazon.co.uk>
Co-authored-by: Felipe R. Monteiro <felisous@amazon.com>
Co-authored-by: Daniel Schwartz-Narbonne <dsn@amazon.co.uk>