Correctly handle some extreme edge cases in the ratelimiter implementation #3706

roypat · 2023-05-19T08:43:22Z

Changes

This series of commits fixes some edge cases in the ratelimiter implementation related to insanely large timespans (on the magnitude of hundreds of thousands of years)

License Acceptance

By submitting this pull request, I confirm that my contribution is made under
the terms of the Apache 2.0 license. For more information on following
Developer Certificate of Origin and signing off your commits, please check
CONTRIBUTING.md.

PR Checklist

If a specific issue led to this PR, this PR closes the issue.
The description of changes is clear and encompassing.
Any required documentation changes (code and docs) are included in this PR.
API changes follow the Runbook for Firecracker API changes.
User-facing changes are mentioned in CHANGELOG.md.
All added/changed functionality is tested.
New TODOs link to an issue.
Commits meet contribution quality standards.

This functionality cannot be added in rust-vmm.

This is due to the constructor attempting to convert from milliseconds to nanoseconds without checking whether the nanosecond value fits into a u64. Signed-off-by: Patrick Roy <roypat@amazon.co.uk>

Due to rounding down instead of up when computing the time that it would take to generate "x" tokens, the fractional part of a nanosecond is used "twice" once per call to auto_replenish. This can only happen if the time since the last auto_replenish call is enough to generate at least one token, e.g. (refill_time / bucket_size) milliseconds. To accumulate enough "double used" time to generate an extra token, at least (refill_time * 1_000_000 / bucket_size) calls to auto_replenish are needed, with the calls timed in a way that causes the "double used" time to be as close as possible to 1ns. Note that since time resolution is at the nanosecond level, at least one nanosecond has to pass between these calls. For example, if a TokenBucket is setup for 1GB/s, we could allow one extra byte of traffic every (1000 / 1_000_000_000) * 1_000_000 = 1ms, e.g. excess throughput of 1KB/s (or 0.0001% above the target) Additionally, in extremely unrealistic scenarios (either refill_time of thousands of years or the guest waiting thousands of years), an integer overflow in a multiplication in auto_replenish can leads to less tokens being replenished than should be. Signed-off-by: Patrick Roy <roypat@amazon.co.uk>

Due to a missing clamping overation, it used to be able to increment one_time_burst beyond initial_one_time_burst. Additionally, integer overflow during addition could cause the budget/one_time_burst to actually decrease if they were close to u64::MAX priot to calling force_replenish (a very unrealistic scenario) Signed-off-by: Patrick Roy <roypat@amazon.co.uk>

bchalios

LGTM, just one comment. Is it possible to add unit-tests for these extreme cases?

bchalios · 2023-05-22T10:24:44Z

LGTM, just one comment. Is it possible to add unit-tests for these extreme cases?

Spoke offline with @roypat. We will be adding tests for these cases with subsequent contributions.

mattschlebusch

No blockers, but to echo Babis query on unit-testing, they are a must here as the edge-cases could easily be overlooked.

src/rate_limiter/src/lib.rs

roypat added 3 commits May 19, 2023 09:40

fix integer overflow in TokenBucket::new for excessive refill time

2a63515

This is due to the constructor attempting to convert from milliseconds to nanoseconds without checking whether the nanosecond value fits into a u64. Signed-off-by: Patrick Roy <roypat@amazon.co.uk>

roypat added the Status: Awaiting review Indicates that a pull request is ready to be reviewed label May 19, 2023

bchalios reviewed May 22, 2023

View reviewed changes

bchalios approved these changes May 22, 2023

View reviewed changes

Merge branch 'main' into ratelimiter-edgecases

5af0ac9

mattschlebusch approved these changes May 24, 2023

View reviewed changes

src/rate_limiter/src/lib.rs Show resolved Hide resolved

src/rate_limiter/src/lib.rs Show resolved Hide resolved

src/rate_limiter/src/lib.rs Show resolved Hide resolved

Merge branch 'main' into ratelimiter-edgecases

6da715c

roypat merged commit 2bd40cd into firecracker-microvm:main May 25, 2023
4 checks passed

kalyazin mentioned this pull request Jun 2, 2023

1.4: Correctly handle some extreme edge cases in the ratelimiter implementation #3706 #3781

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Correctly handle some extreme edge cases in the ratelimiter implementation #3706

Correctly handle some extreme edge cases in the ratelimiter implementation #3706

roypat commented May 19, 2023

bchalios left a comment

bchalios commented May 22, 2023

mattschlebusch left a comment

Correctly handle some extreme edge cases in the ratelimiter implementation #3706

Correctly handle some extreme edge cases in the ratelimiter implementation #3706

Conversation

roypat commented May 19, 2023

Changes

License Acceptance

PR Checklist

bchalios left a comment

Choose a reason for hiding this comment

bchalios commented May 22, 2023

mattschlebusch left a comment

Choose a reason for hiding this comment