[DNM] util/quotapool: add LIFO queueing discipline #46655
Draft
ajwerner wants to merge 1 commit into cockroachdb:master
Conversation
Member
7a02b3a to 3a5ae6d
Contributor
Author
@nvanbenschoten @tbg the LIFO generalization of the quotapool.
Contributor
Author
It should be easy enough to replace the notifyQueue with an interface backed by more generalized implementations. This work helps along that path by lifting some of the responsibility up.
Sometimes you care more about something getting done than things getting done in order. This change is motivated by range snapshots. We currently have a timeout when sending a snapshot. In the short term, the easier fix for bad queueing behavior on the snapshot semaphore is to not wait on it for too long. Nevertheless, it motivated the typing of this change so I figure I'll post it.

One point of discussion is the behavior when an acquire call is at the front of the queue and then fails to acquire because there was insufficient quota. Upon release of later quota, if there had been

Release justification: doesn't have one, will wait to merge it.

Release note: None
3a5ae6d to df5d7dd
nvb added a commit to nvb/cockroach that referenced this pull request on Nov 30, 2021

Alternative to cockroachdb#46655.

This commit introduces a new cluster setting called `kv.snapshot_receiver.queue_timeout_fraction` which dictates the fraction of a snapshot's total timeout that it is allowed to spend queued on the receiver waiting for a reservation. Enforcement of this snapshotApplySem-scoped timeout is intended to prevent starvation of snapshots in cases where a queue of snapshots waiting for reservations builds and no single snapshot acquires the semaphore with sufficient time to complete, but each holds the semaphore long enough to ensure that later snapshots in the queue encounter this same situation. This is a case of FIFO queuing + timeouts leading to starvation. By rejecting snapshot attempts earlier, we ensure that those that do acquire the semaphore have sufficient time to complete.

The commit adds a new test called `TestReserveSnapshotQueueTimeoutAvoidsStarvation` which reproduces this starvation without the fix. With the fix, the test passes and goodput never collapses to 0.

This is an alternative to strict LIFO queueing (cockroachdb#46655) and an alternative to Adaptive LIFO queueing (https://queue.acm.org/detail.cfm?id=2839461). The former avoids starvation but at the expense of fairness even under low but steady concurrency. The latter avoids compromising on fairness until it switches from FIFO to LIFO, but is fairly complex. The approach taken in this PR is a compromise that does not trade fairness under low concurrency and is still relatively simple, but does retain some risk of starvation in the case where `totalTimeout - queueTimeout < processingTime`. The default settings ensure that `processingTime` needs to be at least `30s` (assuming `kv.queue.process.guaranteed_time_budget` is used) before this will become a problem in practice.

Release notes (bug fix): Raft snapshots no longer risk starvation under very high concurrency. Before this fix, it was possible that a thundering herd of Raft snapshots could be starved and prevented from succeeding due to timeouts, which were accompanied by errors like `error rate limiting bulk io write: context deadline exceeded`.
nvb added a commit to nvb/cockroach that referenced this pull request on Nov 30, 2021 (same commit message as above)
nvb added a commit to nvb/cockroach that referenced this pull request on Dec 23, 2021 (same commit message as above)
craig bot pushed a commit that referenced this pull request on Dec 23, 2021

73288: kv: apply limited timeout to snapshots waiting in reservation queue r=tbg,erikgrinaker a=nvanbenschoten (same commit message as above)

Co-authored-by: Nathan VanBenschoten <nvanbenschoten@gmail.com>
gustasva pushed a commit to gustasva/cockroach that referenced this pull request on Jan 4, 2022 (same commit message as above)
nvb added a commit to nvb/cockroach that referenced this pull request on May 16, 2022 (same commit message as above)