
admission: adjust token computation during WAL failover #120135

Merged: 1 commit merged into cockroachdb:master on Mar 14, 2024

Conversation

sumeerbhola (Collaborator):

During WAL failover, possibly caused by a disk stall in the primary location, flushes and compactions can also be stalled. This can cause admission control to compute artificially low compaction tokens out of L0 (if L0 has an elevated score) and artificially low flush tokens (which are meant to prevent memtable write stalls).

The solution outlined here detects WAL failover by looking at increases in the pebble metric WAL.Failover.SecondaryWriteDuration. If an increase happened in the last 15s interval (the token computation interval), the current flush and compaction bytes are ignored for the purpose of smoothing and therefore ignored for computing tokens. For regular work, the previous smoothed compaction tokens continue to be used, and flush tokens are unlimited. For elastic work, the tokens are reduced to near zero. An alternative is to allow unlimited tokens during the stall, but it runs the risk of over-admitting. We allow this alternative to be configured by changing the cluster setting
admission.wal.failover.unlimited_tokens.enabled to true.

Informs cockroachdb/pebble#3230

Informs CRDB-35401

Epic: none

Release note (ops change): The cluster setting
admission.wal.failover.unlimited_tokens.enabled can be set to true to cause unlimited admission tokens during WAL failover. This should not be changed without consulting the admission control team, since the default, which preserves the token counts from the preceding non-WAL-failover interval, is expected to be safer.

@sumeerbhola sumeerbhola requested a review from a team as a code owner March 8, 2024 15:52
@cockroach-teamcity (Member):

This change is Reviewable

@sumeerbhola (Collaborator, Author) left a comment:

I'll add tests if this looks ok.

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @aadityasondhi and @jbowens)

@aadityasondhi (Collaborator) left a comment:

Looks good to me so far

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @jbowens)

@aadityasondhi (Collaborator) left a comment:

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @jbowens and @sumeerbhola)


pkg/util/admission/io_load_listener.go line 785 at r1 (raw file):

	// once the primary is considered healthy, say within 10s. So a disk stall
	// in the primary that lasts 30s, will cause WAL failover for ~40s, and a
	// disk stall for 1s will cause failover for ~11s. The latter (11s) is short

In the former case, are we worried that an extended failover will cause a backlog of flushes that will then hit L0 all at the same time once we failback?

@sumeerbhola (Collaborator, Author) left a comment:

TFTR! I will ping when tests are ready.

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @aadityasondhi and @jbowens)


pkg/util/admission/io_load_listener.go line 785 at r1 (raw file):

Previously, aadityasondhi (Aaditya Sondhi) wrote…

In the former case, are we worried that an extended failover will cause a backlog of flushes that will then hit L0 all at the same time once we failback?

If we have a backlog of immutable memtables they are all flushed in one flush, so will add a single sub-level.

@sumeerbhola (Collaborator, Author) left a comment:

test is ready

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @aadityasondhi and @jbowens)

@aadityasondhi (Collaborator) left a comment:

:lgtm:

Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @jbowens)

@sumeerbhola (Collaborator, Author):

bors r=aadityasondhi


craig bot commented Mar 14, 2024

Build failed (retrying...):


craig bot commented Mar 14, 2024

@craig craig bot merged commit db58abe into cockroachdb:master Mar 14, 2024
19 checks passed