sink to mysql (cdc) workload skew issue #10341

Closed
zhangjinpeng87 opened this issue Dec 21, 2023 · 3 comments · Fixed by #10376
Labels
affects-6.5 affects-7.1 affects-7.5 area/ticdc Issues or PRs related to TiCDC. type/enhancement This is a enhancement PR

Comments

@zhangjinpeng87
Contributor

zhangjinpeng87 commented Dec 21, 2023

Workload Skew Problem

The MySQL sink uses the conflict detector to detect potential conflicts between transactions and to build a dependency graph. When a transaction depends on other transactions, it must execute after them. That logic is simple and straightforward, but the conflict detector also introduces a fast dependency-resolving optimization. For example, if transaction B depends only on transaction A, and A has already been sent to a particular worker, the conflict detector resolves the dependency and sends B to that same worker, even before A has actually executed. Because each worker executes the transactions it receives one by one, in order, this still guarantees that B executes after A. The optimization lets the detector resolve the dependency and dispatch a transaction early whenever that transaction depends on just one other transaction.
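
As a concrete illustration, here is a minimal Go sketch of the single-dependency fast path described above; the Node type, its fields, and tryEarlyResolve are hypothetical names for illustration, not the actual TiCDC code.

```go
package main

import "fmt"

// Node represents a transaction in the dependency graph; all names here
// are hypothetical, not the actual TiCDC conflict detector types.
type Node struct {
	deps     []*Node // transactions this one conflicts with
	workerID int     // -1 until the transaction is dispatched to a worker
}

// tryEarlyResolve dispatches n without waiting for its dependency to
// execute: if n depends on exactly one transaction that is already bound
// to a worker, sending n to the same worker preserves ordering, because
// each worker executes its queue sequentially.
func tryEarlyResolve(n *Node) bool {
	if len(n.deps) == 1 && n.deps[0].workerID >= 0 {
		n.workerID = n.deps[0].workerID
		return true
	}
	return false
}

func main() {
	a := &Node{workerID: 1}                    // Txn A already sent to worker 1
	b := &Node{deps: []*Node{a}, workerID: -1} // Txn B depends only on A
	if tryEarlyResolve(b) {
		fmt.Printf("Txn B routed to worker %d before Txn A executes\n", b.workerID)
	}
}
```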

However, this early dependency resolving can also cause a workload skew issue (one worker handling most of the transactions) in some cases, and the MySQL sink then accumulates a large lag because its throughput cannot keep up with the upstream QPS. For example, suppose four transactions A, B, C, and D arrive at the conflict detector in that order. Transaction A has no dependency on other transactions and is sent to worker 1. Transaction B is a big transaction containing multiple row changes and depends on A because of key 1; since B depends only on A, the early dependency resolving optimization sends B to worker 1 as soon as possible. Transaction C contains one key, 555, and depends on B because of key 555; transaction D contains two keys, 1000 and 1001, and depends on B because of key 1000. Both C and D qualify for the early dependency resolving optimization, so A, B, C, and D are all routed to the same worker 1. The example shows that the optimization tends to "attract" more and more transactions to a single worker, even though, once transaction B has executed, transactions C and D could run in parallel on different workers.

Txn C(row555)                                         Txn D(row1000, row1001)
         |                                                       |
         |                                                       |
         ------------> Txn B(row1, row2, ..., row1000) <-------
                                          |
                                          |
                                          -------------> Txn A(row1)    worker1
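
The same shortcut, applied to the four transactions in the diagram, pins all of them to one worker. The small, self-contained Go simulation below shows that routing decision; the txn type and the round-robin fallback are assumptions for illustration, not the real conflict detector.

```go
package main

import "fmt"

type txn struct {
	name     string
	deps     []*txn
	workerID int
}

// dispatch routes a transaction. With the single-dependency shortcut, a
// transaction whose only dependency is already bound to a worker is pinned
// to that same worker, even though the dependency has not executed yet.
func dispatch(t *txn, nextWorker *int, numWorkers int) {
	if len(t.deps) == 1 && t.deps[0].workerID >= 0 {
		t.workerID = t.deps[0].workerID
		return
	}
	// Otherwise pick a worker round-robin (a stand-in for the real policy).
	t.workerID = *nextWorker % numWorkers
	*nextWorker++
}

func main() {
	a := &txn{name: "A", workerID: -1}
	b := &txn{name: "B", deps: []*txn{a}, workerID: -1}
	c := &txn{name: "C", deps: []*txn{b}, workerID: -1}
	d := &txn{name: "D", deps: []*txn{b}, workerID: -1}

	next, workers := 1, 4 // start at worker 1 only to match the numbering above
	for _, t := range []*txn{a, b, c, d} {
		dispatch(t, &next, workers)
		fmt.Printf("Txn %s -> worker %d\n", t.name, t.workerID)
	}
	// Prints worker 1 for A, B, C and D: the shortcut "attracts" every
	// follower of B onto the worker that A happened to land on.
}
```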

New Proposal

The motivation of the early dependency resolving optimization is to resolve transaction dependencies as early as possible. However, resolving a dependency only after the depended-on transactions have actually executed, and then sending the transaction to a random worker, is a better choice in terms of stability and predictable throughput.
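
A minimal sketch of that rule, assuming a per-transaction readiness check and a uniformly random worker choice; the txn type and helper functions are illustrative, not the TiCDC implementation.

```go
package main

import (
	"fmt"
	"math/rand"
)

type txn struct {
	name     string
	deps     []*txn
	executed bool
}

// ready reports whether every dependency of t has finished executing.
func ready(t *txn) bool {
	for _, d := range t.deps {
		if !d.executed {
			return false
		}
	}
	return true
}

// pickWorker resolves the dependency late: it refuses to choose a worker
// until all dependencies are done, then spreads ready transactions over
// the workers uniformly at random.
func pickWorker(t *txn, numWorkers int) (int, bool) {
	if !ready(t) {
		return -1, false // caller retries once a dependency completes
	}
	return rand.Intn(numWorkers), true
}

func main() {
	b := &txn{name: "B", executed: true} // B has already executed
	c := &txn{name: "C", deps: []*txn{b}}
	d := &txn{name: "D", deps: []*txn{b}}
	for _, t := range []*txn{c, d} {
		if w, ok := pickWorker(t, 4); ok {
			fmt.Printf("Txn %s -> worker %d\n", t.name, w)
		}
	}
	// C and D are now free to land on different workers instead of
	// following B to a single worker.
}
```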

@asddongmen
Contributor

asddongmen commented Dec 28, 2023

Even if the early conflict resolving mechanism is removed, the workload skew issue may still exist in some rare cases.
Let's consider two tables, t1 and t2, being replicated in a CDC node. If t1 has a lot of transactions that sequentially depend on one another, such as:

Txn N(1, 2, 3, 4, 5, …, n) → … → Txn 3(1, 2, 3) → Txn 2(1, 2) → Txn 1(1)

Every transaction that arrives later still has to wait for the earlier transactions it depends on to be executed.

This can cause the memory quota of the processor to be consumed by a single table, starving the other tables and ultimately causing the workload skew issue.

The root causes of this problem are as follows:

  1. The memory quota distribution algorithm works in an "optimistic" way: it does not track the memory a table has consumed and always grants quota to the applicant (a possible fix is sketched after this list).
  2. The sinkManager prioritizes the slowest table to pull data from the sorter to the sink, exacerbating the situation.
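
As an illustration of the first root cause, the following hedged Go sketch tracks per-table memory consumption and caps a single table's share; the quota type, perTableCap, and the numbers are hypothetical, not the real sinkManager quota API.

```go
package main

import "fmt"

// quota tracks how much memory each table has consumed, so one table
// cannot monopolize the processor's budget.
type quota struct {
	total       uint64
	used        uint64
	tableMem    map[int64]uint64 // consumed bytes per table
	perTableCap uint64           // fair-share cap for a single table
}

// tryAcquire grants bytes to tableID only if both the global budget and
// the table's own cap allow it; otherwise the caller must back off.
func (q *quota) tryAcquire(tableID int64, bytes uint64) bool {
	if q.used+bytes > q.total || q.tableMem[tableID]+bytes > q.perTableCap {
		return false
	}
	q.used += bytes
	q.tableMem[tableID] += bytes
	return true
}

// release returns bytes previously acquired for tableID.
func (q *quota) release(tableID int64, bytes uint64) {
	q.used -= bytes
	q.tableMem[tableID] -= bytes
}

func main() {
	q := &quota{total: 1 << 20, perTableCap: 1 << 19, tableMem: map[int64]uint64{}}
	fmt.Println(q.tryAcquire(1, 1<<19)) // true: t1 takes its full share
	fmt.Println(q.tryAcquire(1, 1))     // false: t1 cannot starve other tables
	fmt.Println(q.tryAcquire(2, 1<<18)) // true: t2 still gets quota
}
```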

@zhangjinpeng87
Contributor Author


@asddongmen In the case you described, all transactions of t1 are executed sequentially in both the upstream TiDB and TiCDC, since the transactions depend on one another in a chain. After this PR, those transactions can be routed to different workers, so I don't think this case causes a skew issue; it is a slowness (low CPU utilization) issue that needs other, more thorough solutions.

@flowbehappy
Collaborator

We found the same issue in some users' environments running older versions, so we will cherry-pick this enhancement back to older versions.

@jebter jebter added the area/ticdc Issues or PRs related to TiCDC. label Mar 28, 2024
ti-chi-bot bot pushed a commit that referenced this issue Apr 7, 2024