txn: only wake up waiters when locks are indeed released #7379

youjiali1995 · 2020-04-07T13:19:25Z

Signed-off-by: youjiali1995 zlwgx1023@gmail.com

What problem does this PR solve?

Problem Summary:
TiKV will wake up waiters as long as it receives requests that may release locks, e.g., pessimistic_rollback, rollback, commit. If a request doesn't release locks, typically the lock doesn't exist, it needn't wake up waiters.

In TiDB, if a pessimistic DML meets write conflict, it will use pessimistic_rollback to clean up all locks it needs to lock in this DML and then retry the DML. If a transaction is waked up and there are other transactions waiting for the lock, these transactions will be waked up by pessimistic_rollback one by one. It dramatically affects performance and results in useless retry.

What is changed and how it works?

What's Changed:
Only wake up waiters when locks are indeed released and small refactor.

Related changes

Need to cherry-pick to the release branch

Check List

Tests

Manual test (add detailed scripts or steps below)
No code

I think existing tests are enough and I benched it using sysbench with the workload below:

con:query(CREATE TABLE wmtest(k INT PRIMARY KEY, v INT))
con:query(string.format("UPDATE wmtest SET v = v + 1 WHERE k IN (%d, 1)", sysbench.rand.uniform(2, sysbench.opt.table_size)))

master:

threads	tps	avg lat(ms)	.95 lat(ms)	.99 retry count
1	783	1.28	1.79	0
100	71	1393	2493	800
500	16	30111	72316	1457
1000	12	71181	100000	654

This PR:

threads	tps	avg lat(ms)	.95 lat(ms)	.99 retry count
100	772	129	155	1
500	724	688	787	1
1000	647	1534	1903	5

Release note

Fix the issue that needless wake-up results in useless retry and performance reduction in heavy contention workloads.

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 · 2020-04-07T13:19:49Z

/release

sre-bot · 2020-04-07T13:39:28Z

download tikv at http://fileserver.pingcap.net/download/builds/pingcap/tikv/pr/f0940bded80643cd53de49b5ff30fee7e6397e10/centos7/tikv-server.tar.gz

sticnarf

LGTM except a small question.

src/storage/txn/process.rs

youjiali1995 · 2020-04-09T02:25:14Z

If the DML only need to lock one key, it won't send pessimistic_rollback before retry. That's why I didn't find the bug earlier:
https://github.com/pingcap/tidb/blob/b54ac5b2ec3bd1f5de37eb813d17de43cc500bf3/store/tikv/txn.go#L397

nrc

Looks good, a few comments inline, none are major

src/storage/mvcc/txn.rs

nrc · 2020-04-12T22:54:02Z

src/storage/txn/process.rs

+        }
+    }
+
+    fn push(&mut self, lock: ReleasedLock) {


You could implement a from_iter method to build the hashes Vec in one go, rather than using for loops and push

Sorry, I don't get it. I still need to push hash to a vec to get the iter..

So for example, rather than use

for k in keys { released_locks.push(txn.rollback(k)?); }

you can use

released_locks.hashes(keys.iter().map(|k| txn.rollback(k)))?;

where hashes is something like:

fn hashes<I, T, E>(&mut self, iter: I) -> Result<(), E> where I: Iterator<Item = Result<T, E>> { self.hashes = iter.collect()?; Ok(()) }

I see. Thanks!

I tried to implement it but I found it's more complex than for loop:

impl ReleasedLocks { fn hashes<I, E>(&mut self, iter: I) -> std::result::Result<(), E> where I: Iterator<Item = std::result::Result<Option<ReleasedLock>, E>>, { self.hashes = iter .filter_map(|v| match v { Ok(Some(lock)) => { if !self.pessimistic { self.pessimistic = lock.pessimistic; } Some(Ok(lock.hash)) } Ok(None) => None, Err(e) => Some(Err(e)), }) .collect::<std::result::Result<Vec<u64>, E>>()?; Ok(()) } } fn process_write_impl(...) { ... let mut released_locks = ReleasedLocks::new(lock_ts, commit_ts); // for k in keys { // released_locks.push(txn.commit(k, commit_ts)?); // } released_locks.hashes(keys.into_iter().map(|k| txn.commit(k, commit_ts)))?; released_locks.wake_up(lock_mgr.as_ref()); ... }

And I still need to call push when processing ResolveLock..

@nrc PTAL again. Thanks!

src/storage/txn/process.rs

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 · 2020-04-14T07:55:14Z

/run-all-tests

youjiali1995 · 2020-04-14T07:55:23Z

/bench +tpcc

sre-bot · 2020-04-14T11:26:17Z

Benchmark Report

Run TPC-C Performance Test on VMs

@@                               Benchmark Diff                               @@
================================================================================
tidb: c1a31708b0a3eb8e75adbc5cf75c86926fcf4d1b
--- tikv: 2b1b9b2cd537e47591795dc034b59a0585cfe7a8
+++ tikv: 1641dbb41273c50b7a0b630295d65fc9e722076e
pd: 8438f3fc004da1bff7442229e53fe4272f74ce2d
================================================================================
Measured tpmC (NewOrders): 8296.83 ± 1.98% (std=106.77), delta: -2.84% (p=0.036)

youjiali1995 · 2020-04-14T12:16:26Z

/bench +tpcc

…p-when-lock-not-exist Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

sre-bot · 2020-04-14T13:39:57Z

Benchmark Report

Run TPC-C Performance Test on VMs

@@                               Benchmark Diff                               @@
================================================================================
tidb: c1a31708b0a3eb8e75adbc5cf75c86926fcf4d1b
--- tikv: 7935019849d46b7d32b1a6b0d14e795cd7da1591
+++ tikv: 1641dbb41273c50b7a0b630295d65fc9e722076e
pd: 8438f3fc004da1bff7442229e53fe4272f74ce2d
================================================================================
Measured tpmC (NewOrders): 7007.72 ± 1.01% (std=66.19), delta: -1.77% (p=0.072)

nrc

LGTM, thanks for the changes!

youjiali1995 · 2020-04-17T03:36:03Z

@MyonKeminta PTAL

sre-bot · 2020-04-20T03:41:57Z

/run-all-tests

Signed-off-by: sre-bot <sre-bot@pingcap.com>

sre-bot · 2020-04-20T04:15:53Z

cherry pick to release-3.0 in PR #7549

Signed-off-by: sre-bot <sre-bot@pingcap.com>

sre-bot · 2020-04-20T04:17:02Z

cherry pick to release-3.1 in PR #7550

Signed-off-by: sre-bot <sre-bot@pingcap.com>

sre-bot · 2020-04-20T04:18:37Z

cherry pick to release-4.0 in PR #7551

Signed-off-by: sre-bot <sre-bot@pingcap.com> Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

Signed-off-by: sre-bot <sre-bot@pingcap.com> Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

Signed-off-by: sre-bot <sre-bot@pingcap.com> Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

txn: only wake up waiters when locks are indeed released (tikv#7379) (tikv#7585) Signed-off-by: youjiali1995 <zlwgx1023@gmail.com> txn: don't protect rollback for BatchRollback (tikv#7605) (tikv#7608) Signed-off-by: youjiali1995 <zlwgx1023@gmail.com> tidb_query: add is true/false keep null ScalarFuncSig (tikv#7532) (tikv#7566) Signed-off-by: zhongzc <zhongzc_arch@outlook.com> tidb_query: fix the logical behavior of floats (tikv#7342) (tikv#7582) Signed-off-by: zhongzc <zhongzc_arch@outlook.com> tidb_query: fix converting bytes to bool (tikv#7486) (tikv#7547) Signed-off-by: zhongzc <zhongzc_arch@outlook.com> raftstore: change the condition of proposing rollback merge (tikv#6584) (tikv#7762) Signed-off-by: Liqi Geng <gengliqiii@gmail.com> Signed-off-by: Tong Zhigao <tongzhigao@pingcap.com>

txn: only wake up waiters when locks are indeed released

f0940bd

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 added this to the v3.0.13 milestone Apr 8, 2020

youjiali1995 marked this pull request as ready for review April 8, 2020 11:27

youjiali1995 requested review from nrc and sticnarf and removed request for nrc April 8, 2020 11:28

sticnarf approved these changes Apr 8, 2020

View reviewed changes

src/storage/txn/process.rs Outdated Show resolved Hide resolved

nrc reviewed Apr 12, 2020

View reviewed changes

youjiali1995 added 2 commits April 13, 2020 21:44

address comments

c9a4a2d

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

always calculate hash of released lock

d155450

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 force-pushed the not-wake-up-when-lock-not-exist branch from 2095388 to 1641dbb Compare April 14, 2020 07:54

Merge branch 'master' of https://github.com/tikv/tikv into not-wake-u…

58feb08

…p-when-lock-not-exist Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 force-pushed the not-wake-up-when-lock-not-exist branch from 1641dbb to 58feb08 Compare April 14, 2020 12:18

youjiali1995 requested a review from MyonKeminta April 16, 2020 07:24

nrc approved these changes Apr 17, 2020

View reviewed changes

sre-bot merged commit b4dd42f into tikv:master Apr 20, 2020

sre-bot pushed a commit to sre-bot/tikv that referenced this pull request Apr 20, 2020

cherry pick tikv#7379 to release-3.0

545c913

Signed-off-by: sre-bot <sre-bot@pingcap.com>

sre-bot mentioned this pull request Apr 20, 2020

txn: only wake up waiters when locks are indeed released (#7379) #7549

Closed

sre-bot pushed a commit to sre-bot/tikv that referenced this pull request Apr 20, 2020

cherry pick tikv#7379 to release-3.1

9a16fe1

Signed-off-by: sre-bot <sre-bot@pingcap.com>

sre-bot mentioned this pull request Apr 20, 2020

txn: only wake up waiters when locks are indeed released (#7379) #7550

Closed

sre-bot pushed a commit to sre-bot/tikv that referenced this pull request Apr 20, 2020

cherry pick tikv#7379 to release-4.0

76e176f

Signed-off-by: sre-bot <sre-bot@pingcap.com>

sre-bot mentioned this pull request Apr 20, 2020

txn: only wake up waiters when locks are indeed released (#7379) #7551

Merged

youjiali1995 deleted the not-wake-up-when-lock-not-exist branch April 20, 2020 06:42

youjiali1995 added a commit that referenced this pull request Apr 21, 2020

cherry pick #7379 to release-4.0

49af8f7

Signed-off-by: sre-bot <sre-bot@pingcap.com> Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 added a commit to sre-bot/tikv that referenced this pull request Apr 21, 2020

cherry pick tikv#7379 to release-4.0

3f63589

Signed-off-by: sre-bot <sre-bot@pingcap.com> Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 added a commit to youjiali1995/tikv that referenced this pull request Apr 21, 2020

txn: only wake up waiters when locks are indeed released (tikv#7379)

2c021d2

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 mentioned this pull request Apr 21, 2020

txn: only wake up waiters when locks are indeed released (#7379) #7584

Merged

youjiali1995 added a commit to youjiali1995/tikv that referenced this pull request Apr 21, 2020

txn: only wake up waiters when locks are indeed released (tikv#7379)

fe00145

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 added a commit to youjiali1995/tikv that referenced this pull request Apr 21, 2020

txn: only wake up waiters when locks are indeed released (tikv#7379)

b41c2eb

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 added a commit to youjiali1995/tikv that referenced this pull request Apr 21, 2020

txn: only wake up waiters when locks are indeed released (tikv#7379)

5db498b

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 mentioned this pull request Apr 21, 2020

txn: only wake up waiters when locks are indeed released (#7379) #7585

Merged

youjiali1995 added a commit to youjiali1995/tikv that referenced this pull request Apr 21, 2020

txn: only wake up waiters when locks are indeed released (tikv#7379)

9688559

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 added a commit to youjiali1995/tikv that referenced this pull request Apr 21, 2020

txn: only wake up waiters when locks are indeed released (tikv#7379)

29750ab

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 added a commit to sre-bot/tikv that referenced this pull request Apr 21, 2020

cherry pick tikv#7379 to release-4.0

fdd4941

Signed-off-by: sre-bot <sre-bot@pingcap.com> Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 mentioned this pull request Apr 21, 2020

storage: fix wrong write size of resolve lock #7590

Merged

sre-bot pushed a commit that referenced this pull request Apr 21, 2020

txn: only wake up waiters when locks are indeed released (#7379) (#7584)

cd3ba29

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

sre-bot added a commit that referenced this pull request Apr 22, 2020

txn: only wake up waiters when locks are indeed released (#7379) (#7551)

d4398c1

Signed-off-by: sre-bot <sre-bot@pingcap.com> Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

youjiali1995 mentioned this pull request Apr 27, 2020

storage: add tests for lock manager #7663

Merged

youjiali1995 added a commit that referenced this pull request Apr 27, 2020

txn: only wake up waiters when locks are indeed released (#7379) (#7585)

7b4344d

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

c1ay pushed a commit to c1ay/tikv that referenced this pull request May 9, 2020

txn: only wake up waiters when locks are indeed released (tikv#7379)

f9f1a14

Signed-off-by: youjiali1995 <zlwgx1023@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

txn: only wake up waiters when locks are indeed released #7379

txn: only wake up waiters when locks are indeed released #7379

youjiali1995 commented Apr 7, 2020 •

edited

youjiali1995 commented Apr 7, 2020

sre-bot commented Apr 7, 2020

sticnarf left a comment

youjiali1995 commented Apr 9, 2020

nrc left a comment

nrc Apr 12, 2020

youjiali1995 Apr 13, 2020 •

edited

nrc Apr 14, 2020

youjiali1995 Apr 14, 2020

youjiali1995 Apr 14, 2020

youjiali1995 Apr 14, 2020 •

edited

youjiali1995 Apr 16, 2020

youjiali1995 commented Apr 14, 2020

youjiali1995 commented Apr 14, 2020

sre-bot commented Apr 14, 2020

youjiali1995 commented Apr 14, 2020

sre-bot commented Apr 14, 2020

nrc left a comment

youjiali1995 commented Apr 17, 2020

sre-bot commented Apr 20, 2020

sre-bot commented Apr 20, 2020

sre-bot commented Apr 20, 2020

sre-bot commented Apr 20, 2020

txn: only wake up waiters when locks are indeed released #7379

txn: only wake up waiters when locks are indeed released #7379

Conversation

youjiali1995 commented Apr 7, 2020 • edited

What problem does this PR solve?

What is changed and how it works?

Related changes

Check List

Release note

youjiali1995 commented Apr 7, 2020

sre-bot commented Apr 7, 2020

sticnarf left a comment

Choose a reason for hiding this comment

youjiali1995 commented Apr 9, 2020

nrc left a comment

Choose a reason for hiding this comment

nrc Apr 12, 2020

Choose a reason for hiding this comment

youjiali1995 Apr 13, 2020 • edited

Choose a reason for hiding this comment

nrc Apr 14, 2020

Choose a reason for hiding this comment

youjiali1995 Apr 14, 2020

Choose a reason for hiding this comment

youjiali1995 Apr 14, 2020

Choose a reason for hiding this comment

youjiali1995 Apr 14, 2020 • edited

Choose a reason for hiding this comment

youjiali1995 Apr 16, 2020

Choose a reason for hiding this comment

youjiali1995 commented Apr 14, 2020

youjiali1995 commented Apr 14, 2020

sre-bot commented Apr 14, 2020

Benchmark Report

youjiali1995 commented Apr 14, 2020

sre-bot commented Apr 14, 2020

Benchmark Report

nrc left a comment

Choose a reason for hiding this comment

youjiali1995 commented Apr 17, 2020

sre-bot commented Apr 20, 2020

sre-bot commented Apr 20, 2020

sre-bot commented Apr 20, 2020

sre-bot commented Apr 20, 2020

youjiali1995 commented Apr 7, 2020 •

edited

youjiali1995 Apr 13, 2020 •

edited

youjiali1995 Apr 14, 2020 •

edited