Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
32 commits
Select commit Hold shift + click to select a range
d9a23c6
init
crazycs520 Jun 27, 2020
d9c54aa
update
crazycs520 Jun 27, 2020
a109c61
refine
crazycs520 Jun 29, 2020
68125c3
Merge branch 'master' of https://github.com/pingcap/docs into translate
crazycs520 Jun 29, 2020
0f41e53
add png
crazycs520 Jun 29, 2020
ab9f9af
refine
crazycs520 Jun 29, 2020
7ac0cc7
Merge branch 'master' into translate
yikeke Jun 30, 2020
a42d7ed
Merge branch 'master' into translate
lilin90 Jul 7, 2020
29742fc
address comment
crazycs520 Jul 7, 2020
13a3820
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
48bc37a
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
9d6c8e4
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
1138e05
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
e6668cb
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
f055478
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
9e50508
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
8931c5c
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
06c356a
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
6073d40
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
66e12d3
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
94472e1
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
6465de1
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
b1c0d4d
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
367774f
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
55fdf7b
Update troubleshoot-write-conflicts.md
crazycs520 Jul 8, 2020
ca62fac
add toc
crazycs520 Jul 8, 2020
0cb9812
Merge branch 'translate' of https://github.com/crazycs520/docs into t…
crazycs520 Jul 8, 2020
596fd76
Merge branch 'master' into translate
crazycs520 Jul 8, 2020
656f425
Update TOC.md
lilin90 Jul 8, 2020
e5581dc
Optimize descriptions and fix typo
lilin90 Jul 8, 2020
1332853
Merge branch 'translate' of https://github.com/crazycs520/docs into p…
lilin90 Jul 8, 2020
4ad9bd4
Merge branch 'master' into translate
ti-srebot Jul 9, 2020
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions TOC.md
Original file line number Diff line number Diff line change
Expand Up @@ -88,6 +88,7 @@
+ [TiDB Troubleshooting Map](/tidb-troubleshooting-map.md)
+ [Troubleshoot TiCDC](/ticdc/troubleshoot-ticdc.md)
+ [Troubleshoot TiFlash](/tiflash/troubleshoot-tiflash.md)
+ [Troubleshoot Write Conflicts in Optimistic Transactions](/troubleshoot-write-conflicts.md)
+ Performance Tuning
+ System Tuning
+ [Operating System Tuning](/tune-operating-system.md)
Expand Down
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
94 changes: 94 additions & 0 deletions troubleshoot-write-conflicts.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,94 @@
---
title: Troubleshoot Write Conflicts in Optimistic Transactions
summary: Learn about the reason of and solutions to write conflicts in optimistic transactions.
category: troubleshoot
aliases: ['/docs/dev/troubleshoot-write-conflicts/']
---

# Troubleshoot Write Conflicts in Optimistic Transactions

This document introduces the reason of and solutions to write conflicts in optimistic transactions.

Before TiDB v3.0.8, TiDB uses the optimistic transaction model by default. In this model, TiDB does not check conflicts during transaction execution. Instead, while the transaction is finally committed, the two-phase commit (2PC) is triggered and TiDB checks write conflicts. If a write conflict exists and the auto-retry mechanism is enabled, then TiDB retries the transaction within limited times. If the retry succeeds or has reached the upper limit on retry times, TiDB returns the result of transaction execution to the client. Therefore, if a lot of write conflicts exist in the TiDB cluster, the duration can be longer.

## The reason of write conflicts

TiDB implements its transactions by using the [Percolator](https://www.usenix.org/legacy/event/osdi10/tech/full_papers/Peng.pdf) transaction model. `percolator` is generally an implementation of 2PC. For the detailed 2PC process, see [TiDB Optimistic Transaction Model](/optimistic-transaction.md).

After the client sends a `COMMIT` request to TiDB, TiDB starts the 2PC process:

1. TiDB chooses one key from all keys in the transaction as the primary key of the transaction.
2. TiDB sends the `prewrite` request to all the TiKV Regions involved in this commit. TiKV judges whether all keys can preview successfully.
3. TiDB receives the result that all `prewrite` requests are successful.
4. TiDB gets the `commit_ts` from PD.
5. TiDB sends the `commit` request to the TiKV Region that contains the primary key of the transaction. After TiKV receives the `commit` request, it checks the validity of the data and clears the locks left in the `prewrite` stage.
6. After the `commit` request returns successfully, TiDB returns success to the client.

The write conflict occurs in the `prewrite` stage. When the transaction finds that another transaction is writing the current key (`data.commit_ts` > `txn.start_ts`), a write conflict occurs.

## Detect write conflicts

In the TiDB Grafana panel, check the following monitoring metrics under **KV Errors**:

* **KV Backoff OPS** indicates the count of error messages per second returned by TiKV.

![kv-backoff-ops](/media/troubleshooting-write-conflict-kv-backoff-ops.png)

The `txnlock` metric indicates the write-write conflict. The `txnLockFast` metric indicates the read-write conflict.

* **Lock Resolve OPS** indicates the count of items related to transaction conflicts per second:

![lock-resolve-ops](/media/troubleshooting-write-conflict-lock-resolve-ops.png)

- `not_expired` indicates the TTL of the lock was not expired. The conflict transaction cannot resolve locks until the TTL is expired.
- `wait_expired` indicates that the transaction needs to wait the lock to expire.
- `expired` indicates the TTL of the lock was expired. Then the conflict transaction can resolve this lock.

* **KV Retry Duration** indicates the duration of re-sends the KV request:

![kv-retry-duration](/media/troubleshooting-write-conflict-kv-retry-duration.png)

You can also use `[kv:9007]Write conflict` as the key word to search in the TiDB log. The key word also indicates the write conflict exists in the cluster.

## Resolve write conflicts

If many write conflicts exist in the cluster, it is recommended to find out the write conflict key and the reason, and then try to change the application logic to avoid write conflicts. When the write conflict exists in the cluster, you can see the log similar to the following one in the TiDB log file:

```log
[2020/05/12 15:17:01.568 +08:00] [WARN] [session.go:446] ["commit failed"] [conn=3] ["finished txn"="Txn{state=invalid}"] [error="[kv:9007]Write conflict, txnStartTS=416617006551793665, conflictStartTS=416617018650001409, conflictCommitTS=416617023093080065, key={tableID=47, indexID=1, indexValues={string, }} primary={tableID=47, indexID=1, indexValues={string, }} [try again later]"]
```

The explanation of the log above is as follows:

* `[kv:9007]Write conflict`: indicates the write-write conflict.
* `txnStartTS=416617006551793665`:indicates the `start_ts` of the current transaction. You can use the `pd-ctl` tool to convert `start_ts` to physical time.
* `conflictStartTS=416617018650001409`: indicates the `start_ts` of the write conflict transaction.
* `conflictCommitTS=416617023093080065`: indicates the `commit_ts` of the write conflict transaction.
* `key={tableID=47, indexID=1, indexValues={string, }}`:indicates the write conflict key. `tableID` indicates the ID of the write conflict table. `indexID` indicates the ID of write conflict index. If the write conflict key is a record key, the log prints `handle=x`, indicating which record(row) has a conflict. `indexValues` indicates the value of the index that has a conflict.
* `primary={tableID=47, indexID=1, indexValues={string, }}`: indicates the primary key information of the current transaction.

You can use the `pd-ctl` tool to convert the timestamp to readable time:

{{< copyable "" >}}

```shell
./pd-ctl -u https://127.0.0.1:2379 tso {TIMESTAMP}
```

You can use `tableID` to find the name of the related table:

{{< copyable "" >}}

```shell
curl http://{TiDBIP}:10080/db-table/{tableID}
```

You can use `indexID` and the table name to find the name of the related index:

{{< copyable "sql" >}}

```sql
SELECT * FROM INFORMATION_SCHEMA.TIDB_INDEXES WHERE TABLE_SCHEMA='{table_name}' AND TABLE_NAME='{table_name}' AND INDEX_ID={indexID};
```

In addition, in TiDB v3.0.8 and later versions, the pessimistic transaction becomes the default model. The pessimistic transaction model can avoid write conflicts during the transaction prewrite stage, so you do not need to modify the application any more. In the pessimistic transaction mode, each DML statement writes a pessimistic lock to the related keys during execution. This pessimistic lock can prevent other transactions from modifying the same keys, thus ensuring no write conflicts exist in the `prewrite` stage of the transaction 2PC.