-
Notifications
You must be signed in to change notification settings - Fork 709
add troubleshooting-write-conflict #3064
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
32 commits
Select commit
Hold shift + click to select a range
d9a23c6
init
crazycs520 d9c54aa
update
crazycs520 a109c61
refine
crazycs520 68125c3
Merge branch 'master' of https://github.com/pingcap/docs into translate
crazycs520 0f41e53
add png
crazycs520 ab9f9af
refine
crazycs520 7ac0cc7
Merge branch 'master' into translate
yikeke a42d7ed
Merge branch 'master' into translate
lilin90 29742fc
address comment
crazycs520 13a3820
Update troubleshoot-write-conflicts.md
crazycs520 48bc37a
Update troubleshoot-write-conflicts.md
crazycs520 9d6c8e4
Update troubleshoot-write-conflicts.md
crazycs520 1138e05
Update troubleshoot-write-conflicts.md
crazycs520 e6668cb
Update troubleshoot-write-conflicts.md
crazycs520 f055478
Update troubleshoot-write-conflicts.md
crazycs520 9e50508
Update troubleshoot-write-conflicts.md
crazycs520 8931c5c
Update troubleshoot-write-conflicts.md
crazycs520 06c356a
Update troubleshoot-write-conflicts.md
crazycs520 6073d40
Update troubleshoot-write-conflicts.md
crazycs520 66e12d3
Update troubleshoot-write-conflicts.md
crazycs520 94472e1
Update troubleshoot-write-conflicts.md
crazycs520 6465de1
Update troubleshoot-write-conflicts.md
crazycs520 b1c0d4d
Update troubleshoot-write-conflicts.md
crazycs520 367774f
Update troubleshoot-write-conflicts.md
crazycs520 55fdf7b
Update troubleshoot-write-conflicts.md
crazycs520 ca62fac
add toc
crazycs520 0cb9812
Merge branch 'translate' of https://github.com/crazycs520/docs into t…
crazycs520 596fd76
Merge branch 'master' into translate
crazycs520 656f425
Update TOC.md
lilin90 e5581dc
Optimize descriptions and fix typo
lilin90 1332853
Merge branch 'translate' of https://github.com/crazycs520/docs into p…
lilin90 4ad9bd4
Merge branch 'master' into translate
ti-srebot File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,94 @@ | ||
| --- | ||
| title: Troubleshoot Write Conflicts in Optimistic Transactions | ||
| summary: Learn about the reason of and solutions to write conflicts in optimistic transactions. | ||
| category: troubleshoot | ||
| aliases: ['/docs/dev/troubleshoot-write-conflicts/'] | ||
| --- | ||
|
|
||
| # Troubleshoot Write Conflicts in Optimistic Transactions | ||
|
|
||
| This document introduces the reason of and solutions to write conflicts in optimistic transactions. | ||
|
|
||
| Before TiDB v3.0.8, TiDB uses the optimistic transaction model by default. In this model, TiDB does not check conflicts during transaction execution. Instead, while the transaction is finally committed, the two-phase commit (2PC) is triggered and TiDB checks write conflicts. If a write conflict exists and the auto-retry mechanism is enabled, then TiDB retries the transaction within limited times. If the retry succeeds or has reached the upper limit on retry times, TiDB returns the result of transaction execution to the client. Therefore, if a lot of write conflicts exist in the TiDB cluster, the duration can be longer. | ||
|
|
||
| ## The reason of write conflicts | ||
|
|
||
| TiDB implements its transactions by using the [Percolator](https://www.usenix.org/legacy/event/osdi10/tech/full_papers/Peng.pdf) transaction model. `percolator` is generally an implementation of 2PC. For the detailed 2PC process, see [TiDB Optimistic Transaction Model](/optimistic-transaction.md). | ||
|
|
||
| After the client sends a `COMMIT` request to TiDB, TiDB starts the 2PC process: | ||
|
|
||
| 1. TiDB chooses one key from all keys in the transaction as the primary key of the transaction. | ||
| 2. TiDB sends the `prewrite` request to all the TiKV Regions involved in this commit. TiKV judges whether all keys can preview successfully. | ||
| 3. TiDB receives the result that all `prewrite` requests are successful. | ||
| 4. TiDB gets the `commit_ts` from PD. | ||
| 5. TiDB sends the `commit` request to the TiKV Region that contains the primary key of the transaction. After TiKV receives the `commit` request, it checks the validity of the data and clears the locks left in the `prewrite` stage. | ||
| 6. After the `commit` request returns successfully, TiDB returns success to the client. | ||
|
|
||
| The write conflict occurs in the `prewrite` stage. When the transaction finds that another transaction is writing the current key (`data.commit_ts` > `txn.start_ts`), a write conflict occurs. | ||
|
|
||
| ## Detect write conflicts | ||
|
|
||
| In the TiDB Grafana panel, check the following monitoring metrics under **KV Errors**: | ||
|
|
||
| * **KV Backoff OPS** indicates the count of error messages per second returned by TiKV. | ||
|
|
||
|  | ||
|
|
||
| The `txnlock` metric indicates the write-write conflict. The `txnLockFast` metric indicates the read-write conflict. | ||
|
|
||
| * **Lock Resolve OPS** indicates the count of items related to transaction conflicts per second: | ||
|
|
||
|  | ||
|
|
||
| - `not_expired` indicates the TTL of the lock was not expired. The conflict transaction cannot resolve locks until the TTL is expired. | ||
| - `wait_expired` indicates that the transaction needs to wait the lock to expire. | ||
| - `expired` indicates the TTL of the lock was expired. Then the conflict transaction can resolve this lock. | ||
|
|
||
| * **KV Retry Duration** indicates the duration of re-sends the KV request: | ||
|
|
||
|  | ||
|
|
||
| You can also use `[kv:9007]Write conflict` as the key word to search in the TiDB log. The key word also indicates the write conflict exists in the cluster. | ||
|
|
||
| ## Resolve write conflicts | ||
|
|
||
| If many write conflicts exist in the cluster, it is recommended to find out the write conflict key and the reason, and then try to change the application logic to avoid write conflicts. When the write conflict exists in the cluster, you can see the log similar to the following one in the TiDB log file: | ||
|
|
||
| ```log | ||
| [2020/05/12 15:17:01.568 +08:00] [WARN] [session.go:446] ["commit failed"] [conn=3] ["finished txn"="Txn{state=invalid}"] [error="[kv:9007]Write conflict, txnStartTS=416617006551793665, conflictStartTS=416617018650001409, conflictCommitTS=416617023093080065, key={tableID=47, indexID=1, indexValues={string, }} primary={tableID=47, indexID=1, indexValues={string, }} [try again later]"] | ||
| ``` | ||
|
|
||
| The explanation of the log above is as follows: | ||
|
|
||
| * `[kv:9007]Write conflict`: indicates the write-write conflict. | ||
| * `txnStartTS=416617006551793665`:indicates the `start_ts` of the current transaction. You can use the `pd-ctl` tool to convert `start_ts` to physical time. | ||
| * `conflictStartTS=416617018650001409`: indicates the `start_ts` of the write conflict transaction. | ||
| * `conflictCommitTS=416617023093080065`: indicates the `commit_ts` of the write conflict transaction. | ||
| * `key={tableID=47, indexID=1, indexValues={string, }}`:indicates the write conflict key. `tableID` indicates the ID of the write conflict table. `indexID` indicates the ID of write conflict index. If the write conflict key is a record key, the log prints `handle=x`, indicating which record(row) has a conflict. `indexValues` indicates the value of the index that has a conflict. | ||
| * `primary={tableID=47, indexID=1, indexValues={string, }}`: indicates the primary key information of the current transaction. | ||
|
|
||
| You can use the `pd-ctl` tool to convert the timestamp to readable time: | ||
|
|
||
| {{< copyable "" >}} | ||
|
|
||
| ```shell | ||
| ./pd-ctl -u https://127.0.0.1:2379 tso {TIMESTAMP} | ||
| ``` | ||
|
|
||
| You can use `tableID` to find the name of the related table: | ||
|
|
||
| {{< copyable "" >}} | ||
|
|
||
| ```shell | ||
| curl http://{TiDBIP}:10080/db-table/{tableID} | ||
| ``` | ||
|
|
||
| You can use `indexID` and the table name to find the name of the related index: | ||
|
|
||
| {{< copyable "sql" >}} | ||
|
|
||
| ```sql | ||
| SELECT * FROM INFORMATION_SCHEMA.TIDB_INDEXES WHERE TABLE_SCHEMA='{table_name}' AND TABLE_NAME='{table_name}' AND INDEX_ID={indexID}; | ||
| ``` | ||
|
|
||
| In addition, in TiDB v3.0.8 and later versions, the pessimistic transaction becomes the default model. The pessimistic transaction model can avoid write conflicts during the transaction prewrite stage, so you do not need to modify the application any more. In the pessimistic transaction mode, each DML statement writes a pessimistic lock to the related keys during execution. This pessimistic lock can prevent other transactions from modifying the same keys, thus ensuring no write conflicts exist in the `prewrite` stage of the transaction 2PC. | ||
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.