Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

*: redirect the DML writes to a temp index during creating index #37709

Merged
merged 31 commits into from Sep 15, 2022

Conversation

tangenta
Copy link
Contributor

@tangenta tangenta commented Sep 8, 2022

What problem does this PR solve?

Issue Number: ref #35983

Problem summary:

Previously, the new index records backfill are done by a reorg worker with multiple transactions (DDL txn). When the index schema state has entered delete-only/write-only, according to the online schema change, the DML (DELETE/INSERT/UPDATE) statements can write new records the index (DML txn).

If the same keys are written in both DDL txn and DML txn, one of them will be rollbacked because of the write conflict. This could make either the create-index DDL inefficient or introduce high latency to the DML.

What is changed and how it works?

The old implementation:

rows    = [1, 2, 3]
-- CREATE INDEX
new_idx = [nil, nil, nil] <- [1, 2, 3]  (DDL txn)
-- UPDATE 1 -> 4
new_idx = [nil, nil, nil] <- [4, nil, nil] (DML txn)

This PR provides a new index creating process. We can redirect the DML txn writes to a temporary index once the creating index's schema state has entered delete-only:

rows    = [1, 2, 3]
-- CREATE INDEX
new_idx = [nil, nil, nil] <- [1, 2, 3]  (DDL txn)
-- UPDATE 1 -> 4
new_idx_tmp = [nil, nil, nil] <- [4, nil, nil] (DML txn)

Because the index IDs are different, no write conflicts can happen. After the DDL txns finished, it requires a new txn to merge the temp index record to the origin index.

rows =    [1, 2, 3]
-- CREATE INDEX BACKFILL
new_idx = [nil, nil, nil] <- [1, 2, 3]  (DDL txn)
-- UPDATE 1 -> 4
new_idx_tmp = [nil, nil, nil] <- [4, nil, nil] (DML txn)
-- CREATE INDEX MERGE
new_idx = [1, 2, 3] <- [4, nil, nil]  (DDL-merge txn)

Thus, the whole add index process needs to reorganize the data twice. To control the progress and make sure the data consistency, here we introduce the new field BackfillState to model.IndexInfo. It contains the following values:

  • Inapplicable: this DDL job doesn't use the merge process.
  • Running: the first backfill work is running. The index's write and delete operations are redirected to a temporary index.
  • ReadyToMerge: the first backfill worker is finished. The index's write and delete operations are copied to a temporary index, aka "double write". This state is used to make sure that all the TiDB instances are aware of the copy during the merge.
  • Merging: the merging worker is running. The index's write and delete operations are copied to a temporary index.

The tuple (SchemaState, BackfillState) reflects the current status of creating index.

As a result, the index's schema state changes are as follows:

1. (none, backfill_state_inapplicable)
2. (delete-only, backfill_state_running)
3. (write-only, backfill_state_running)
4. (write-reorg, backfill_state_running)
5. (write-reorg, backfill_state_ready_to_merge)
6. (write-reorg, backfill_state_merging)
7. (public, backfill_state_inapplicable)

There are also changes for GC-delete range: before this PR, ADD INDEX jobs only insert the index ID to gc_delete_range table if it is rollbacked or cancelled. After this PR, we always insert the temporary index ID to gc_delete_range because TiDB could fallback to the old backfill process in extreme cases.

Note that the DML writes to temporary index are handled specially. For example, delete operations are converted to writes like key -> "delete". In the merging stage, TiDB convert these special records back to delete operations.

Check List

Tests

  • Unit test
  • Integration test
    • The realtikvtest/addindextest is enabled with set global tidb_ddl_enable_fast_reorg = on;.
    • Two specific tests with DDL hooks are added.
  • Manual test (add detailed scripts or steps below)
  • No code

Side effects

  • Performance regression: Consumes more CPU
  • Performance regression: Consumes more Memory
  • Breaking backward compatibility

Documentation

  • Affects user behaviors
  • Contains syntax changes
  • Contains variable changes
  • Contains experimental features
  • Changes MySQL compatibility

Release note

Please refer to Release Notes Language Style Guide to write a quality release note.

None

@ti-chi-bot
Copy link
Member

ti-chi-bot commented Sep 8, 2022

[REVIEW NOTIFICATION]

This pull request has been approved by:

  • wjhuang2016
  • xiongjiwei

To complete the pull request process, please ask the reviewers in the list to review by filling /cc @reviewer in the comment.
After your PR has acquired the required number of LGTMs, you can assign this pull request to the committer in the list by filling /assign @committer in the comment to help you merge this pull request.

The full list of commands accepted by this bot can be found here.

Reviewer can indicate their review by submitting an approval review.
Reviewer can cancel approval by submitting a request changes review.

@ti-chi-bot ti-chi-bot added release-note-none size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. labels Sep 8, 2022
@tangenta tangenta requested a review from a team as a code owner September 8, 2022 08:02
@tangenta tangenta changed the title *: redirect the kv writes to a temp index during creating index *: redirect the DML writes to a temp index during creating index Sep 8, 2022
ddl/index.go Outdated Show resolved Hide resolved
ddl/ddl_worker.go Show resolved Hide resolved
tablecodec/tablecodec.go Outdated Show resolved Hide resolved
tablecodec/tablecodec.go Outdated Show resolved Hide resolved
tablecodec/tablecodec.go Outdated Show resolved Hide resolved
tablecodec/tablecodec.go Outdated Show resolved Hide resolved
table/tables/mutation_checker.go Outdated Show resolved Hide resolved
table/tables/index.go Outdated Show resolved Hide resolved
table/tables/index.go Show resolved Hide resolved
tangenta and others added 6 commits September 9, 2022 15:31
Co-authored-by: wjHuang <huangwenjun1997@gmail.com>
Co-authored-by: wjHuang <huangwenjun1997@gmail.com>
Co-authored-by: wjHuang <huangwenjun1997@gmail.com>
Co-authored-by: wjHuang <huangwenjun1997@gmail.com>
Co-authored-by: wjHuang <huangwenjun1997@gmail.com>
tablecodec/tablecodec.go Outdated Show resolved Hide resolved
tablecodec/tablecodec.go Outdated Show resolved Hide resolved
tablecodec/tablecodec.go Outdated Show resolved Hide resolved
tablecodec/tablecodec.go Outdated Show resolved Hide resolved
ddl/index.go Show resolved Hide resolved
ddl/index.go Outdated Show resolved Hide resolved
ddl/index.go Outdated Show resolved Hide resolved
ddl/index_merge_tmp.go Outdated Show resolved Hide resolved
parser/model/model.go Show resolved Hide resolved
ddl/index.go Outdated Show resolved Hide resolved
ddl/index_merge_tmp_test.go Show resolved Hide resolved
ddl/index.go Outdated Show resolved Hide resolved
ddl/index.go Outdated Show resolved Hide resolved
ddl/index.go Outdated Show resolved Hide resolved
tangenta and others added 5 commits September 13, 2022 12:31
Co-authored-by: Lynn <zimu_xia@126.com>
Co-authored-by: Lynn <zimu_xia@126.com>
Co-authored-by: xiongjiwei <xiongjiwei1996@outlook.com>
Co-authored-by: xiongjiwei <xiongjiwei1996@outlook.com>
Co-authored-by: Lynn <zimu_xia@126.com>
Copy link
Collaborator

@Benjamin2037 Benjamin2037 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@ti-chi-bot
Copy link
Member

@Benjamin2037: Thanks for your review. The bot only counts approvals from reviewers and higher roles in list, but you're still welcome to leave your comments.

In response to this:

LGTM

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@ti-chi-bot ti-chi-bot added the status/LGT1 Indicates that a PR has LGTM 1. label Sep 14, 2022
@ti-chi-bot ti-chi-bot added status/LGT2 Indicates that a PR has LGTM 2. and removed status/LGT1 Indicates that a PR has LGTM 1. labels Sep 14, 2022
@tangenta tangenta removed the request for review from a team September 14, 2022 12:00
parser/model/model.go Show resolved Hide resolved
ddl/reorg.go Show resolved Hide resolved
table/tables/index.go Outdated Show resolved Hide resolved
ddl/index_merge_tmp.go Show resolved Hide resolved
ddl/index_merge_tmp.go Outdated Show resolved Hide resolved
ddl/reorg.go Show resolved Hide resolved
table/tables/index.go Outdated Show resolved Hide resolved
ddl/index.go Show resolved Hide resolved
ddl/index_merge_tmp.go Outdated Show resolved Hide resolved
parser/model/model.go Show resolved Hide resolved
ddl/index_merge_tmp_test.go Show resolved Hide resolved
Copy link
Contributor

@zimulala zimulala left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@tangenta
Copy link
Contributor Author

/merge

@ti-chi-bot
Copy link
Member

This pull request has been accepted and is ready to merge.

Commit hash: ead1325

@ti-chi-bot ti-chi-bot added the status/can-merge Indicates a PR has been approved by a committer. label Sep 15, 2022
@ti-chi-bot ti-chi-bot merged commit 40e67ed into pingcap:master Sep 15, 2022
@sre-bot
Copy link
Contributor

sre-bot commented Sep 15, 2022

TiDB MergeCI notify

🔴 Bad News! New failing [1] after this pr merged.
These new failed integration tests seem to be caused by the current PR, please try to fix these new failed integration tests, thanks!

CI Name Result Duration Compare with Parent commit
idc-jenkins-ci-tidb/integration-ddl-test 🟥 failed 1, success 5, total 6 23 min New failing
idc-jenkins-ci-tidb/common-test 🔴 failed 1, success 10, total 11 34 min Existing failure
idc-jenkins-ci-tidb/integration-common-test 🔴 failed 2, success 15, total 17 8 min 5 sec Existing failure
idc-jenkins-ci/integration-cdc-test 🟢 all 37 tests passed 32 min Existing passed
idc-jenkins-ci-tidb/tics-test 🟢 all 1 tests passed 5 min 31 sec Existing passed
idc-jenkins-ci-tidb/sqllogic-test-2 🟢 all 28 tests passed 4 min 23 sec Existing passed
idc-jenkins-ci-tidb/sqllogic-test-1 🟢 all 26 tests passed 4 min 2 sec Existing passed
idc-jenkins-ci-tidb/mybatis-test 🟢 all 1 tests passed 2 min 59 sec Existing passed
idc-jenkins-ci-tidb/integration-compatibility-test 🟢 all 1 tests passed 2 min 45 sec Existing passed
idc-jenkins-ci-tidb/plugin-test 🟢 build success, plugin test success 4min Existing passed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
release-note-none size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. status/can-merge Indicates a PR has been approved by a committer. status/LGT2 Indicates that a PR has LGTM 2.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

7 participants