Support multi-engine per table (batching) #113

kennytm · 2018-12-31T07:48:36Z

What problem does this PR solve?

Implements RFC 3 (a.k.a. Batching).

What is changed and how it works?

Decoupled 1 table = 1 engine. Now one table can produce multiple engines, partitioned by a batch-size, allowing import of table partially. See the design document above for details.

Check List

Tests

Unit test
Integration test
Manual test (add detailed scripts or steps below)
- Tested on the 10T machines

Code changes

Has exported function/method change
Has exported variable/fields change
Has interface methods change
Has persistent data change

Side effects

Possible performance regression
Increased code complexity
Breaking backward compatibility

Related changes

Need to update the documentation
Need to update the tidb-ansible repository
Need to be included in the release note

sre-bot · 2018-12-31T07:48:39Z

Hi contributor, thanks for your PR.

This patch needs to be approved by someone of admins. They should reply with "/ok-to-test" to accept this PR for running test automatically.

kennytm · 2018-12-31T08:23:34Z

/run-all-tests

cmd/tidb-lightning-ctl/main.go

Removed `[tikv-importer] batch-size` to avoid confusion. Removed `[mydumper] min-region-size` since it is useless now.

kennytm · 2019-01-02T14:29:46Z

/run-all-tests

lightning/kv/importer.go

lightning/mydump/region.go

IANTHEREAL · 2019-01-02T13:04:08Z

lightning/restore/file_checkpoints.proto

    uint32 status = 3;
    int64 alloc_base = 4;
+    repeated EngineCheckpointModel engines = 6;


why not use map, a nature way

Engine IDs are assigned sequentially, so making it an array is more natural I think (a map would be just mapping 0, 1, 2, 3, ... to engines).

you must make sure Engine IDs are assigned sequentially, this logic should be written in design document

OK, Updated the design doc.

IANTHEREAL · 2019-01-02T13:04:32Z

lightning/restore/checkpoints.go

+			if err := engineRows.Scan(&engineID, &status); err != nil {
+				return errors.Trace(err)
+			}
+			for len(cp.Engines) <= engineID {


why not use map too?

tests/checkpoint_engines/config.toml

IANTHEREAL · 2019-01-02T13:18:58Z

tests/checkpoint_engines/data/cpeng-schema-create.sql

@@ -0,0 +1 @@
+create database cpeng;


what's cpeng? 😄

Check-Point — Engines 🤔

IANTHEREAL · 2019-01-02T14:13:18Z

lightning/restore/checkpoints.go

 				&value.Chunk.Offset, &value.Chunk.EndOffset, &value.Chunk.PrevRowIDMax, &value.Chunk.RowIDMax,
 				&kvcBytes, &kvcKVs, &kvcChecksum,
 			); err != nil {
 				return errors.Trace(err)
 			}
 			value.Checksum = verify.MakeKVChecksum(kvcBytes, kvcKVs, kvcChecksum)
-			cp.Chunks = append(cp.Chunks, value)
+			cp.Engines[engineID].Chunks = append(cp.Engines[engineID].Chunks, value)


is chunks sorted as comment at L101?

Yes... a subslice of a sorted slice is still sorted 😁

kennytm · 2019-01-02T15:31:10Z

/run-all-tests

lightning/kv/importer.go

tests/checkpoint_engines/mysql.toml

lonng · 2019-01-03T06:03:49Z

LGTM

IANTHEREAL · 2019-01-03T08:45:51Z

/run-all-tests

IANTHEREAL · 2019-01-03T09:18:39Z

LGTM

kennytm · 2019-01-03T10:00:58Z

I'll merge after confirming whether 10G is a good default size. Maybe this needs to be larger (the previous default was 500G).

* mydump: non-uniform batch size * *: make the `batch-size-scale` configurable * *: implemented the optimized non-uniform strategy * tests: due to change of strategy, checkpoint_engines count becomes 4 again * mydump/region: slightly adjust the batch size computation * Use the exact result of 1/Beta(N, R) instead of an approximation * When the number of engines is small and the total engine size of the first (table-concurrency) batches exceed the table size, the last batch was truncated, and disrupt the pipeline. Now in these case we will reduce the batch size to avoid this disruption. * restore: log the SQL size and KV size of each engine for debugging * config: change default batch size and ratio given experiment result * config: added more explanation about batch-import-ratio

kennytm added status/WIP Work in progress Should Update Docs Should update docs after this PR is merged. Remove this label once the docs are updated priority/important type/feature New feature labels Dec 31, 2018

kennytm changed the title ~~[WIP] Support multi-engine per table (batching)~~ Support multi-engine per table (batching) Dec 31, 2018

kennytm added status/PTAL This PR is ready for review. Add this label back after committing new changes and removed status/WIP Work in progress labels Dec 31, 2018

lonng reviewed Jan 2, 2019

View reviewed changes

cmd/tidb-lightning-ctl/main.go Show resolved Hide resolved

kennytm added 10 commits January 2, 2019 21:55

config,restore: introduced [mydumper] batch-size

44dc8d1

Removed `[tikv-importer] batch-size` to avoid confusion. Removed `[mydumper] min-region-size` since it is useless now.

restore,mydump: pre-allocate engine IDs

e61bacf

restore: separate table checkpoints and engine checkpoints

8e00a50

importer: stop exposing the UUID

c621266

checkpoints: make checkpoint diff understand 1 table = many engines

142dd9a

checkpoints: make file checkpoints recognize multiple engines

8b2498c

checkpoints: migrated MySQL-based checkpoint to multi-engine as well

ed0471a

restore: adapt restore workflow for multi-engine

485fc9d

tests: added test case for multi-engine

cc34c4e

*: fixed code

c7f0a37

kennytm force-pushed the kennytm/batching branch from bb752cc to 54de1aa Compare January 2, 2019 14:14

IANTHEREAL reviewed Jan 2, 2019

View reviewed changes

*: addressed comments

294f228

kennytm force-pushed the kennytm/batching branch from 54de1aa to 294f228 Compare January 2, 2019 15:27

lonng suggested changes Jan 3, 2019

View reviewed changes

lightning/kv/importer.go Outdated Show resolved Hide resolved

lonng reviewed Jan 3, 2019

View reviewed changes

tests/checkpoint_engines/mysql.toml Outdated Show resolved Hide resolved

lonng approved these changes Jan 3, 2019

View reviewed changes

kennytm force-pushed the kennytm/batching branch from 59f529c to a2274b8 Compare January 3, 2019 05:52

*: addressed comments

92e8dad

kennytm force-pushed the kennytm/batching branch from a2274b8 to 92e8dad Compare January 3, 2019 06:02

kennytm added status/LGT1 One reviewer already commented LGTM (LGTM1) and removed status/PTAL This PR is ready for review. Add this label back after committing new changes labels Jan 3, 2019

IANTHEREAL added status/LGT2 Two reviewers already commented LGTM, ready for merge (LGTM2) and removed status/LGT1 One reviewer already commented LGTM (LGTM1) labels Jan 3, 2019

kennytm mentioned this pull request Jan 9, 2019

Support non-uniform batch size #114

Merged

kennytm merged commit f73d0f9 into master Jan 14, 2019

kennytm deleted the kennytm/batching branch January 14, 2019 12:17

kennytm removed the Should Update Docs Should update docs after this PR is merged. Remove this label once the docs are updated label Mar 11, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support multi-engine per table (batching) #113

Support multi-engine per table (batching) #113

kennytm commented Dec 31, 2018 •

edited

sre-bot commented Dec 31, 2018

kennytm commented Dec 31, 2018

kennytm commented Jan 2, 2019

IANTHEREAL Jan 2, 2019

kennytm Jan 2, 2019

IANTHEREAL Jan 3, 2019

kennytm Jan 3, 2019 •

edited

IANTHEREAL Jan 2, 2019

IANTHEREAL Jan 2, 2019

kennytm Jan 2, 2019

IANTHEREAL Jan 2, 2019

kennytm Jan 2, 2019

kennytm commented Jan 2, 2019

lonng commented Jan 3, 2019

IANTHEREAL commented Jan 3, 2019

IANTHEREAL commented Jan 3, 2019

kennytm commented Jan 3, 2019

Support multi-engine per table (batching) #113

Support multi-engine per table (batching) #113

Conversation

kennytm commented Dec 31, 2018 • edited

What problem does this PR solve?

What is changed and how it works?

Check List

sre-bot commented Dec 31, 2018

kennytm commented Dec 31, 2018

kennytm commented Jan 2, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kennytm Jan 3, 2019 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kennytm commented Jan 2, 2019

lonng commented Jan 3, 2019

IANTHEREAL commented Jan 3, 2019

IANTHEREAL commented Jan 3, 2019

kennytm commented Jan 3, 2019

kennytm commented Dec 31, 2018 •

edited

kennytm Jan 3, 2019 •

edited