
Create index on a small table is slow #49477

Closed · tangenta opened this issue Dec 14, 2023 · 2 comments · Fixed by #49479
@tangenta (Contributor) commented:

Enhancement

use test;
drop table if exists t;
create table t (a int);
insert into t values (1);
alter table t add index i(a);

Output:

mysql> alter table t add index i(a);
Query OK, 0 rows affected (2.75 sec)

Log:

[WARN] [region_job.go:531] ["meet error and handle the job later"] ["job stage"=needRescan] [error="[Lightning:KV:EpochNotMatch]EpochNotMatch current epoch of region 22 is conf_ver: 1 version: 66"] [] [start=74800000000000006A5F698000000000000001038000000000000001038000000000000001] [end=74800000000000006A5F69800000000000000103800000000000000103800000000000000100]

You will almost certainly encounter an "EpochNotMatch" error when adding an index, and then wait at least two seconds for the retry backoff:

// max retry backoff time: 2+4+8+16+30*26=810s
sleepSecond := math.Pow(2, float64(job.retryCount))
if sleepSecond > float64(maxRetryBackoffSecond) {
	sleepSecond = float64(maxRetryBackoffSecond)
}
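
For reference, a self-contained sketch of that backoff arithmetic (assuming maxRetryBackoffSecond is 30 and a cap of 30 retries, which is what the 2+4+8+16+30*26=810s comment implies):

package main

import (
	"fmt"
	"math"
)

// Assumed values, inferred from the "2+4+8+16+30*26=810s" comment above.
const (
	maxRetryBackoffSecond = 30
	maxRetries            = 30
)

func main() {
	total := 0.0
	for retryCount := 1; retryCount <= maxRetries; retryCount++ {
		sleepSecond := math.Pow(2, float64(retryCount))
		if sleepSecond > float64(maxRetryBackoffSecond) {
			sleepSecond = float64(maxRetryBackoffSecond)
		}
		total += sleepSecond
	}
	fmt.Printf("worst-case total backoff: %.0fs\n", total) // 810s
}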

Maybe we can improve this by synchronizing the region splitting, i.e. waiting for the split to finish before importing.

@tangenta (Contributor, Author) commented:

TiDB Lightning needs to stop the scheduling of the corresponding region before importing (see PauseSchedulersByKeyRange) to improve stability.

The method is to post a label rule (tidb/br/pkg/pdutil/pd.go, lines 1020 to 1026 at eb69dac):

rule := LabelRule{
	ID: uuid.New().String(),
	Labels: []RegionLabel{{
		Key:   "schedule",
		Value: "deny",
		TTL:   ttl.String(),
	}},

When PD detects a new rule, it generates a labeler-split-region operator. However, this process is asynchronous, which means the region may be split during the import. If the corresponding range is no longer on the original region, the request may encounter an epoch-not-match error.
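
For illustration only (not code from TiDB or TiKV), a minimal sketch of why the stale request is rejected, using the conf_ver/version fields from kvproto's metapb.RegionEpoch: the split bumps the region's version, so a request built against the pre-split epoch no longer matches.

package main

import (
	"fmt"

	"github.com/pingcap/kvproto/pkg/metapb"
)

// epochStale reports whether a cached region epoch is older than the current one,
// mirroring the comparison behind "EpochNotMatch" rejections (illustrative only).
func epochStale(cached, current *metapb.RegionEpoch) bool {
	return cached.GetVersion() < current.GetVersion() ||
		cached.GetConfVer() < current.GetConfVer()
}

func main() {
	// Epoch the importer cached before the label rule was applied (hypothetical values).
	cached := &metapb.RegionEpoch{ConfVer: 1, Version: 65}
	// After the asynchronous labeler-split-region, the version is bumped,
	// e.g. "conf_ver: 1 version: 66" as in the log above.
	current := &metapb.RegionEpoch{ConfVer: 1, Version: 66}

	fmt.Println(epochStale(cached, current)) // true: the request is rejected with EpochNotMatch
}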

@tangenta (Contributor, Author) commented:

A possible solution is to synchronize in the same way as Backend.waitForScatterRegions(): fetch the PD operator periodically; if it is labeler-split-region and its status is Success, the region split is considered complete.

However, there is a corner case: if the region ID is unchanged, we may get an outdated PD operator left over from a previous add-index job. This is not an uncommon situation in integration testing.
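
A rough sketch of what that polling could look like. Everything here is an assumption rather than an existing TiDB API: getRegionOperator stands in for querying PD's operator endpoint for the region, the operator fields are simplified, and filtering by creation time is one possible way to discard the outdated operator mentioned above.

package main

import (
	"context"
	"errors"
	"fmt"
	"strings"
	"time"
)

// regionOperator is a simplified, assumed view of the PD operator for one region.
type regionOperator struct {
	Desc      string    // e.g. "labeler-split-region"
	Status    string    // e.g. "RUNNING", "SUCCESS" (exact values are an assumption)
	CreatedAt time.Time // used to skip operators left over from earlier jobs
}

// getRegionOperator is a hypothetical helper that would query PD for the
// current operator on regionID (e.g. through PD's HTTP API).
func getRegionOperator(ctx context.Context, regionID uint64) (*regionOperator, error) {
	return nil, errors.New("not implemented in this sketch")
}

// waitForLabelerSplit polls PD until a labeler-split-region operator created after
// jobStart reports success, so the import only starts once the split triggered by
// the label rule has actually finished.
func waitForLabelerSplit(ctx context.Context, regionID uint64, jobStart time.Time) error {
	ticker := time.NewTicker(500 * time.Millisecond)
	defer ticker.Stop()
	for {
		select {
		case <-ctx.Done():
			return ctx.Err()
		case <-ticker.C:
			op, err := getRegionOperator(ctx, regionID)
			if err != nil || op == nil {
				continue // transient error or no operator yet; keep polling
			}
			// Corner case from above: the region ID may be unchanged, so ignore
			// operators created before this add-index job started.
			if op.CreatedAt.Before(jobStart) {
				continue
			}
			if op.Desc == "labeler-split-region" && strings.EqualFold(op.Status, "success") {
				return nil
			}
		}
	}
}

func main() {
	ctx, cancel := context.WithTimeout(context.Background(), 5*time.Second)
	defer cancel()
	fmt.Println(waitForLabelerSplit(ctx, 22, time.Now()))
}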
