executor: add cache for approximate table count #44979
Merged
16 commits
609a203 executor: add chache for approximate table count (hawkingrei)
5a9ed95 executor: add chache for approximate table count (hawkingrei)
3774db8 executor: add chache for approximate table count (hawkingrei)
92b8981 executor: add chache for approximate table count (hawkingrei)
88efd98 executor: add chache for approximate table count (hawkingrei)
1f0ed99 update (hawkingrei)
1390485 update (hawkingrei)
cddfc2d update (hawkingrei)
bcd23f2 update (hawkingrei)
dd5fe87 update (hawkingrei)
ef4a0dd tmp (hawkingrei)
fac150b update (hawkingrei)
61680c3 update (hawkingrei)
a703d36 update (hawkingrei)
c12ea26 update (hawkingrei)
c326425 update (hawkingrei)
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,30 @@ | ||
load("@io_bazel_rules_go//go:def.bzl", "go_library", "go_test") | ||
|
||
go_library( | ||
name = "pdhelper", | ||
srcs = ["pd.go"], | ||
importpath = "github.com/pingcap/tidb/executor/internal/pdhelper", | ||
visibility = ["//executor:__subpackages__"], | ||
deps = [ | ||
"//kv", | ||
"//sessionctx", | ||
"//store/helper", | ||
"//util", | ||
"//util/sqlexec", | ||
"@com_github_jellydator_ttlcache_v3//:ttlcache", | ||
"@com_github_pingcap_failpoint//:failpoint", | ||
], | ||
) | ||
|
||
go_test( | ||
name = "pdhelper_test", | ||
timeout = "short", | ||
srcs = ["pd_test.go"], | ||
embed = [":pdhelper"], | ||
flaky = True, | ||
deps = [ | ||
"//sessionctx", | ||
"@com_github_jellydator_ttlcache_v3//:ttlcache", | ||
"@com_github_stretchr_testify//require", | ||
], | ||
) |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,124 @@ | ||
// Copyright 2023 PingCAP, Inc. | ||
// | ||
// Licensed under the Apache License, Version 2.0 (the "License"); | ||
// you may not use this file except in compliance with the License. | ||
// You may obtain a copy of the License at | ||
// | ||
// http://www.apache.org/licenses/LICENSE-2.0 | ||
// | ||
// Unless required by applicable law or agreed to in writing, software | ||
// distributed under the License is distributed on an "AS IS" BASIS, | ||
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
// See the License for the specific language governing permissions and | ||
// limitations under the License. | ||
|
||
package pdhelper | ||
|
||
import ( | ||
"context" | ||
"strconv" | ||
"strings" | ||
"sync" | ||
"time" | ||
|
||
"github.com/jellydator/ttlcache/v3" | ||
"github.com/pingcap/failpoint" | ||
"github.com/pingcap/tidb/kv" | ||
"github.com/pingcap/tidb/sessionctx" | ||
"github.com/pingcap/tidb/store/helper" | ||
"github.com/pingcap/tidb/util" | ||
"github.com/pingcap/tidb/util/sqlexec" | ||
) | ||
|
||
// GlobalPDHelper is the global variable for PDHelper. | ||
var GlobalPDHelper = defaultPDHelper() | ||
var globalPDHelperOnce sync.Once | ||
|
||
// PDHelper is used to get some information from PD. | ||
type PDHelper struct { | ||
wg util.WaitGroupWrapper | ||
cacheForApproximateTableCountFromStorage *ttlcache.Cache[string, float64] | ||
|
||
getApproximateTableCountFromStorageFunc func(sctx sessionctx.Context, tid int64, dbName, tableName, partitionName string) (float64, bool) | ||
} | ||
|
||
func defaultPDHelper() *PDHelper { | ||
cache := ttlcache.New[string, float64]( | ||
ttlcache.WithTTL[string, float64](30*time.Second), | ||
ttlcache.WithCapacity[string, float64](1024*1024), | ||
) | ||
return &PDHelper{ | ||
cacheForApproximateTableCountFromStorage: cache, | ||
getApproximateTableCountFromStorageFunc: getApproximateTableCountFromStorage, | ||
} | ||
} | ||
|
||
// Start is used to start the background task of PDHelper. Currently, the background task is used to clean up TTL cache. | ||
func (p *PDHelper) Start() { | ||
globalPDHelperOnce.Do(func() { | ||
p.wg.Run(p.cacheForApproximateTableCountFromStorage.Start) | ||
|
||
}) | ||
} | ||
|
||
// Stop stops the background task of PDHelper. | ||
func (p *PDHelper) Stop() { | ||
p.cacheForApproximateTableCountFromStorage.Stop() | ||
p.wg.Wait() | ||
} | ||
|
||
func approximateTableCountKey(tid int64, dbName, tableName, partitionName string) string { | ||
return strings.Join([]string{strconv.FormatInt(tid, 10), dbName, tableName, partitionName}, "_") | ||
} | ||
|
||
// GetApproximateTableCountFromStorage gets the approximate count of the table. | ||
func (p *PDHelper) GetApproximateTableCountFromStorage(sctx sessionctx.Context, tid int64, dbName, tableName, partitionName string) (float64, bool) { | ||
key := approximateTableCountKey(tid, dbName, tableName, partitionName) | ||
if item := p.cacheForApproximateTableCountFromStorage.Get(key); item != nil { | ||
|
||
return item.Value(), true | ||
} | ||
result, hasPD := p.getApproximateTableCountFromStorageFunc(sctx, tid, dbName, tableName, partitionName) | ||
p.cacheForApproximateTableCountFromStorage.Set(key, result, ttlcache.DefaultTTL) | ||
return result, hasPD | ||
} | ||
|
||
func getApproximateTableCountFromStorage(sctx sessionctx.Context, tid int64, dbName, tableName, partitionName string) (float64, bool) { | ||
tikvStore, ok := sctx.GetStore().(helper.Storage) | ||
if !ok { | ||
return 0, false | ||
} | ||
regionStats := &helper.PDRegionStats{} | ||
pdHelper := helper.NewHelper(tikvStore) | ||
err := pdHelper.GetPDRegionStats(tid, regionStats, true) | ||
failpoint.Inject("calcSampleRateByStorageCount", func() { | ||
// Force TiDB to think that PD is available and that the region count is small. | ||
err = nil | ||
regionStats.Count = 1 | ||
// Set a very large approximate count. | ||
regionStats.StorageKeys = 1000000 | ||
}) | ||
if err != nil { | ||
return 0, false | ||
} | ||
// If this table is not small, we use the count from PD directly. | ||
// We cannot do so for a small table, because its data may share a region with part of another, larger table. | ||
// Thus, we use the number of regions holding the table's KV data to decide whether the table is small. | ||
if regionStats.Count > 2 { | ||
return float64(regionStats.StorageKeys), true | ||
} | ||
// Otherwise, we use count(*) to calculate its size: the table is very small, so its data fits in no more than 2 regions. | ||
sql := new(strings.Builder) | ||
sqlexec.MustFormatSQL(sql, "select count(*) from %n.%n", dbName, tableName) | ||
if partitionName != "" { | ||
sqlexec.MustFormatSQL(sql, " partition(%n)", partitionName) | ||
} | ||
ctx := kv.WithInternalSourceType(context.Background(), kv.InternalTxnStats) | ||
rows, _, err := sctx.(sqlexec.RestrictedSQLExecutor).ExecRestrictedSQL(ctx, nil, sql.String()) | ||
if err != nil { | ||
return 0, false | ||
} | ||
// If no rows are returned, something went wrong with the execution, because COUNT(*) always returns one row. | ||
if len(rows) == 0 || rows[0].Len() == 0 { | ||
return 0, false | ||
} | ||
return float64(rows[0].GetInt64(0)), true | ||
} |
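The read-through flow in `GetApproximateTableCountFromStorage` can be sketched with the standard library alone. This is a minimal illustration, not the code merged in this PR: `countCache`, `cacheKey`, and `demoLoaderCalls` are hypothetical names, and a plain map with expiry timestamps stands in for `ttlcache`. The sketch also makes one deliberate change, as an assumption about what a caller might prefer: a failed load (second return value `false`) is not stored, whereas the code in the diff stores the result unconditionally and reports `true` on every later cache hit.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
	"sync"
	"time"
)

// entry is one cached count together with its expiry time.
type entry struct {
	value    float64
	expireAt time.Time
}

// countCache is a hypothetical, minimal stand-in for the TTL cache in the diff.
type countCache struct {
	mu    sync.Mutex
	ttl   time.Duration
	items map[string]entry
	// loader plays the role of getApproximateTableCountFromStorageFunc.
	loader func(tid int64, db, table, partition string) (float64, bool)
}

func newCountCache(ttl time.Duration, loader func(int64, string, string, string) (float64, bool)) *countCache {
	return &countCache{ttl: ttl, items: make(map[string]entry), loader: loader}
}

// cacheKey mirrors approximateTableCountKey from the diff.
func cacheKey(tid int64, db, table, partition string) string {
	return strings.Join([]string{strconv.FormatInt(tid, 10), db, table, partition}, "_")
}

// Get returns a fresh cached count if one exists; otherwise it calls the loader.
// Assumption of this sketch: a failed load (ok == false) is not cached.
// The lock is held during the load to keep the example short.
func (c *countCache) Get(tid int64, db, table, partition string) (float64, bool) {
	key := cacheKey(tid, db, table, partition)
	c.mu.Lock()
	defer c.mu.Unlock()
	if e, found := c.items[key]; found && time.Now().Before(e.expireAt) {
		return e.value, true
	}
	v, ok := c.loader(tid, db, table, partition)
	if ok {
		c.items[key] = entry{value: v, expireAt: time.Now().Add(c.ttl)}
	}
	return v, ok
}

// demoLoaderCalls replays four lookups and returns how often the loader ran.
func demoLoaderCalls() int {
	calls := 0
	c := newCountCache(time.Minute, func(tid int64, _, _, _ string) (float64, bool) {
		calls++
		if tid == 2 {
			return 0, false // simulate a storage failure
		}
		return 42, true
	})
	c.Get(1, "db", "t", "") // miss: loader runs
	c.Get(1, "db", "t", "") // hit: loader is skipped
	c.Get(2, "db", "u", "") // failed load: nothing is cached
	c.Get(2, "db", "u", "") // loader runs again
	return calls
}

func main() {
	fmt.Println(demoLoaderCalls()) // prints 3
}
```

Whether failures should be cached is a design choice: caching them shields PD from repeated requests during an outage, while skipping them lets the next caller retry immediately.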
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,67 @@ | ||
// Copyright 2023 PingCAP, Inc. | ||
// | ||
// Licensed under the Apache License, Version 2.0 (the "License"); | ||
// you may not use this file except in compliance with the License. | ||
// You may obtain a copy of the License at | ||
// | ||
// http://www.apache.org/licenses/LICENSE-2.0 | ||
// | ||
// Unless required by applicable law or agreed to in writing, software | ||
// distributed under the License is distributed on an "AS IS" BASIS, | ||
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
// See the License for the specific language governing permissions and | ||
// limitations under the License. | ||
|
||
package pdhelper | ||
|
||
import ( | ||
"testing" | ||
"time" | ||
|
||
"github.com/jellydator/ttlcache/v3" | ||
"github.com/pingcap/tidb/sessionctx" | ||
"github.com/stretchr/testify/require" | ||
) | ||
|
||
var globalMockClient mockClient | ||
|
||
type mockClient struct { | ||
missCnt int | ||
} | ||
|
||
func (m *mockClient) getMissCnt() int { | ||
return m.missCnt | ||
} | ||
|
||
func (m *mockClient) getFakeApproximateTableCountFromStorage(_ sessionctx.Context, _ int64, _, _, _ string) (float64, bool) { | ||
m.missCnt++ | ||
return 1.0, true | ||
} | ||
|
||
func TestTTLCache(t *testing.T) { | ||
cache := ttlcache.New[string, float64]( | ||
ttlcache.WithTTL[string, float64](100*time.Millisecond), | ||
ttlcache.WithCapacity[string, float64](2), | ||
) | ||
helper := &PDHelper{ | ||
cacheForApproximateTableCountFromStorage: cache, | ||
getApproximateTableCountFromStorageFunc: globalMockClient.getFakeApproximateTableCountFromStorage, | ||
} | ||
helper.GetApproximateTableCountFromStorage(nil, 1, "db", "table", "partition") // Miss | ||
require.Equal(t, 1, globalMockClient.getMissCnt()) | ||
helper.GetApproximateTableCountFromStorage(nil, 1, "db", "table", "partition") // Hit | ||
require.Equal(t, 1, globalMockClient.getMissCnt()) | ||
helper.GetApproximateTableCountFromStorage(nil, 2, "db1", "table1", "partition") // Miss | ||
require.Equal(t, 2, globalMockClient.getMissCnt()) | ||
helper.GetApproximateTableCountFromStorage(nil, 3, "db2", "table2", "partition") // Miss | ||
helper.GetApproximateTableCountFromStorage(nil, 1, "db", "table", "partition") // Miss | ||
require.Equal(t, 4, globalMockClient.getMissCnt()) | ||
helper.GetApproximateTableCountFromStorage(nil, 3, "db2", "table2", "partition") // Hit | ||
require.Equal(t, 4, globalMockClient.getMissCnt()) | ||
time.Sleep(200 * time.Millisecond) | ||
// After the TTL has elapsed, every lookup is a miss. | ||
helper.GetApproximateTableCountFromStorage(nil, 1, "db", "table", "partition") | ||
helper.GetApproximateTableCountFromStorage(nil, 2, "db1", "table1", "partition") | ||
helper.GetApproximateTableCountFromStorage(nil, 3, "db2", "table2", "partition") | ||
require.Equal(t, 7, globalMockClient.getMissCnt()) | ||
} |
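The miss counts asserted in `TestTTLCache` before the sleep are consistent with a capacity-2 cache that evicts the least recently used key. The sketch below replays the same access order against a small hand-written LRU built on `container/list`. It is an illustration under that assumption about the eviction policy, not the `ttlcache` implementation, and the names `lru`, `touch`, and `countMisses` are hypothetical.

```go
package main

import (
	"container/list"
	"fmt"
)

// lru is a hypothetical fixed-capacity cache that evicts the least recently used key.
type lru struct {
	capacity int
	order    *list.List               // front = most recently used
	items    map[string]*list.Element // key -> list element holding that key
}

func newLRU(capacity int) *lru {
	return &lru{capacity: capacity, order: list.New(), items: make(map[string]*list.Element)}
}

// touch reports whether key was already cached (a hit). On a miss it inserts
// the key, first evicting the least recently used key if the cache is full.
func (c *lru) touch(key string) bool {
	if el, ok := c.items[key]; ok {
		c.order.MoveToFront(el)
		return true
	}
	if c.order.Len() >= c.capacity {
		oldest := c.order.Back()
		c.order.Remove(oldest)
		delete(c.items, oldest.Value.(string))
	}
	c.items[key] = c.order.PushFront(key)
	return false
}

// countMisses replays a sequence of lookups and returns how many of them missed.
func countMisses(capacity int, keys []string) int {
	c := newLRU(capacity)
	misses := 0
	for _, k := range keys {
		if !c.touch(k) {
			misses++
		}
	}
	return misses
}

func main() {
	// Same order of table IDs as the test before the sleep: 1, 1, 2, 3, 1, 3.
	fmt.Println(countMisses(2, []string{"1", "1", "2", "3", "1", "3"})) // prints 4
}
```

Inserting table 3 evicts table 1, so the following lookup of table 1 misses and in turn evicts table 2. Table 3 is still cached at that point, which is why the last lookup before the sleep is a hit and the counter stays at 4.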
I suggest adding some comments, since we are introducing the new concept of "backend components" to this package.
Done