
perf(core): Fix performance issue in type filter #9065

Merged
harshil-goel merged 7 commits into release/v23.1 from harshil-goel/fix-type-perf on Apr 12, 2024

Conversation

harshil-goel
Contributor

@harshil-goel harshil-goel commented Apr 5, 2024

Currently, when we run queries like `func(uid: 0x1) @filter(type)`, we retrieve the entire type index. When the index is very big, fetching it is quite slow. We realised that if we only need to check whether a few uids have the given type, we can check those uids directly instead. Right now the uid threshold is hard-coded. This could be improved with a more statistics-based model that takes into account how many items the type index holds and how many uids we actually need to check.
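
As a rough illustration of the idea (a minimal sketch, not the actual Dgraph code; `hasType`, `readTypeIndex`, and `typeFilterUidLimit` are hypothetical names standing in for the real internals):

// Minimal sketch of the optimization: for a small uid list, do point lookups
// instead of reading the whole dgraph.type index.
func filterByType(uids []uint64, typ string,
	hasType func(uid uint64, typ string) bool,
	readTypeIndex func(typ string) map[uint64]bool) []uint64 {

	const typeFilterUidLimit = 10 // hard-coded threshold, as in this PR

	out := make([]uint64, 0, len(uids))
	if len(uids) <= typeFilterUidLimit {
		// Cheap path: a handful of direct type checks.
		for _, uid := range uids {
			if hasType(uid, typ) {
				out = append(out, uid)
			}
		}
		return out
	}

	// Expensive path: fetch the full type index once and intersect with it.
	index := readTypeIndex(typ)
	for _, uid := range uids {
		if index[uid] {
			out = append(out, uid)
		}
	}
	return out
}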

@dgraph-bot dgraph-bot added the area/core (internal mechanisms) and go (Pull requests that update Go code) labels on Apr 5, 2024
@harshil-goel harshil-goel force-pushed the harshil-goel/fix-type-perf branch 2 times, most recently from b31cea7 to 8375b71, on April 10, 2024 09:47
damonfeldman previously approved these changes Apr 12, 2024
posting/list.go Outdated
numNormalPostingsRead := 0
defer func() {
if numNormalPostingsRead < numDeletePostingsRead {
glog.V(3).Infof("During iterate on posting list, we read %d set postings, %d delete postings"+

Can we clarify this message in some way to be more useful to someone who is not familiar with the internal workings? Also include which posting list, if that is available.

I think this means that the badger values representing a posting list (in 256KB chunks IIRC) contained many deleted structures, so more than 50% of the data movement is for non-useful deleted data.

If so, something like: "High proportion of deleted data observed for posting list {l.key}: total = {numNormal + numDeleted}, percent deleted = {numDeleted / (numNormal + numDeleted) * 100}%".
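
For example, the suggestion might translate to roughly the following (illustrative only; it reuses the counters from the diff above and assumes the posting list key is reachable as l.key):

total := numNormalPostingsRead + numDeletePostingsRead
if numDeletePostingsRead > numNormalPostingsRead && total > 0 {
	glog.V(3).Infof("High proportion of deleted data observed for posting list %x: "+
		"total = %d, percent deleted = %.0f%%",
		l.key, total, float64(numDeletePostingsRead)/float64(total)*100)
}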

@@ -48,7 +48,7 @@ const (
  `client_key=; sasl-mechanism=PLAIN; tls=false;`
  LimitDefaults = `mutations=allow; query-edge=1000000; normalize-node=10000; ` +
   `mutations-nquad=1000000; disallow-drop=false; query-timeout=0ms; txn-abort-after=5m; ` +
-  ` max-retries=10;max-pending-queries=10000;shared-instance=false`
+  ` max-retries=10;max-pending-queries=10000;shared-instance=false;type-filter-uid-limit=10`
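
Assuming these LimitDefaults feed the alpha `--limit` superflag (which is my reading of where this default is consumed), the new threshold could presumably be overridden at startup along these lines:

dgraph alpha --limit "type-filter-uid-limit=100"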

Thinking about this... If a customer is large enough to have performance concerns, my guess is that the dgraph.type index is going to be very large (or include some very large index entries that drive the performance profile). If so, perhaps we should optimize for that case and consider what uid-limit balances performance for 1M or more UIDs; I suspect that will be more like 100 or more.

Maybe not something to delay or retest for, but worth considering.

Contributor

Agreed. The number 10 is too small for such an optimization, but we can defer this until we see another case and get some validation of type-filter-uid-limit.

worker/task.go (review thread resolved)
Contributor

@meghalims meghalims left a comment

looks good

@harshil-goel harshil-goel merged commit 724e4db into release/v23.1 Apr 12, 2024
10 checks passed
@harshil-goel harshil-goel deleted the harshil-goel/fix-type-perf branch April 12, 2024 18:57
harshil-goel added a commit that referenced this pull request May 15, 2024
harshil-goel added a commit that referenced this pull request May 15, 2024