Traversal Engine Graph Combinator #6194

haikalpribadi · 2021-02-19T14:43:09Z

Problem to Solve

The type-resolver performance is becoming a detrimental problem to the query engine. For larger "multi-hop" queries, such queries with 7 relationships or higher (that are quite common), the type-resolver may cause a significant slow-down if the schema it touches is diverse/complex. But this behaviour is not expected; the slow-down is caused by the "permutative" nature of traversal engine answers that type resolver depends on. But this is the wrong behaviour designed into type resolvers algorithm.

The fundamental problem with type-resolver is that it is not actually a traditional "graph traversal" in which most user queries are expected to be. The goal of the type-resolver algorithm, is to find "all possible (type) vertices that can form a valid query". Which means, it is actually looking for "all valid combinations of vertices" that can satisfy a given query, as opposed to all valid "permutations" of vertices that can satisfy a given query. This problem is actually a variant of the "connected-components" graph clustering problem, where the exception is that we need to map a vertex in the query to a set of vertices in the answer.

Current Workaround

None.

Proposed Solution

We should not have used GraphIterator engine to implement type-resolver algorithm. We should implement a "graph combinator" algorithm, specifically for type-resolver. In a big graph, we should implement it in a Pregel/BSP style algorithm. But in a small graph such as the schema graph in which the type-resolver operates on (only once for all queries with the same schema structure!), we just need to do a simple DFS!!! And the DFS is NOT equal to GraphIterator algorithm - GraphIterator is a permutation algorithm! The type-resolver is only a combination problem, so we just need to implement a "graph combinator" DFS algorithm in the TraversalEngine.

Additional Information

This feature will solve all the type-resolver performance issues that we see among our users, including those are captured in issue #6155, #6161, and #6183.

The text was updated successfully, but these errors were encountered:

lriuui0x0 · 2021-02-19T15:00:44Z

@haikalpribadi I don't have context on this one, why do we care about combination instead of permutation?

thomaschristopherking · 2021-05-26T08:36:48Z

I'm just commenting to up-vote this, because it's the most important feature for us right now. Currently, for #6304, we need to implement a workaround where we are matching addresses not by their attributes but by a "composite key" attribute that we will hack together.

haikalpribadi · 2021-05-26T09:45:53Z

Yes this is high on our list this right now @thomaschristopherking thanks for sharing!

thomaschristopherking · 2021-07-27T09:20:35Z

hi, @haikalpribadi @flyingsilverfin I noticed that the most recent release doesn't address this feature, as far as I know. Is this likely to come soon/is there any update on this one?

haikalpribadi · 2021-07-29T11:19:40Z

I believe this is next on @flyingsilverfin list of priorities, @thomaschristopherking. So I think it's fair to say it's highly likely to come out int the next release if there isn't any drastic blocker.

ps: correct me if I'm wrong here @flyingsilverfin

flyingsilverfin · 2021-08-02T18:19:05Z

@thomaschristopherking we're starting to design this now and get to work on the implementation. Given that the vacations are peppered around august it's unlikely to get done for some more weeks :)

flyingsilverfin · 2021-09-16T11:40:36Z

Solved with #6431 !!

haikalpribadi added type: feature priority: blocker labels Feb 19, 2021

haikalpribadi self-assigned this Feb 19, 2021

This was referenced Feb 19, 2021

Match query volatility #6155

Closed

Query planning and traversal slowdowns #6161

Closed

Query with a lot of entities and relations between them takes unreasonable time to compute #6183

Closed

flyingsilverfin mentioned this issue Apr 29, 2021

Grakn Crash: Inserting two things and matching one #6304

Closed

flyingsilverfin assigned flyingsilverfin and unassigned haikalpribadi Jun 7, 2021

flyingsilverfin mentioned this issue Jun 29, 2021

Perform rule validation without type checker permutations #6387

Open

haikalpribadi mentioned this issue Aug 5, 2021

Extremely low perfomance of query with many relations #6409

Closed

flyingsilverfin closed this as completed Sep 16, 2021

grabl added the status: solved label Sep 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Traversal Engine Graph Combinator #6194

Traversal Engine Graph Combinator #6194

haikalpribadi commented Feb 19, 2021 •

edited

Loading

lriuui0x0 commented Feb 19, 2021

thomaschristopherking commented May 26, 2021

haikalpribadi commented May 26, 2021

thomaschristopherking commented Jul 27, 2021

haikalpribadi commented Jul 29, 2021 •

edited

Loading

flyingsilverfin commented Aug 2, 2021

flyingsilverfin commented Sep 16, 2021

Traversal Engine Graph Combinator #6194

Traversal Engine Graph Combinator #6194

Comments

haikalpribadi commented Feb 19, 2021 • edited Loading

Problem to Solve

Current Workaround

Proposed Solution

Additional Information

lriuui0x0 commented Feb 19, 2021

thomaschristopherking commented May 26, 2021

haikalpribadi commented May 26, 2021

thomaschristopherking commented Jul 27, 2021

haikalpribadi commented Jul 29, 2021 • edited Loading

flyingsilverfin commented Aug 2, 2021

flyingsilverfin commented Sep 16, 2021

haikalpribadi commented Feb 19, 2021 •

edited

Loading

haikalpribadi commented Jul 29, 2021 •

edited

Loading