feat: faster regional queries #1171

hperl · 2023-01-03T08:42:42Z

This PR accelerates the database queries when running on a globally-distributed Cockroach cluster.

Related issue(s)

Fixes #1167
Relates to https://github.com/ory-corp/cloud/issues/3668
Fixes https://github.com/ory-corp/cloud/issues/3680

Checklist

I have read the contributing guidelines.
I have referenced an issue containing the design document if my change
introduces a new feature.
I am following the
contributing code guidelines.
I have read the security policy.
I confirm that this pull request does not address a security
vulnerability. If this pull request addresses a security vulnerability, I
confirm that I got the approval (please contact
security@ory.sh) from the maintainers to push
the changes.
I have added tests that prove my fix is effective or that my feature
works.
I have added or changed the documentation.

Further Comments

Further improvements

Optimize subject-set expand queries: multiple hops in one query
Optimize computed userset queries: multiple hops in one query

Query fusing in the check eninge

During one check, we currently execute a lot of queries in parallel that could be fused into one. For example, take query for a namespace with $n$ computed userset rewrites. These currently result in $2\cdot n$ queries (one for the direct access, and one for the subject set expansion, respectively).

Ideally, we want to combine the queries to have one of

Either one single query for each of the traversals:
- subject set expansion
- computed userset rewrites
- tuple to userset rewrites
or one big query combining the above.

In both cases, the queries should work on sets of starting tuples, so that multiple paths can be explored in parallel by the database. Of course, you always start with a single node (the query), but that might lead to multiple intermediate nodes that should be checked in parallel, so that each successive query expands the frontier of the traversed graph until either a tuple is found in the DB or the maximum depth is reached.

Traversal results

In order to be compatible with each other, all graph traversal queries are written such that they return the exact same columns and can be scanned into a common TraversalResult.

type (
	TraversalResult struct {
		From  RelationTuple
		To    RelationTuple
		Via   Traversal
		Found bool
	}

	Traversal string
)

Before/after the computed userset optimization:

name                                 old time/op     new time/op     delta
ComputedUsersets/Computed_userset-8      583µs ± 0%       81µs ± 0%   ~     (p=1.000 n=1+1)

name                                 old queries/op  new queries/op  delta
ComputedUsersets/Computed_userset-8       9.00 ± 0%       1.00 ± 0%   ~     (p=1.000 n=1+1)

name                                 old alloc/op    new alloc/op    delta
ComputedUsersets/Computed_userset-8      109kB ± 0%       18kB ± 0%   ~     (p=1.000 n=1+1)

name                                 old allocs/op   new allocs/op   delta
ComputedUsersets/Computed_userset-8      1.56k ± 0%      0.36k ± 0%   ~     (p=1.000 n=1+1)

aeneasr · 2023-01-03T09:25:28Z

Can you run the formatter fo rthe new year in another pr and then rebase? makes it easier to review

internal/persistence/sql/relationtuples.go

zepatrik

Nice, looking pretty good already 👍

internal/persistence/sql/traverser.go

hperl · 2023-01-09T17:05:21Z

benchstat ./internal/check/2023-01-09-benchtest.txt ./benchtest.new.txt              
name                                 old time/op     new time/op     delta
ComputedUsersets/Computed_userset-8      583µs ± 0%      112µs ± 0%   ~     (p=1.000 n=1+1)

name                                 old queries/op  new queries/op  delta
ComputedUsersets/Computed_userset-8       9.00 ± 0%       2.00 ± 0%   ~     (p=1.000 n=1+1)

name                                 old alloc/op    new alloc/op    delta
ComputedUsersets/Computed_userset-8      109kB ± 0%       24kB ± 0%   ~     (p=1.000 n=1+1)

name                                 old allocs/op   new allocs/op   delta
ComputedUsersets/Computed_userset-8      1.56k ± 0%      0.47k ± 0%   ~     (p=1.000 n=1+1)

Queries went down from 9 to 2 already (and I have an idea about the last redundant one) for the common case of computed userset rewrite checks.

go.mod

internal/check/bench_test.go

aeneasr · 2023-01-12T12:14:35Z

Can you create a PR in cloud that uses this patch and demonstrates it for our queries?

hperl · 2023-01-12T12:49:50Z

Can you create a PR in cloud that uses this patch and demonstrates it for our queries?

done :)

aeneasr

As far as I can tell from our review, this looks good to me. Only a few minor things that should be addressed IMO.

I think it would be good to add some test:

self-referencing OPL definitions (this is the -1 bugfix)
think about potential edge cases for this new set up and write tests
create a regression check list in the cloud PR and test it locally

internal/persistence/sql/traverser.go

internal/check/engine.go

internal/check/rewrites.go

embedx/config.schema.json

hperl requested a review from zepatrik as a code owner January 3, 2023 08:42

hperl changed the title ~~hperl/faster-regional-queries~~ feat: faster regional queries Jan 3, 2023

hperl self-assigned this Jan 3, 2023

hperl marked this pull request as draft January 3, 2023 08:43

aeneasr reviewed Jan 3, 2023

View reviewed changes

internal/persistence/sql/relationtuples.go Outdated Show resolved Hide resolved

hperl force-pushed the hperl/faster-regional-queries branch 4 times, most recently from e8f9fc7 to eaa4cf9 Compare January 6, 2023 14:20

zepatrik reviewed Jan 9, 2023

View reviewed changes

internal/persistence/sql/traverser.go Outdated Show resolved Hide resolved

hperl force-pushed the hperl/faster-regional-queries branch from d2e490a to b5b7c97 Compare January 9, 2023 17:02

hperl force-pushed the hperl/faster-regional-queries branch 3 times, most recently from 4c39cf1 to 055f6ec Compare January 12, 2023 11:15

hperl marked this pull request as ready for review January 12, 2023 11:20

aeneasr reviewed Jan 12, 2023

View reviewed changes

go.mod Outdated Show resolved Hide resolved

internal/check/bench_test.go Outdated Show resolved Hide resolved

aeneasr reviewed Jan 16, 2023

View reviewed changes

internal/persistence/sql/traverser.go Outdated Show resolved Hide resolved

internal/check/engine.go Show resolved Hide resolved

internal/check/rewrites.go Show resolved Hide resolved

zepatrik force-pushed the hperl/faster-regional-queries branch from 3bf159d to 4ab0113 Compare January 16, 2023 11:57

hperl requested review from zepatrik and aeneasr January 16, 2023 16:11

hperl commented Jan 16, 2023

View reviewed changes

embedx/config.schema.json Show resolved Hide resolved

zepatrik force-pushed the hperl/faster-regional-queries branch from 27757f4 to 32484d4 Compare January 17, 2023 14:59

hperl force-pushed the hperl/faster-regional-queries branch from 32484d4 to 736c9bd Compare January 17, 2023 15:25

feat: use where-exist queries in check direct

3a71213

hperl force-pushed the hperl/faster-regional-queries branch from 736c9bd to 93c8ed6 Compare January 17, 2023 15:26

feat: faster subject set expands

b1701f2

hperl added 5 commits January 17, 2023 17:26

WIP: faster subject set rewrites

1cbab1d

test: benchmark for computed userset queries

06aa3ce

feat: optimize computed userset queries

cc98726

feat: optimize computed userset queries more

8d45cbd

feat: hide check optimizations behind strict_mode flag

f691a2d

zepatrik force-pushed the hperl/faster-regional-queries branch from 93c8ed6 to 5dbe21e Compare January 17, 2023 16:27

zepatrik and others added 11 commits January 17, 2023 17:28

chore: small improvements

54140f3

chore: review comments

cd6000a

test: remove timer resets

3c19fb2

docs: mark strict mode as experimental

9e0bb6c

test: add case for strict mode

38e8217

chore: fix strict mode tests

d7e0ac9

chore: fix package-lock.json by using npm@8

124eaca

test: add more cases

e42f7a0

chore: format

e19a20f

fix: use query builder in TraverserSubjectSetRewrite

7f11af7

fix: rest depth handling

3d1cbbc

zepatrik force-pushed the hperl/faster-regional-queries branch from 5dbe21e to 3d1cbbc Compare January 17, 2023 16:28

zepatrik approved these changes Jan 17, 2023

View reviewed changes

zepatrik merged commit 8e07890 into master Jan 17, 2023

zepatrik deleted the hperl/faster-regional-queries branch January 17, 2023 16:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: faster regional queries #1171

feat: faster regional queries #1171

hperl commented Jan 3, 2023 •

edited

aeneasr commented Jan 3, 2023

zepatrik left a comment

hperl commented Jan 9, 2023

aeneasr commented Jan 12, 2023

hperl commented Jan 12, 2023

aeneasr left a comment

feat: faster regional queries #1171

feat: faster regional queries #1171

Conversation

hperl commented Jan 3, 2023 • edited

Related issue(s)

Checklist

Further Comments

Query fusing in the check eninge

Traversal results

Before/after the computed userset optimization:

aeneasr commented Jan 3, 2023

zepatrik left a comment

Choose a reason for hiding this comment

hperl commented Jan 9, 2023

aeneasr commented Jan 12, 2023

hperl commented Jan 12, 2023

aeneasr left a comment

Choose a reason for hiding this comment

hperl commented Jan 3, 2023 •

edited