Proposal: Support for composed schemas and multi-test validation of schemas #1437

josephschorr · 2023-07-13T21:07:31Z

Background

Schema in SpiceDB is written as a single unit via the WriteSchema call. This is an important requirement for type checking and validation, but comes with a cost of making it harder for different teams using the same unified permission system to manage changes to their own portions of a schema.

Furthermore, the current file format passed to tooling such as Playground, zed validate and the https://github.com/authzed/action-spicedb-validate GitHub Action only supports a single set of test relationships and associated assertions/expected relationships, making the testing of different scenarios much harder.

Proposed Solution

Define a new modified file format for .zed files that allows for the definition of both schema data (caveat and definition), as well as a set of validation tests, each with their own relationships, assertions and expected relationships. This new file format will also support import-ing of other .zed files via a new import keyword.

To create the "final" schema for WriteSchema, a new zed build command will be supplied that performs import resolution, type checks the combined schema, runs the validation tests (if any) and, finally, returns or saves the combined schema to be sent to SpiceDB

Note
The semantics of the WriteSchema call will not change; imports will not be supported via that call, under this proposal.

Proposed File Format Example

// Import statement allows for importing of object type definitions and caveats from other .zed files locally or via git
from .somelocalzedfile import sometype
from "../../someotherfile" import someothertype
from github.com/mycomp/myauthzteam@v2 import user

// Schema can reference an imported type
definition resource { 
  relation viewer: user
}

/**
 * Tests if the document can be viewed directly, or via being a
 * member of the group.
 */
test SomeSortOfThing {
  relationships {
    document:somedoc viewer user:someuser
    team:someteam    member user:anotheruser with `caveat expression goes here`
  }
  assertions {
    document:somedoc     view user:someuser     // Positive assertion
    document:anotherdoc !view user:someuser     // Negative assertion
    document:anotherdoc  view user:someuser when {
      now: “2022-12-21 12:34:00+0000”
    }
  }
  expected {
    document:firstdoc view (
      user:tom, user:fred are viewer of document:firstdoc
      user:sarah                is  member of team:someteam
    ) when {
        now: “2022-12-21 12:34:00+0000”
    }
  }
}

Proposed new and updated commands for zed

zed build

zed build myfile.zed will run a full import, type check, validation and build process, producing a combined schema to be applied via WriteSchema, with all imports resolved and removed, and all test data removed.

zed validate

zed validate myfile.zed - runs all validation tests found, recursively, through the schema file and any imported dependencies
zed validate myfile.zed -run SomeSortOfThing - runs only those validation tests whose name matches SomeSortOfThing.
zed validate myfile.zed -run SomeSortOfThing/assertions/document:somedoc - runs only the single assertion

zed docgen

zed docgen would produce Markdown or other form of documentation from the doc comments found in the schema (see: #735)

Imports

Imports will support two modes:

Local imports in the form from .somelocalfile import ..., which are unversioned
Imports from git in the form from somegithost.com/some/repo@tagname import ... which will always use a specific tag to pull in that schema

Warning
If multiple versions of the same import are used, the system will return an error

The text was updated successfully, but these errors were encountered:

josephschorr · 2023-08-12T04:51:01Z

Related: #497

reify-tanner-stirrat · 2023-08-26T18:56:36Z

One thing that would be really useful for our use case is the ability to merge definition blocks across schema files.

Our schema has a relatively small number (~6) of definitions but a relatively large number (~60) of permissions. We're using the Cloud IAM model from the blog post as a template, so we have a bunch of permissions and relations that are independently defined and independently computed.

It'd be nice to be able to separate our schema by feature, which would see the same 6 definition blocks in each of the schema fragments but different relations, permissions, and assertions in each.

I understand that this doesn't really follow normal import semantics and could end up being a relatively large complexity jump in a filetype where simplicity and duplication might be strongly preferable to flexibility and indirection, though.

josephschorr · 2023-08-26T21:39:40Z

@reify-tanner-stirrat Would you prefer a mixin style approach, ala:

mixin addsfoo {
  relation foo: bar
}

definition someresource {
  with addsfoo
}

or would you just prefer definitions in different files to "squash" together?

reify-tanner-stirrat · 2023-08-27T13:43:56Z

I don't know how different they are, but I'd prefer something like javascript object merge semantics to a mixin:

definition sub_def_1 {
  relation: sub_1
  permission permission_1 = sub_1
}

definition sub_def_2 {
  relation: sub_2
  permission permission_2 = sub_2
}

definition full_definition {
  ...sub_def_1
  ...sub_def_2
}

Duplicates could be overwritten according to the order of declaration to stick with the semantic, or else error out as a collision.

josephschorr · 2023-08-28T00:36:13Z

The problem with the above is now sub_def_1 is also a valid type; that doesn't seem intentional?

reify-tanner-stirrat · 2023-08-28T13:09:10Z

Yeah, I was just thinking about that...

It isn't, and I wouldn't want that.

To be honest, most of our pain points are around 1. needing to cat our schema into a temporary file to validate it because the validate command wants yaml that looks like a playground file and 2. wanting to split tests across multiple files. I'm okay with the size and shape of our schema file as it is.

josephschorr · 2023-08-28T13:13:33Z

Well, that's why I proposed having a mixin keyword; we could still have them compose using ...whatever like you have in your example

reify-tanner-stirrat · 2023-08-28T15:43:31Z

I think a mixin approach would allow us to decompose the schema but not necessarily the tests, because I'd need a subschema composed entirely of mixins to be able to test a subsection of the scheme, so it doesn't feel to me like it's solving the right problem (for us).

josephschorr · 2023-08-28T16:29:27Z

because I'd need a subschema composed entirely of mixins to be able to test a subsection of the scheme

Not sure I follow... mixins won't impact testing. Tests will always run on the full schema

reify-tanner-stirrat · 2023-08-29T15:12:06Z

Ah, so the test blocks can be imported/collated independently of the mixin/definition blocks under this model?

josephschorr · 2023-08-29T15:46:14Z

The test blocks are pulled in and run against the entire merged schema

reify-tanner-stirrat · 2023-08-29T15:55:50Z

Would that mean we can define test files outside of a single schema file and then just import them? I think that would serve us well.

josephschorr · 2023-08-29T16:03:11Z

You could have a .zed file that just imported a bunch of schema files and had tests, yes (or import a file with just tests)

josephschorr · 2023-09-01T17:31:24Z

User Module

type Query {
  user(id: ID!): User
}
 
type User {
  id: ID!
  email: String!
}

Authentication module

extend type Query {
  me: User
}
 
type Mutation {
  login(username: String!, password: String!): User
  signup(username: String!, password: String!): User
}
 
extend type User {
  username: String!
}

Translating to authzed

Using the example of a JIRA-like product.

Questions that come up:

what module imports which other modules
do we need to import specific fields from a module?

I think in the example below, we just need to specify file imports (no field selection) to construct our dependency graph.

Project Module

Filename: project.zed
This is the entrypoint; zed build project.zed
Imports issue.zed

import ./issue.zed

definition project {
  relation member: user
  relation manager: user

  permission manage = manager
}

// I guess we could have a user.zed as well, but you get the idea
definition user { }

Issue Module

Filename: issue.zed
Extends the project definition

extend definition project {
  relation all_issues_manager: user // Can edit all issues
}

definition issue {
  relation project: project
  relation assigned: user

  permission read = project->member
  permission edit = assigned + project->all_issues_manager
}

wscalf · 2023-09-19T19:13:52Z

+1 to @reify-tanner-stirrat 's object merge semantics.

We're working on a model inspired by the Google IAM blog post where each service provider has a subschema that with a domain-specific hierarchy, and has a number of definitions that aggregate relations/permissions from all service providers (roles, rolebindings, projects, etc), and it would be super useful if each service provider's needs could be described in a file, including the relations/permissions that need to be added to shared definitions.

enriquedacostacambio · 2023-12-19T19:26:31Z

I agree with @edorivai, this proposal could really take inspiration from GraphGL Federation. It is a nice way to have separation of concerns for different microservices.

CygnusBill · 2024-03-04T14:21:17Z

The import is really just saying, "I need definitions from ..." I think the intent here is perfect but the arrow created by the import is backward.

Something like this would work:

// project.zed

definition project {
  relation member: user
  relation manager: user

  permission manage = manager
}

definition user { }

And then to reference and extend the above:

// issue.zed

import ./project.zed

extend definition project {
  relation all_issues_manager: user // Can edit all issues
}

definition issue {
  relation project: project
  relation assigned: user

  permission read = project->member
  permission edit = assigned + project->all_issues_manager
}

The aggregation or compile process would accept all of the files at once and should be able to check the validity of all of the references before combining them, as well as globally after they are stitched together. Redefinitions should not be allowed to avoid circumventing upstream definitions. Any changes should be done by the 'owning' team.

Essentially, this makes the extend keyword very simple to think about as it is just saying, "you've seen this before... please add this to it."

A quick pass to prevent circular definitions between imports and this is finished.

I am not a fan of having a source file doing the aggregation. Something like a top-level file containing all of the imports to cause them to be compiled together. Systems where files are more completely independent scale better. Cross-group coordination is not as critical.

amadard · 2024-04-02T19:44:05Z

The import is really just saying, "I need definitions from ..." I think the intent here is perfect but the arrow created by the import is backward.

I like this proposal for extending definitions and adding new definitions. Could you give an example of how it would look for test relationships and assertions?

enriquedacostacambio · 2024-04-03T14:09:08Z

i think what is lacking is that when using a type from another file you should re-declare the relations that you are just "importing to be used" but not adding yourself (in graphql federation this is done by marking them as @external), this way, you can test each schema file in isolation as it is self contained.
for all intents and purposes an individual file should be "usable" without any import.
i really encourage you to look at https://www.apollographql.com/docs/federation/ for inspiration.

josephschorr · 2024-04-03T14:14:13Z

@enriquedacostacambio Unless I'm misunderstanding, that won't work. Consider:

File 1:

definition user  {}

definition organization {
  relation admin: user
  permission can_admin = admin
}

File 2:

from .someorgfile import organization

definition resource {
  relation viewer: user
  relation org: organization
  permission view = viewer + org->can_admin
}

If file 2 was declared without organization actually being present, tests cannot run, because we have no idea how can_admin is defined

enriquedacostacambio · 2024-04-03T14:38:52Z

@josephschorr I was actually proposing the opposite; that you duplicate everything you want to use the other file:

File 1:

definition user  {}

definition organization {
  relation admin: user
  permission can_admin = admin
  // ... a bunch of other things ...
}

File 2:

extends definition organization {
  external relation admin: user
  external permission can_admin = admin
}

definition resource {
  relation viewer: user
  relation org: organization
  permission view = viewer + org->can_admin
}

For imported relations this works fine, for imported permissions it could pull a lot of the schema from the other file.
If you want to avoid that then the assertions need to have a way to specify something like "assuming can_admin is true", and then you can declare the permission names without specifying the rules behind them.

josephschorr · 2024-04-03T14:48:37Z

@enriquedacostacambio The downside of the above approach is that it breaks the ownership: the team supervising organization should be in charge of how can_admin is defined; that's why using an import feels more natural to me: it allows for different owners while also allowing for shared reuse

enriquedacostacambio · 2024-04-03T15:07:31Z

@josephschorr correct, that's the drawback I was referring to. If you want to avoid it, this is the solution I was suggesting:

File 1:

definition user  {}

definition organization {
  relation admin: user
  permission can_admin = admin
  // ... a bunch of other things ...
}

File 2:

extends definition organization {
  external permission can_admin: user
}

definition resource {
  relation viewer: user
  relation org: organization
  permission view = viewer + org->can_admin
}

and then for testing purposes, you would treat the external permission as a relation:

organization:TEST_ORG#can_admin@user:TEST_ADMIN_USER

amadard · 2024-04-03T15:23:37Z

for all intents and purposes an individual file should be "usable" without any import.

I may not be thinking this all the way through, but on its face, it appears this concept conflicts with distributed definitions in composed schemas. If the individual file is fully usable without any import, then it is fully self contained, and doesn't have any dependency on any other schema. But for a distributed definition, I want any "extends" file to be dependent on the imports that build the original definition.

Since we are dealing with YAML files, actually writing a file wouldn't care about the dependencies, it is just text. When working in the playground or testing the schema, any updates to the example resource definition below require the organization.zed file to be imported for the schema to have its dependencies.

File: Organization.zed

definition user  {}

definition organization {
  relation admin: user
  permission can_admin = admin
}

File Resource.zed

imports ./organization.zed

definition resource {
  relation viewer: user
  relation org: organization
  permission view = viewer + org->can_admin
}

enriquedacostacambio · 2024-04-03T15:39:23Z

@amadard I'm sorry I'm probably doing a poor job at explaining. In the approach I'm suggesting you don't even need the organization.zed file in order to load and test the resource.zed schema, you can use it in standalone mode as long as the interpreter drops the extends and external keywords, and interprets external permission as a relation. With this you could microservice teams independently work on/test their own subset of the graph. When SpiceDB wants to load the composed schema it would not do this, but instead just check that the overlapping definitions match.
There are not my original ideas, I'm adapting Apollo federation ideas, their explanations are far better than mine, you might want to check it out.

amadard · 2024-04-03T16:08:16Z

@enriquedacostacambio I did some reading, and I think I'm understanding the concept. I'm comfortable with having the composed schema files dependent on each other, since at the end of the day, they are all going to be stitched together when getting imported in to SpiceDB. It requires the writer of the composed schema to have visibility into what definitions/relations/permissions exist in the imported file, but it simplifies how the composed files are written and keeps it closer to our current syntax.

josephschorr · 2024-04-03T16:28:49Z

@enriquedacostacambio What is your concern around imports? As @amadard mentioned, the schema will have to be combined before written, so having explicitly dependencies should reflect the actual result

enriquedacostacambio · 2024-04-03T16:44:28Z

@josephschorr no concern, i just think the composition approach is better in terms of ownership, for example, the sub-schemas could live in their respective microservice repos without even needing references to other services' sub-schemas, and they would be testable in isolation by that repo's tests. Imports force a monorepo layout which might or might not be the desired architecture of spicedb users.

josephschorr · 2024-04-03T16:51:12Z

@enriquedacostacambio That's actually why the proposal supports git-based imports:

from github.com/mycomp/myauthzteam@v2 import user

That way, you can define your import based on another repository entirely (and properly versioned)

amadard · 2024-04-11T21:00:02Z

@josephschorr I'm not understanding one part of the zed build process. If I have multiple "sub-schemas" that import a single primary file, but don't reference each other, how does the zed build know all of the files that need to be combined to get the final output?

E.g.

File: Organization.zed

definition user  {}

definition organization {
  relation admin: user
  permission can_admin = admin
}

File Resource.zed

imports ./organization.zed

definition resource {
  relation viewer: user
  relation org: organization
  permission view = viewer + org->can_admin
}

File UnrelatedResource.zed

imports ./organization.zed

definition unrelatedresource {
  relation viewer: user
  relation org: organization
  permission view = viewer + org->can_admin
}

The final build needs Organization.zed, Resource.zed AND UnrelatedResource.zed. If I were build with just Resource.zed, it knows to import Organization.zed, but has no knowledge of UnrelatedResource.zed. Does the build need the full list of zed files to import to create the final schema?

josephschorr · 2024-04-13T17:34:55Z

You'd define an overall "permissionsystem.zed" and import those files or give them all to the build command

Does the build need the full list of zed files to import to create the final schema?

Yes, same as any other compilation process

josephschorr · 2024-04-16T21:40:50Z

@enriquedacostacambio Thoughts on the import semantics vs the externs?

enriquedacostacambio · 2024-04-17T15:12:50Z

@josephschorr I still think the extending leads to a cleaner architecture. For example, if I have C imports B imports A, where most likely A, B and C map each to an individual microservice, the extensions approach would let me test C independently, without having to test the whole chain (something I would defer to an end to end test).
My tests can be focus on narrower concerns making the whole permissions system more testable. Plus the tests that my microservices run during build won't even need to have the files for A and B available (or know their appropriate versions), simplifying my setup.
I think we can come up with more benefits that stem from the sub-schema files being usable independently, deferring the need for all files to composition-time, when you actually need to run spicedb with the resulting combined schema.

josephschorr · 2024-04-17T17:58:54Z

@enriquedacostacambio if C is included in B, then it would be testable independently, since dependency is the other direction?

Can you come up with an example where an import of a child-type is not sufficient?

enriquedacostacambio · 2024-04-17T18:08:21Z

Oops, I reversed the direction of the imports. I'll fix the comment, but you get the idea.

josephschorr · 2024-04-17T18:39:27Z

@enriquedacostacambio Gotcha, just making sure :)

So in the case where microservice C depends on B and A, why is the import of the explicitly versioned dependency "heavier" than an externs? IMO the safety afforded outweighs the extra work the compiler has to do to check out the versioned bundle

enriquedacostacambio · 2024-04-17T20:26:06Z

@josephschorr it is heavier because when using extensions, I can test C in isolation giving it a provider state (in the sense used by PACT in contract driven testing) that defines what the values for the relationships/permissions that are external to C, which act as the "contract" between C and B. Using imports, tests in the C domain are forced to know and set up the provider state for all relationships in the graph: A & B.
This doesn't preclude the need for e2e tests on the composed schema, but lets you have more modular development & (unit?) tests.

tin-canva · 2024-08-26T07:19:34Z

A question regarding aliasing or packages:
E.g., suppose two different teams define a resource called user:

// Import statement allows for importing of object type definitions and caveats from other .zed files locally or via git
from "./service1" import user
from "./service2" import user


// Schema can reference an imported type
definition resource { 
  relation viewer: user  // Ambiguous which user we are talking about here
}

It would be great if we could either:
a) Alias (from "./service1" import user as service1_user)
b) Qualify via namespaces (leave import as is) but usage looks like relation viewer: service1.user

Any thoughts/suggestions? An alternative would be to enforce uniqueness of names on the client side, but qualified/aliased imports would be really nice.

josephschorr · 2024-08-26T14:36:20Z

It would be great if we could either: a) Alias (from "./service1" import user as service1_user) b) Qualify via namespaces (leave import as is) but usage looks like relation viewer: service1.user

Supporting as service1_user seems pretty straightforward and powerful

tstirrat15 · 2024-10-18T22:27:56Z

@tin-canva one thing that came up in discussion is that definition names aren't an internal implementation detail, but rather a part of the external API of your schema, because the name of the definition is the name that you're using when you write a relationship. This means that aliasing can't really be used other than as a local convenience without potentially breaking an assumption on the part of the author of the imported schema fragment, and that definition names still need to be globally unique within the system.

josephschorr self-assigned this Jul 13, 2023

josephschorr mentioned this issue Aug 12, 2023

Make schema validation and collaboration easier #1208

Open

This was referenced Sep 13, 2023

Private scope for domain-namespaced definitions #1535

Open

Add contextual relationships on CheckRequest #1398

Open

josephschorr mentioned this issue Sep 20, 2023

Coverage for schema assertions #1476

Open

ecordell mentioned this issue Sep 22, 2023

Proposal: Operator-bootstrapped schema authzed/spicedb-operator#259

Open

josephschorr mentioned this issue Dec 30, 2023

Support validate (and import) taking in a schema directly authzed/zed#321

Merged

josephschorr mentioned this issue Jan 19, 2024

Enable arbitrary nesting of arrow operators #15

Open

tstirrat15 mentioned this issue Oct 25, 2024

Implement parsing of local imports #2103

Merged

tstirrat15 mentioned this issue Nov 8, 2024

Implement first version of composable schemas #2127

Open

Proposal: Support for composed schemas and multi-test validation of schemas #1437

Proposal: Support for composed schemas and multi-test validation of schemas #1437

Comments

josephschorr commented Jul 13, 2023

Background

Proposed Solution

Proposed File Format Example

Proposed new and updated commands for zed

zed build

zed validate

zed docgen

Imports

josephschorr commented Aug 12, 2023

reify-tanner-stirrat commented Aug 26, 2023

josephschorr commented Aug 26, 2023

reify-tanner-stirrat commented Aug 27, 2023 • edited Loading

josephschorr commented Aug 28, 2023

reify-tanner-stirrat commented Aug 28, 2023

josephschorr commented Aug 28, 2023

reify-tanner-stirrat commented Aug 28, 2023 • edited Loading

josephschorr commented Aug 28, 2023

reify-tanner-stirrat commented Aug 29, 2023

josephschorr commented Aug 29, 2023

reify-tanner-stirrat commented Aug 29, 2023

josephschorr commented Aug 29, 2023

josephschorr commented Sep 1, 2023

edorivai commented Sep 8, 2023 • edited Loading

User Module

Authentication module

Translating to authzed

Project Module

Issue Module

wscalf commented Sep 19, 2023

enriquedacostacambio commented Dec 19, 2023 • edited Loading

CygnusBill commented Mar 4, 2024

amadard commented Apr 2, 2024 • edited Loading

enriquedacostacambio commented Apr 3, 2024

josephschorr commented Apr 3, 2024 • edited Loading

enriquedacostacambio commented Apr 3, 2024

josephschorr commented Apr 3, 2024

enriquedacostacambio commented Apr 3, 2024 • edited Loading

amadard commented Apr 3, 2024

enriquedacostacambio commented Apr 3, 2024

amadard commented Apr 3, 2024

josephschorr commented Apr 3, 2024

enriquedacostacambio commented Apr 3, 2024 • edited Loading

josephschorr commented Apr 3, 2024

amadard commented Apr 11, 2024

josephschorr commented Apr 13, 2024

josephschorr commented Apr 16, 2024

enriquedacostacambio commented Apr 17, 2024 • edited Loading

josephschorr commented Apr 17, 2024

enriquedacostacambio commented Apr 17, 2024

josephschorr commented Apr 17, 2024

enriquedacostacambio commented Apr 17, 2024 • edited Loading

tin-canva commented Aug 26, 2024

josephschorr commented Aug 26, 2024

tstirrat15 commented Oct 18, 2024

reify-tanner-stirrat commented Aug 27, 2023 •

edited

Loading

reify-tanner-stirrat commented Aug 28, 2023 •

edited

Loading

edorivai commented Sep 8, 2023 •

edited

Loading

enriquedacostacambio commented Dec 19, 2023 •

edited

Loading

amadard commented Apr 2, 2024 •

edited

Loading

josephschorr commented Apr 3, 2024 •

edited

Loading

enriquedacostacambio commented Apr 3, 2024 •

edited

Loading

enriquedacostacambio commented Apr 3, 2024 •

edited

Loading

enriquedacostacambio commented Apr 17, 2024 •

edited

Loading

enriquedacostacambio commented Apr 17, 2024 •

edited

Loading