Refactor selection set implementation to be immutable #2387

pcmanus · 2023-02-09T15:12:30Z

The current (before this commit) implementation of SelectionSet is inherently mutable: its SelectionSet.add method mutate the selection set it's called on. Which is at odds with the general query planner algorithm, which is the main user of SelectionSet, but is fundamentally based on dealing with immutable data (as we explore various "path" options and the possible plans, most things are shared (at least partly)).

This mismatch between the query planner that really want immutable selection sets but an implementation that is mutable leads to less than optimal code complexity and performance. More specificially, if we want to use a SelectionSet into 2 separate branch of the algorithm, we usually have to do defensive deep copies, which is inefficient. And to mitigate those inefficiencies, we sometimes don't do those copies, but this lead to code that fragile and easy to break and has lead to bug previously (where we save a copy, but then a branch mistakenly mutated and thus impacted another branch mistakenly). A few tricks had been added to the implementation to help mitigate the risks (the freeze behaviour, and the copy-on-write in FetchGroup), but they add complexity and aren't always optimal performance wise.

This commit thus rework the implementation/api of SelectionSet to make it an immutable data structure. Doing so makes the code a lot safer and easy to reason about. And it also provide some performance benefits: compared to current main over a set of production schema/query, this seem to provide around 20% improvement on average and up to almost 50% improvment on some specific cases for the computation of query plans (disclaimer: those benchmark are fairly unscientific so those numbers should be taken with a grain of salt, but the numbers are clear enough that this is a net measurable gain).

netlify · 2023-02-09T15:12:34Z

👷 Deploy request for apollo-federation-docs pending review.

Visit the deploys page to approve it

Name	Link
🔨 Latest commit	`ae71a71`

changeset-bot · 2023-02-09T15:12:35Z

🦋 Changeset detected

Latest commit: 11954af

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 7 packages

Name	Type
@apollo/composition	Patch
@apollo/gateway	Patch
@apollo/federation-internals	Patch
@apollo/query-graphs	Patch
@apollo/query-planner	Patch
@apollo/subgraph	Patch
apollo-federation-integration-testsuite	Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

codesandbox-ci · 2023-02-09T15:13:30Z

This pull request is automatically built and testable in CodeSandbox.

To see build info of the built libraries, click here or the icon next to each commit SHA.

The current (before this commit) implementation of `SelectionSet` is inherently mutable: its `SelectionSet.add` method mutate the selection set it's called on. Which is at odds with the general query planner algorithm, which is the main user of `SelectionSet`, but is fundamentally based on dealing with immutable data (as we explore various "path" options and the possible plans, most things are shared (at least partly)). This mismatch between the query planner that really want immutable selection sets but an implementation that is mutable leads to less than optimal code complexity and performance. More specificially, if we want to use a `SelectionSet` into 2 separate branch of the algorithm, we usually have to do defensive deep copies, which is inefficient. And to mitigate those inefficiencies, we sometimes don't do those copies, but this lead to code that fragile and easy to break and has lead to bug previously (where we save a copy, but then a branch mistakenly mutated and thus impacted another branch mistakenly). A few tricks had been added to the implementation to help mitigate the risks (the `freeze` behaviour, and the copy-on-write in `FetchGroup`), but they add complexity and aren't always optimal performance wise. This commit thus rework the implementation/api of `SelectionSet` to make it an immutable data structure. Doing so makes the code a lot safer and easy to reason about. And it also provide some performance benefits: compared to current `main` over a set of production schema/query, this seem to provide around 20% improvement on average and up to almost 50% improvment on some specific cases (disclaimer: those benchmark are fairly unscientific so those numbers should be taken with a grain of salt, but the numbers are clear enough that this is a net measurable gain).

trevor-scheer

Review still in progress

internals-js/src/definitions.ts

internals-js/src/operations.ts

Co-authored-by: Trevor Scheer <trevor.scheer@gmail.com>

composition-js/src/validate.ts

clenfest · 2023-03-15T03:02:47Z

query-planner-js/src/buildPlan.ts

@@ -672,10 +631,14 @@ class FetchGroup {
    readonly rootKind: SchemaRootKind,
    readonly parentType: CompositeType,
    readonly isEntityFetch: boolean,
-    private _selection: LazySelectionSet,
-    private _inputs?: LazySelectionSet,
+    private _selection: MutableSelectionSet<{ conditions: Conditions}>,


It's probably past time to change this to named parameters, especially when there are so many optionals.

It's a private constructor and the public methods that calls it do use named parameters. Moving to named parameters here means separately declaring all those field and having trivial assignments for each. That'd be fine but it's a lot of noise and I like it the way it is.

query-planner-js/src/buildPlan.ts

internals-js/src/operations.ts

clenfest · 2023-03-15T04:03:58Z

internals-js/src/operations.ts

+  }
+
+
+  static of(selectionSet: SelectionSet): MutableSelectionSet {


Is this used?

It's indeed not used anymore, but I'd like to keep it both for symmetry with empty and because it make sense API wise and has a high change of being use soon enough (and it really isn't, it's hardly a maintenance burden).

internals-js/src/operations.ts

query-graphs-js/src/querygraph.ts

query-planner-js/src/buildPlan.ts

Outside of minor typos/updates, the bulk of this change is switching how we collect used variables to be more efficient/avoid generating useless garbage.

It turns out that we have had (since fed 2.0 at least, maybe earlier) a slightly weird behaviour (a bug really) in the validation of selection sets which impacts what we accept for the `fields` arg of `@key`, `@requires` and `@provides`. Namely, assuming a field `t` returnin an object type, we were accepting a selection like: ``` { t { a b } t } ``` even though the 2nd occurence of `t` is kind of incorrect according to the graphQL spec since `t` should always have a sub-selection. But the reason it works is that the code was merging the 1st and 2nd occurrence of `t` before any validation was run, so internally the selection is handled as just: ``` { t { a b } } ``` Now, an assertion was added in apollographql#2387 that is triggered by the example above, and that means that some `@key`, `@provides` or `@requires` that were accepted (and were mostly correctly working) in currently released versions would start erroring in 2.4 because of this. To be clear, the historical behaviour is kind of wrong here, and we should consider fixing it at some point. However, hard-failing on upgrades is not very nice: we should probably introduce a warning for a few versions before genuinely making this an error. Further, the current assertion does not provide a very user friendly message. In the meantime, this PR restore the status quo.

It turns out that we have had (since fed 2.0 at least, maybe earlier) a slightly weird behaviour (a bug really) in the validation of selection sets which impacts what we accept for the `fields` arg of `@key`, `@requires` and `@provides`. Namely, assuming a field `t` returnin an object type, we were accepting a selection like: ``` { t { a b } t } ``` even though the 2nd occurence of `t` is kind of incorrect according to the graphQL spec since `t` should always have a sub-selection. But the reason it works is that the code was merging the 1st and 2nd occurrence of `t` before any validation was run, so internally the selection is handled as just: ``` { t { a b } } ``` Now, an assertion was added in #2387 that is triggered by the example above, and that means that some `@key`, `@provides` or `@requires` that were accepted (and were mostly correctly working) in currently released versions would start erroring in 2.4 because of this. To be clear, the historical behaviour is kind of wrong here, and we should consider fixing it at some point. However, hard-failing on upgrades is not very nice: we should probably introduce a warning for a few versions before genuinely making this an error. Further, the current assertion does not provide a very user friendly message. In the meantime, this PR restore the status quo.

@defer

The previously committed [apollographql#2713](apollographql#2713) fixed an issue introduced by [apollographql#2387](apollographql#2387), ensuring that querying the same field with different directives applications was not merged, similar to what was/is done for fragments. But the exact behaviour slightly differs between fields and fragments when it comes to `@defer` in that for fragments, we never merge 2 similar fragments where both have `@defer`, which we do merge for fields. Or to put it more concretely, in the following query: ```graphq query Test($skipField: Boolean!) { x { ... on X @defer { a } ... on X @defer { b } } } ``` the 2 `... on X @defer` are not merged, resulting in 2 deferred sections that can run in parallel. But following [apollographql#2713](apollographql#2713), query: ```graphq query Test($skipField: Boolean!) { x @defer { a } x @defer { b } } ``` _will_ merge both `x @defer`, resulting in a single deferred section. This fix changes that later behaviour so that the 2 `x @defer` are not merged and result in 2 deferred sections, consistently with both 1) the case of fragments and 2) the behaviour prior to [apollographql#2387](apollographql#2387).

@defer

…ly (#2720) The previously committed [#2713](#2713) fixed an issue introduced by [#2387](#2387), ensuring that querying the same field with different directives applications was not merged, similar to what was/is done for fragments. But the exact behaviour slightly differs between fields and fragments when it comes to `@defer` in that for fragments, we never merge 2 similar fragments where both have `@defer`, which we do merge for fields. Or to put it more concretely, in the following query: ```graphq query Test($skipField: Boolean!) { x { ... on X @defer { a } ... on X @defer { b } } } ``` the 2 `... on X @defer` are not merged, resulting in 2 deferred sections that can run in parallel. But following [#2713](#2713), query: ```graphq query Test($skipField: Boolean!) { x @defer { a } x @defer { b } } ``` _will_ merge both `x @defer`, resulting in a single deferred section. This fix changes that later behaviour so that the 2 `x @defer` are not merged and result in 2 deferred sections, consistently with both 1) the case of fragments and 2) the behaviour prior to [#2387](#2387).

pcmanus force-pushed the selection-set-refactor branch from 5f584cf to d65fcb8 Compare February 9, 2023 15:16

korinne added the status/needs-review label Feb 9, 2023

jeffjakub requested a review from clenfest February 22, 2023 21:35

jeffjakub assigned clenfest Feb 22, 2023

pcmanus force-pushed the selection-set-refactor branch from d65fcb8 to ae71a71 Compare March 10, 2023 14:38

pcmanus requested a review from StephenBarlow as a code owner March 10, 2023 14:38

pcmanus changed the base branch from main to next March 10, 2023 14:39

pcmanus added this to the 2.4 milestone Mar 10, 2023

pcmanus mentioned this pull request Mar 10, 2023

Optim local value types #2449

Merged

jeffjakub assigned trevor-scheer Mar 10, 2023

trevor-scheer reviewed Mar 13, 2023

View reviewed changes

pcmanus and others added 3 commits March 14, 2023 10:42

Switch optional arg to one with default

e2db1a2

Co-authored-by: Trevor Scheer <trevor.scheer@gmail.com>

Comment typo

1e83c3a

Co-authored-by: Trevor Scheer <trevor.scheer@gmail.com>

Implement review feedback

aad3648

clenfest reviewed Mar 15, 2023

View reviewed changes

trevor-scheer reviewed Mar 15, 2023

View reviewed changes

internals-js/src/operations.ts Show resolved Hide resolved

query-graphs-js/src/querygraph.ts Outdated Show resolved Hide resolved

query-planner-js/src/buildPlan.ts Outdated Show resolved Hide resolved

trevor-scheer approved these changes Mar 15, 2023

View reviewed changes

Additional review feedback

11954af

Outside of minor typos/updates, the bulk of this change is switching how we collect used variables to be more efficient/avoid generating useless garbage.

pcmanus merged commit 260c357 into apollographql:next Mar 15, 2023

github-actions bot mentioned this pull request Mar 15, 2023

release: on branch next #2465

Merged

pcmanus mentioned this pull request Mar 16, 2023

Restore backward compatible handling of corner case for fields #2475

Merged

pcmanus mentioned this pull request Aug 7, 2023

Expands over-eager merging of field fix to handle @defer consistently #2720

Merged

github-actions bot mentioned this pull request Aug 15, 2023

release: on branch main #2730

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor selection set implementation to be immutable #2387

Refactor selection set implementation to be immutable #2387

pcmanus commented Feb 9, 2023

netlify bot commented Feb 9, 2023 •

edited

Loading

changeset-bot bot commented Feb 9, 2023 •

edited

Loading

codesandbox-ci bot commented Feb 9, 2023 •

edited

Loading

trevor-scheer left a comment

clenfest Mar 15, 2023

pcmanus Mar 15, 2023

clenfest Mar 15, 2023

pcmanus Mar 15, 2023

		}


		static of(selectionSet: SelectionSet): MutableSelectionSet {

Refactor selection set implementation to be immutable #2387

Refactor selection set implementation to be immutable #2387

Conversation

pcmanus commented Feb 9, 2023

netlify bot commented Feb 9, 2023 • edited Loading

👷 Deploy request for apollo-federation-docs pending review.

changeset-bot bot commented Feb 9, 2023 • edited Loading

🦋 Changeset detected

codesandbox-ci bot commented Feb 9, 2023 • edited Loading

trevor-scheer left a comment

Choose a reason for hiding this comment

clenfest Mar 15, 2023

Choose a reason for hiding this comment

pcmanus Mar 15, 2023

Choose a reason for hiding this comment

clenfest Mar 15, 2023

Choose a reason for hiding this comment

pcmanus Mar 15, 2023

Choose a reason for hiding this comment

netlify bot commented Feb 9, 2023 •

edited

Loading

changeset-bot bot commented Feb 9, 2023 •

edited

Loading

codesandbox-ci bot commented Feb 9, 2023 •

edited

Loading