Storage and invalidation for custom field read functions. #5667

benjamn · 2019-12-10T18:59:33Z

Custom field read functions might need to perform expensive computations, so caching the results of read functions is encouraged. Unfortunately, there is no good way for the InMemoryCache to provide that caching automatically, since read function results could depend on any/all/some/none of the arguments passed to the field, and only the application developer knows which arguments are important.

At this point you might be thinking, "Doesn't the developer already have an opportunity to tell the cache which arguments are important by configuring keyArgs: [...] in the field policy?" While that statement is true, it's only the beginning of the story. When you specify keyArgs, you're telling the cache how to distinguish multiple values for a given field, but the read and merge functions have access to the complete set of arguments, so keyArgs alone is not enough to differentiate between all possible read function return values.

In fact, a common pattern when implementing custom read and merge functions is to pass keyArgs: false to disable the default argument-based differentiation entirely, so the read and merge functions can take full responsibility for interpreting the arguments passed to the field. In this common scenario, the cache knows nothing about how read results should be stored. In short, read function caching is a responsibility that must be left to the read function.

To resolve this seemingly unresolvable conundrum, I've adopted a simple but flexible policy: every read function now has access to a private options.storage object, which is a Record<string, any> where the read function can stash any information it wants to preserve across multiple invocations of the read function. This options.storage object is unique to the current entity object and the current field name (plus any additional information specified via keyArgs). In other words, you can think of this options.storage object as storing mutable metadata about the immutable existing field value.

Here's an example of a cached read function for a WorkItem.expensiveResult field:

const cache = new InMemoryCache({
  typePolicies: {
    WorkItem: {
      fields: {
        expensiveResult: {
          keyArgs: false, // read function assumes full responsibility for args handling
          read(existing, { args, storage }) {
            const key = makeKeyFromArgs(args); // user-defined
            if (key in storage) return storage[key];
            return storage[key] = expensiveComputation(existing, args);
          },
        },
      },
    },
  },
});

To complement the options.storage object, this PR also introduces an options.invalidate() function that can be called to invalidate any cached query results that previously consumed the field value, which is especially useful when the read function uses external data sources that might change over time. Specifically, calling invalidate() will invalidate any results that were computed using the same options.storage object:

const cache = new InMemoryCache({
  typePolicies: {
    WorkItem: {
      fields: {
        expensiveResult: {
          keyArgs: false,
          read(existing, { args, storage, invalidate }) {
            const key = makeKeyFromArgs(args);
            if (key in storage) return storage[key];
            // Suppose the expensiveComputation takes a callback function that will
            // be called whenever the result of the computation may have changed:
            return storage[key] = expensiveComputation(existing, args, newResult => {
              // If a new result is not available, delete storage[key] instead.
              storage[key] = newResult;
              invalidate();
            });
          },
        },
      },
    },
  },
});

Finally, this PR renames the options.getFieldValue helper function to options.readField, and allows it to invoke custom read functions for any fields that it reads, rather than just retrieving the existing field value. See the commit message and tests included in 5219e36 to understand why this change was important.

TypeScript doesn't do a great job of enforcing parameter types (including `this:` types) for functions called with Function.prototype.{call,apply}, which is especially frustrating because you pretty much have to use those methods when you want to call a function with a specific `this` object. Another reason to avoid using `this` is simply that some developers prefer arrow functions, and arrow functions ignore any `this` object provided by .call or .apply. Instead, we can expose the Policies object as a property in the FieldFunctionOptions parameter, so it can be used (or ignored) without having to think about `this` at all.

We're no longer passing in a StoreObject, so the longer name is now technically inaccurate.

If a read function needs to cache expensive computations, or do any other sort of long-term bookkeeping, it's convenient to have a unique private storage object that gets passed to the read function each time it is invoked for a particular field within a particular object. Making this work for normalized entity objects that have a string ID was easy, but it was somewhat trickier to support non-normalized, nested objects. The trick is to use the object itself as a key in a WeakMap (via the KeyTrie), so the object will not be kept alive after it has been removed from the cache. We do not need or want to continue using the same storage object after such a change, because it is never safe to assume a non-normalized object has the same identity as any other (!==) object in the cache.

Custom field read functions that listen to external data sources need to inform the cache when the external data change, so any queries that previously consumed the field can be reevaluated. Invalidation can also be triggered by writing a new value for the underlying field data, but not all read functions are backed by existing data in the cache, so options.invalidate() fills the gap for those purely dynamic read functions.

At first I thought it would be risky to allow getFieldValue to call read functions, because it would open the door to infinite recursion and expensive chains of read functions. However, after attempting to write tests that made heavy use of getFieldValue, I realized that calling read functions is just too useful to forbid, and, if we crippled getFieldValue in this way, developers would probably just resort to using cache.readFragment to read data from the cache, which passes through several more layers of abstraction, and thus is almost certainly slower than readField.

benjamn · 2019-12-10T20:48:44Z

src/cache/inmemory/readFromStore.ts

+export type FieldValueGetter =
+  ReturnType<typeof makeFieldValueGetter>;


I like this pattern of exporting just the inferred return type of a function, without exporting the function itself.

benjamn · 2019-12-10T20:54:11Z

src/cache/inmemory/policies.ts

-      field: string,
-      foreignRef?: Reference,
-    ) => any,
-    typename = getFieldValue("__typename") as string,


We could keep passing in the typename (with a default expression fallback), since we always have to call getFieldValue<string>(objectOrReference, "__typename") in executeSelectionSet, which is the primary caller of this method. That would make reading fields that don't have custom read functions a tiny bit faster.

benjamn · 2019-12-10T20:55:02Z

src/cache/inmemory/policies.ts

+        invalidate() {
+          policies.fieldDep.dirty(storage);


In order to be really useful, the options.invalidate function should also schedule a cache.broadcastWatches() call. I'll tackle that in a follow-up PR.

benjamn · 2019-12-10T20:56:50Z

src/cache/inmemory/__tests__/policies.ts

@@ -472,6 +602,473 @@ describe("type policies", function () {
      expect(cache.extract(true)).toEqual(expectedExtraction);
    });

+    it("readField helper function calls custom read functions", function () {


This is a long test, but I think it's worth reading through it to get a sense for what it's like to implement highly dynamic custom read functions in terms of other custom read functions.

hwillson

This looks great @benjamn - super useful! Just a quick note while we're working on the docs: the idea of using a cache (StorageType) while working with the cache (InMemoryCache) might confuse people as they ramp up with the new cache API, so we'll want to make sure this is addressed clearly. Thanks!

hwillson · 2019-12-12T20:44:35Z

src/cache/inmemory/policies.ts

+  // consumed this field. If you use options.storage as a cache, setting a
+  // new value in the cache and then calling options.invalidate() can be a
+  // good way to deliver asynchronous results.


I'm wondering if the use invalidate to help deliver asynchronous results mention here might confuse people, without more context. These are code comments so maybe not (as people in here are grokking the source at the same time), but since we're mentioning it we might want to add another sentence or two that expands on this.

#5667 (comment)

benjamn added 7 commits December 9, 2019 16:14

Move getFieldValue creation into a helper function.

c6c7afa

Rename readFieldFromStoreObject to readField.

bc47bf7

We're no longer passing in a StoreObject, so the longer name is now technically inaccurate.

Increase bundlesize limit to 24kB.

a61e669

benjamn added 💡 idea 🧞‍♂️ enhancement labels Dec 10, 2019

benjamn added this to the Release 3.0 milestone Dec 10, 2019

benjamn self-assigned this Dec 10, 2019

benjamn requested review from hwillson and jbaxleyiii December 10, 2019 19:00

benjamn added 3 commits December 10, 2019 14:29

Avoid unnecessary as any type coercion.

a072e20

Better generic type handling for Policies#readField.

f1c833f

Fix comment about ReadFunctionOptions.readField.

109404b

benjamn commented Dec 10, 2019

View reviewed changes

hwillson approved these changes Dec 12, 2019

View reviewed changes

benjamn added 2 commits December 12, 2019 16:25

Reword comments about options.{storage,invalidate}.

73002b0

#5667 (comment)

Pass __typename into Policies#readField when already available.

4dcd9c6

#5667 (comment)

benjamn mentioned this pull request Dec 12, 2019

Release 3.0 #5116

Merged

31 tasks

benjamn merged commit c2c1f08 into release-3.0 Dec 12, 2019

benjamn mentioned this pull request Jan 30, 2020

Eliminate options.invalidate function in favor of local variables. #5883

Merged

github-actions bot locked as resolved and limited conversation to collaborators Feb 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Storage and invalidation for custom field read functions. #5667

Storage and invalidation for custom field read functions. #5667

benjamn commented Dec 10, 2019 •

edited

Loading

benjamn Dec 10, 2019

benjamn Dec 10, 2019

benjamn Dec 10, 2019

benjamn Dec 10, 2019

hwillson left a comment

hwillson Dec 12, 2019

		export type FieldValueGetter =
		ReturnType<typeof makeFieldValueGetter>;

Storage and invalidation for custom field read functions. #5667

Storage and invalidation for custom field read functions. #5667

Conversation

benjamn commented Dec 10, 2019 • edited Loading

benjamn Dec 10, 2019

Choose a reason for hiding this comment

benjamn Dec 10, 2019

Choose a reason for hiding this comment

benjamn Dec 10, 2019

Choose a reason for hiding this comment

benjamn Dec 10, 2019

Choose a reason for hiding this comment

hwillson left a comment

Choose a reason for hiding this comment

hwillson Dec 12, 2019

Choose a reason for hiding this comment

benjamn commented Dec 10, 2019 •

edited

Loading