
[BREAKING] Resolve cache access when values are dispatched #113

Closed
wants to merge 1 commit into from

Conversation

novemberborn


This PR is meant to give context to #97.

- I had to use `const` to avoid some Flow ambiguity. Since I don't see it used elsewhere in the codebase, please let me know if that's an issue.
- This uses a `for..of` loop over a `Map`; I hope that's OK.
- The cache access promise construction is a bit odd. I'm not too familiar with Flow, so please let me know if this can be improved.
- There are no tests yet for the new behavior, but the existing tests should still pass.

The documentation implies that `DataLoader#load()` returns the same promise when caching is enabled. With this PR that changes: promises are only reused while the batch is still enqueued. This is kind of the point of the PR, but it might be considered a breaking change. There are no tests that explicitly guarantee the old behavior, though.
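For illustration, here is a hypothetical sketch of the observable difference (not code from this PR), using an identity batch function:

```js
var DataLoader = require('dataloader');
var loader = new DataLoader(keys => Promise.resolve(keys));

var first = loader.load(1);
console.log(loader.load(1) === first); // true: same enqueued batch

first.then(() => {
  // The batch has dispatched. Today load(1) still returns `first` from
  // the memoization cache; with this PR it returns a fresh promise that
  // resolves together with the next dispatch.
  console.log(loader.load(1) === first); // true today, false with this PR
});
```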

When cached keys are loaded along with new keys, make sure the promises
returned for the cached values are resolved in the same micro-task as
those for the newly loaded values.

This means calling code will not (easily) be able to observe whether a
key was cached or not. If calling code uses the values to perform more
loads (perhaps using different data loaders) this ensures that those
loads are combined into a single batch.

In the following example the same loader is used. Without this commit, once
`userLoader.load(1)` is cached, the subsequent loads unexpectedly result in
three requests, not two:

```js
var DataLoader = require('dataloader');

var userLoader = new DataLoader(keys => myBatchGetUsers(keys));

userLoader.load(1)
  .then(firstUser => {
    // Later…

    userLoader.load(1)
      .then(user => userLoader.load(user.invitedByID))
      .then(invitedBy => console.log(`User 1 was invited by ${invitedBy}`));

    // Elsewhere in your application
    userLoader.load(2)
      .then(user => userLoader.load(user.lastInvitedID))
      .then(lastInvited => console.log(`User 2 last invited ${lastInvited}`));
  })
```

This is because `userLoader.load(1)` resolves nearly instantaneously while
`userLoader.load(2)` requires a backend round-trip. This means the subsequent
loads for `user.invitedByID` and `user.lastInvitedID` are not batched together.

*With* this commit however both promises resolve in the same micro-task,
allowing the subsequent loads to be batched.

Fixes #97.
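
For context, here is a minimal sketch of the idea behind the commit (a hypothetical `TinyLoader`, not the PR's actual implementation): every `load()`, cached or not, settles when the queue dispatches.

```js
class TinyLoader {
  constructor(batchFn) {
    this._batchFn = batchFn;   // keys => Promise<values>
    this._cache = new Map();   // key -> previously loaded value
    this._queue = [];          // pending { key, resolve, reject }
  }

  load(key) {
    const promise = new Promise((resolve, reject) => {
      this._queue.push({ key, resolve, reject });
    });
    // Schedule a single dispatch after the current callers have run.
    if (this._queue.length === 1) {
      process.nextTick(() => this._dispatch());
    }
    return promise;
  }

  _dispatch() {
    const queue = this._queue;
    this._queue = [];
    // Only cache misses hit the backend (duplicate keys are not
    // deduplicated here, for brevity).
    const misses = queue.filter(item => !this._cache.has(item.key));
    const loaded = misses.length > 0
      ? this._batchFn(misses.map(item => item.key))
      : Promise.resolve([]);
    loaded.then(values => {
      misses.forEach((item, i) => this._cache.set(item.key, values[i]));
      // Cache hits and fresh values settle in the same micro-task, so
      // callers cannot tell which keys were already cached.
      for (const item of queue) {
        item.resolve(this._cache.get(item.key));
      }
    }, error => {
      for (const item of queue) {
        item.reject(error);
      }
    });
  }
}
```

With something like `new TinyLoader(keys => myBatchGetUsers(keys))`, a repeated `load(1)` after a dispatch still resolves from the cache, but only when the next queue dispatches.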
@coveralls

Coverage Status

Coverage decreased (-0.2%) to 99.815% when pulling 57f9659 on novemberborn:resolve-cache-access-upon-queue-dispatch into dc3f0cd on facebook:master.

@leebyron
Contributor

This is pretty interesting, thanks for fleshing out an implementation.

It's definitely breaking, seems more like a shift in trade-offs than an explicit improvement, and adds a lot of implementation complexity. So I'm a bit nervous about proceeding directly.

@novemberborn
Author

@leebyron currently DataLoader leaks whether a value has been retrieved previously. This makes it harder to reason about its behavior. In my example above, it leads to decreased efficiency in subsequent requests.

But yes, this is a trade-off; there might be other circumstances where the current behavior leads to better performance. I'd argue it violates the principle of least astonishment, though.

@caub
Contributor

caub commented Mar 21, 2018

@novemberborn sorry to disturb you, but I don't see where it does three requests instead of two: https://runkit.com/caub/5ab2bdc1a964eb001276b778 (I see one `delay: 1` and one `delay: 2`, as expected). Or maybe the lib has changed since then?

@novemberborn
Author

@caub in your example the loads aren't racing each other. I think you'll see different behavior if you do `Promise.all([dataloader.load(1), dataloader.load(2)])`. My point in this PR is that you shouldn't be able to observe that `1` has been loaded previously.
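
For anyone trying this, a sketch of that experiment with a logging batch function (hypothetical data shape; the exact batching observed depends on micro-task scheduling, which is the behavior under discussion):

```js
var DataLoader = require('dataloader');

// Record each batch call so backend round-trips can be counted.
var calls = [];
var loader = new DataLoader(keys => {
  calls.push(keys);
  return Promise.resolve(keys.map(key => ({ id: key, next: key + 10 })));
});

loader.load(1).then(() => {
  // Key 1 is now cached. Race a cache hit against a fresh load:
  return Promise.all([
    loader.load(1).then(user => loader.load(user.next)), // cache hit
    loader.load(2).then(user => loader.load(user.next)), // round-trip
  ]);
}).then(() => {
  // Whether loads 11 and 12 share a batch depends on when the cached
  // promise for key 1 settles relative to the dispatch for key 2;
  // that scheduling is exactly what this PR makes deterministic.
  console.log(calls);
});
```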

@caub
Contributor

caub commented Mar 22, 2018

@novemberborn Ah OK, I updated the RunKit example, but I still can't reproduce it.

@novemberborn
Author

@caub have a look at this one: https://runkit.com/novemberborn/5ab9142935659d0012049e22

When `1` is loaded it returns immediately (because it was loaded previously), but `2` has an extra delay. If we then load both again, they both return immediately, as both have been loaded previously.

When used in GraphQL this can prevent requests from being properly batched. Arguably the current behavior is also valid, though.

@caub
Contributor

caub commented Mar 26, 2018

@novemberborn OK, so it behaves as expected; there are no extra requests: https://runkit.com/caub/5ab9186d567b6f0012824a7d (you talked about "subsequent loads unexpectedly result in three requests, not two").

I've read #97 as well. I'd be curious to see if you can reproduce this case; it'd help.

@novemberborn
Author

> (you talked about "subsequent loads unexpectedly result in three requests, not two")

It can impact batching in GraphQL, in unexpected ways. It's really hard to reason about though.

leebyron changed the title from "Resolve cache access when values are dispatched" to "[BREAKING] Resolve cache access when values are dispatched" on Nov 14, 2019