refactor(gatsby-source-filesystem): Upgrade XState and refactor its usage #17192

Andarist · 2019-08-29T18:24:54Z

Description

I was exploring the codebase to see how you are using XState and I've encountered this old machine using barebones Machine API from older version (v3) of XState. So I've decided to refactor this to use Interpreter (which you are also using in other parts of the codebase), mainly as an exercise for myself but I believe that it's cleaner now and gives better visualization than the previous version:

I strongly believe this should behave exactly the same as the previous implementation, BUT I have no idea how to test this properly. I could use a hint regarding that from you 😉

I've also left one TODO comment which should be resolved before merging (even if resolving just means removing it - I was unsure what's the proper logic for this one, I've decided to keep it like it was for now). In addition to that I've noticed that processed nodes before bootstrap is don are not reported - shouldn't maybe those logs be queued and flushed later?

Andarist · 2019-08-30T09:03:29Z

I get this on CI:

'gatsby-source-filesystem' doesn't seem to be installed. Restart gatsby-dev to publish it

But ain't sure what I should do to fix it, or even try this locally.

pieh · 2019-09-03T09:51:01Z

In addition to that I've noticed that processed nodes before bootstrap is don are not reported - shouldn't maybe those logs be queued and flushed later?

Reason for this is mostly that initial node creation (during bootstrap) may result in thousands of File nodes, so this would be very spammy. Node creation after bootstrap is finished is result of file changes and for most use cases this will be single new file / file changes at a time (editing files locally)

I'll take a look on this (why it fails in CI) - I agree this is a lot cleaner than current implementation

I get this on CI:

'gatsby-source-filesystem' doesn't seem to be installed. Restart gatsby-dev to publish it

But ain't sure what I should do to fix it, or even try this locally.

This is quirk of our hacky utility for testing changes in packages (gatsby-dev) and we didn't prfioritize fixing this false warning. This can be ignored

pieh

This is awesome! I left few notes, but foundation is very solid

packages/gatsby-source-filesystem/src/gatsby-node.js

pieh · 2019-09-03T10:03:00Z

packages/gatsby-source-filesystem/src/gatsby-node.js

+              },
+            },
+          },
+          // TODO: those two were not restricted to READY state, but maybe they should? or even maybe it should queue in NOT_READY?


This is very nice illustration of why using actions in state machine definition is so great (opposed to what we currently have) - it made it very clear that this is not handled properly. Those definitely need to have a bit different handling in CHOKIDAR.NOT_READY and CHOKIDAR.READY

That said, this might be not in scope of this PR (it preserves current handling) - so I'll leave this up to you if you want to attack this. If you don't - that's ok, let's just leave this TODO here

I would prefer keeping this as is now. I can try to fix this in a followup PR though - if only you could give me hints on how the handling should differ. Should those be queued until ready?

Yup, let's try to minimize scope of the PR. This will allow faster iteration.

pieh · 2019-09-03T10:04:21Z

packages/gatsby-source-filesystem/src/gatsby-node.js

+          },
+          // TODO: those two were not restricted to READY state, but maybe they should? or even maybe it should queue in NOT_READY?
+          on: {
+            CHOKIDAR_CHANGE: { actions: `createAndProcessNode` },


This can be handled in exactly same way as CHOKIDAR_ADD event (so queue in NOT_READY and process immediately in READY)

Maybe even CHOKIDAR_ADD / CHOKIDAR_CHANGE event can be merged because as far as plugin is concerned there's not much difference between creating and updating nodes (gatsby core takes care of this stuff).

But I don't think that verbosity of events is any problem and this might be easier to read and understand

Ok, so you have actually answered here my previous question 😅 Still would be in favor of keeping current logic for now, I can prepare a PR with that queuing immediately after this would get merged in.

As to verbosity of events - I would be in favor of keeping separate names.

pieh · 2019-09-03T10:17:22Z

packages/gatsby-source-filesystem/src/gatsby-node.js

+          // TODO: those two were not restricted to READY state, but maybe they should? or even maybe it should queue in NOT_READY?
+          on: {
+            CHOKIDAR_CHANGE: { actions: `createAndProcessNode` },
+            CHOKIDAR_UNLINK: { actions: `deleteNode` },


I do think this need similar handling as CHOKIDAR_ADD / CHOKIDAR_CHANGE:

We should queue it when it's not ready. I think we do need single queue for additions, changes and deletions - so potential queue could look like this:

[ { type: 'add', path: '<some_path>' }, { type: 'del', path: '<some_path>' } ]

so then flushing queue would handle those in proper order

pieh · 2019-09-03T10:52:49Z

packages/gatsby-source-filesystem/src/gatsby-node.js

+            deleteNode({ node })
+          }
+        },
+        flushPathQueue(_, { resolve, reject }) {


I spent embarrassingly long time looking where resolve / reject is coming from (initially I thought this is how xstate handles async action, before I found that resolve/reject is send in fsMachine.send({ type: `CHOKIDAR_READY`, resolve, reject })

I do think we can rework it to be more inline with idiomatic xstate by:

creating new state in CHOKIDAR (sibling to CHOKIDAR.NOT_READY) - let's call it FLUSHING

states: { NOT_READY: { on: { - CHOKIDAR_READY: `READY`, + CHOKIDAR_READY: `FLUSHING`, CHOKIDAR_ADD: { actions: `queueNodeProcessing` }, }, }, + FLUSHING: { + invoke: { + src: () => { + return flushPathQueue() + }, + onDone: { + target: `READY`, + }, + }, + }, READY: { on: { CHOKIDAR_ADD: { actions: `createAndProcessNode` }, }, }, },

see https://xstate.js.org/docs/guides/communication.html#invoking-services for docs about invoke property

change the promise returned from sourceNodes a bit, let chokidar.on(`ready`) just send event to machine, and let's wait for machine to reach CHOKIDAR.READY to resolve that promise

Making it this way also shows that there is another potential problem (it's good thing) - what happens currently when chokidar emits event while flushing is in progress? Right now we would not queue work, but instead would process it immediately - making order of processing potentially incorrect. So I think while we are flushing, we should queue chokidar events and when flushing finishes check if queue is empty (then we can go to CHOKIDAR.READY state, and if it's not - flush again and repeat that until queue is empty)

I like that 👍

One question though - flushPathQueue might reject and this would stay unhandled. I have no idea under what circumstances it could reject though? So not sure how this should be handled and for what to await for in ready listener to reject returned promise there.

And again - maybe we could pursue fixing this in a followup PR? Or do you prefer keeping it as part of this one?

Right, my idea here is certainly not fully fleshed out (just starting point).

And also agree let's not scope creep this PR - let's make this PR 1 to 1 refactor (keeping current behaviour with all its flaws). All those edge cases that were uncovered can be incrementally handled in future pull requests

Cool, I believe it's 1 to 1 right now if I haven't made any stupid mistake. I still don't know how to test this properly though - so I would appreciate you taking a look at this or giving me guidance.

Andarist · 2019-09-03T11:18:29Z

@pieh thanks for the initial review!

…esystem/xstate-upgrade

pieh · 2019-09-04T11:29:05Z

I added basic test suite (that tests sourceNodes rather than machine itself) with couple of tests skipped (edge cases discovered in this review).

I'll merge this in

pieh

Thank for cleaning this up! It's much easier to follow now!

gatsbot · 2019-09-04T11:30:35Z

Holy buckets, @Andarist — we just merged your PR to Gatsby! 💪💜

Gatsby is built by awesome people like you. Let us say “thanks” in two ways:

We’d like to send you some Gatsby swag. As a token of our appreciation, you can go to the Gatsby Swag Store and log in with your GitHub account to get a coupon code good for one free piece of swag. We’ve got Gatsby t-shirts, stickers, hats, scrunchies, and much more. (You can also unlock even more free swag with 5 contributions — wink wink nudge nudge.) See gatsby.dev/swag for details.
We just invited you to join the Gatsby organization on GitHub. This will add you to our team of maintainers. Accept the invite by visiting https://github.com/orgs/gatsbyjs/invitation. By joining the team, you’ll be able to label issues, review pull requests, and merge approved pull requests.

If there’s anything we can do to help, please don’t hesitate to reach out to us: tweet at @gatsbyjs and we’ll come a-runnin’.

Thanks again!

Andarist · 2019-09-04T11:32:31Z

@pieh cool, I’ll work on fixing those issues discovered during the PR in following days

* Upgrade XState and refactor its usage in gatsby-source-filesystem * test: add basic test suite for sourceNodes

sidharthachatterjee · 2019-09-13T03:59:21Z

Thanks for all the great work here @Andarist

Featuring this in the September Gazette in #17548

* Upgrade XState and refactor its usage in gatsby-source-filesystem * test: add basic test suite for sourceNodes

Andarist requested a review from a team as a code owner August 29, 2019 18:24

Andarist force-pushed the gatsby-source-filesystem/xstate-upgrade branch from a8b6eac to 4e78531 Compare August 30, 2019 08:22

pieh self-assigned this Sep 3, 2019

pieh changed the title ~~Upgrade XState and refactor its usage in gatsby-source-filesystem~~ refactor(gatsby-source-filesystem): Upgrade XState and refactor its usage Sep 3, 2019

pieh requested changes Sep 3, 2019

View reviewed changes

Upgrade XState and refactor its usage in gatsby-source-filesystem

d67e1b4

Andarist force-pushed the gatsby-source-filesystem/xstate-upgrade branch from 4e78531 to d67e1b4 Compare September 3, 2019 11:20

pieh and others added 2 commits September 4, 2019 00:23

test: add basic test suite for sourceNodes

a220303

Merge remote-tracking branch 'upstream/master' into gatsby-source-fil…

6753673

…esystem/xstate-upgrade

pieh approved these changes Sep 4, 2019

View reviewed changes

pieh merged commit 9185ece into gatsbyjs:master Sep 4, 2019

Andarist deleted the gatsby-source-filesystem/xstate-upgrade branch September 4, 2019 13:03

Andarist mentioned this pull request Sep 5, 2019

fix(gatsby-source-filesystem): Queue all operations which happen before chokidar's ready state #17404

Merged

waltercruz pushed a commit to waltercruz/gatsby that referenced this pull request Sep 8, 2019

refactor(gatsby-source-filesystem): Upgrade XState (gatsbyjs#17192)

2e8d172

* Upgrade XState and refactor its usage in gatsby-source-filesystem * test: add basic test suite for sourceNodes

Andarist mentioned this pull request Sep 9, 2019

fix(gatsby-source-filesystem): Fix timing issue - processing nodes while flushing #17519

Closed

mwfrost pushed a commit to mwfrost/gatsby that referenced this pull request Apr 20, 2023

refactor(gatsby-source-filesystem): Upgrade XState (gatsbyjs#17192)

d4fb7e9

* Upgrade XState and refactor its usage in gatsby-source-filesystem * test: add basic test suite for sourceNodes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(gatsby-source-filesystem): Upgrade XState and refactor its usage #17192

refactor(gatsby-source-filesystem): Upgrade XState and refactor its usage #17192

Andarist commented Aug 29, 2019 •

edited

Andarist commented Aug 30, 2019

pieh commented Sep 3, 2019

pieh left a comment

pieh Sep 3, 2019

pieh Sep 3, 2019

Andarist Sep 3, 2019

pieh Sep 3, 2019

pieh Sep 3, 2019

pieh Sep 3, 2019

Andarist Sep 3, 2019

pieh Sep 3, 2019

pieh Sep 3, 2019

Andarist Sep 3, 2019

Andarist Sep 3, 2019

pieh Sep 3, 2019

Andarist Sep 3, 2019

Andarist commented Sep 3, 2019

pieh commented Sep 4, 2019

pieh left a comment

gatsbot bot commented Sep 4, 2019

Andarist commented Sep 4, 2019

sidharthachatterjee commented Sep 13, 2019

refactor(gatsby-source-filesystem): Upgrade XState and refactor its usage #17192

refactor(gatsby-source-filesystem): Upgrade XState and refactor its usage #17192

Conversation

Andarist commented Aug 29, 2019 • edited

Description

Andarist commented Aug 30, 2019

pieh commented Sep 3, 2019

pieh left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Andarist commented Sep 3, 2019

pieh commented Sep 4, 2019

pieh left a comment

Choose a reason for hiding this comment

gatsbot bot commented Sep 4, 2019

Andarist commented Sep 4, 2019

sidharthachatterjee commented Sep 13, 2019

Andarist commented Aug 29, 2019 •

edited