Wait for all resources to load before indicating Watcher is idle #16

jwir3 · 2023-09-07T20:56:30Z

Issue: CAP-1082

What Changed

This adds a set of promises to the Watcher class that represent network requests made by the client prior to the archive being created. For each of the network requests, as long as it is not ignored (i.e. within the baseUrl), the promise will resolve when the network request resolves. Once all of these promises are resolved, OR when a 4s timer has expired, the Watcher will indicate it is idle.

How to test

You can run the tests using yarn test. A new test was added with a large image that takes an amount of time to load. Alternatively, run the test-archiver after having linked a project that can utilize it with yarn link.

Change Type

📦 Published PR as canary version: 0.0.25--canary.16.982fd10.0

✨ Test out this PR locally via:

npm install @chromaui/test-archiver@0.0.25--canary.16.982fd10.0
# or 
yarn add @chromaui/test-archiver@0.0.25--canary.16.982fd10.0

linear · 2023-09-07T20:56:32Z

CAP-1082 Wait for all resources to load

Right now we simply wait 1s after takeArchive is called.

tmeasday

I think you missed an important but hairy case -- when a request triggers a second request.

Also I think we need to figure out a way to test this (the timeouts and the cascading requests). Might be something useful in the capture tests for that.

tmeasday · 2023-09-07T23:33:40Z

src/resource-archive/index.ts

+    // eslint-disable-next-line @typescript-eslint/no-implied-eval
+    let timeoutId: any;
+    const timeoutPromise = new Promise((r) => {
+      timeoutId = setTimeout(r, 4000);


Should we add the unresolved requests/promises to the log?

Yes, I think that's a good idea. I'll add logging information for this.

tmeasday · 2023-09-07T23:35:18Z

src/resource-archive/index.ts

+
+    await Promise.race([
+      timeoutPromise,
+      Promise.all([...this.unfulfilledRequests.values(), ...this.unsettledResolvers.values()]),


What happens if a request creates further requests in resolving?

I think probably we need to do something more complicated, ie check at the end of each request if "all requests are resolved".

Ok, I can add this logic. I had another concern, too, that Promise.all might resolve before any of the requests are actually added to the set. I suppose I could add another 500ms wait at the very beginning, but this seems like the incorrect solution.

tmeasday · 2023-09-07T23:37:06Z

src/resource-archive/index.ts

+  private unfulfilledRequests: Map<string, Promise<any>>;
+
+  private unsettledResolvers: Map<string, any>;


I think these names could be more descriptive. Also why is the second one string=>any?

I'm not sure I quite see why they need to be separate lists either? But maybe I am missing something (if so can you put a comment explaining ;) )

The first one holds Promises, but the second one holds functions that resolve Promises. They probably don't need to be separate lists - I'm not 100% sure why I separated them, other than I think I was experimenting with a different way of doing some of these. Honestly, I don't think that the unsettledResolvers really needs to exist. I'll refactor this a bit and see if I can coalesce them.

tmeasday · 2023-09-07T23:40:15Z

src/resource-archive/index.ts

+    let resolver: any = null;
+    const prom = new Promise((resolve, reject) => {
+      resolver = resolve;
+    });


Probably worth pulling this out to a helper to make it clearer, maybe something like:

const complete = this.registerUnfulfilledRequest(request.url); // later complete();

Then the registerUnfulfilledRequest could look something like:

registerUnfulfilledRequest() { const complete = // something similar to what you have complete.then(() => { if (/* all requests are resolved */) { this.allRequestsDoneComplete(); // this triggers a "super" promise representing all requests } } return complete; }

jwir3 · 2023-09-08T20:38:57Z

Also I think we need to figure out a way to test this (the timeouts and the cascading requests). Might be something useful in the capture tests for that.

How would you recommend approaching the testing of this? I can look through the capture tests, which is a good idea, but my naive initial idea was to add something like this to the express server:

      app.get('/timeout', async (req, res) => {
        await setTimeout(() => {
          res.sendStatus(204);
        }, 7000);
      });

Then make a test similar to the following:

    // eslint-disable-next-line jest/expect-expect
    it('should timeout if a request takes too long', async () => {
      const timeoutUrl = `${baseUrl}/timeout`;
      const indexPath = `/?inject=${encodeURIComponent(`<img src="${timeoutUrl}">`)}`;

      const complete = await createResourceArchive(page);

      await page.goto(new URL(indexPath, baseUrl).toString());
      const archive = await complete();

      // expectArchiveContains(archive, [indexPath, '/img.png', '/style.css']);
    });

This doesn't achieve what I want, though, because the test itself times out, rather than the setTimeout promise rejecting. Perhaps I'm doing something wrong?

@codykaup or @jmhobbs thoughts on how I might be able to implement a test for timeouts?

codykaup · 2023-09-11T14:12:39Z

It looks like we have some capture tests for this and we accomplish it by setting the timeout to basically 0 then run the test. Since all the requests take longer than our timeout, we get the error we want.

I'll admit, I'm not up-to-date on this project so maybe this is warranted but it seems like we're doing a lot of heavy lifting with promises while watching these requests. The spirit of our internal navigation watcher is much like Playwright's waitUntil argument. We're essentially keeping track of requests that start/end/fail and reset a timer each time something updates. As soon as don't see any updates for X amount of time or we hit our max timeout, we exit.

tmeasday · 2023-09-11T23:50:30Z

I think setting the timeout much lower in the test is probably the key unlock here. Otherwise your testing strategy sounds right to me @jwir3.

I'll admit, I'm not up-to-date on this project so maybe this is warranted but it seems like we're doing a lot of heavy lifting with promises while watching these requests. The spirit of our internal navigation watcher is much like Playwright's waitUntil argument. We're essentially keeping track of requests that start/end/fail and reset a timer each time something updates. As soon as don't see any updates for X amount of time or we hit our max timeout, we exit.

I think this a matter of personal preference and so I would definitely leave it up to @jwir3 or y'all.

But I would say the technique of await Promise.all(requestsThatHadStartedWhenIdleWasCalled) isn't going to work because there may be other promises added later. I guess that makes keeping a list of promises less useful, rather than just a list of booleans of "has this request finished?".

To my mind, a conceptually similar approach is a single "summary" promise of "are all requests done?" that we (maybe) resolve when each request is done. So something like:

wait() {
  await Promise.race([ timeoutPromise, this.allRequestsDonePromise ])
}

// In handler
this.client
  .send(method, params)
  .then((value) => {
     // Or simply remove from list
     this.pendingRequests.set(request.url, true);

     if (!Object.values(this.pendingRequests).any(done => !done)) {
       this.this.allRequestsDoneResolver()
     }
  });

jwir3 · 2023-09-13T16:16:18Z

@tmeasday @tevanoff @skitterm

I think this is ready for review now. I'm actually using playwright's built-in functionality to wait for the network to be idle. This is similar to what we do in capture, but, for some reason, we weren't able to use playwright's functionality there. It's possible that the capture use case is so much more complex than this one that this is the reason, but I'm honestly not sure.

Note: ~~The CI tests are still failing, so I'll clean that up today if you want to review the code.~~ Fixed now.

This was previously aliased instead of being actually renamed. This renames both the file in which the function is declared, as well as the function itself.

This replaces the 1s wait inside of Watcher with waiting on a set of two promises. The first is a global network timer. The global network timer will reject if it expires and there are still network requests pending. The second promise is a network idle promise that will resolve if playwright detects that the network has not been used in some amount of time. Fixes CAP-1082.

This adds a jest test for when the archiver times out (i.e. the resources requested do not load in the total time allotted), which, by default, is 2s. Refs CAP-1082.

tmeasday

Oh neat! Well that's simpler :)

I wonder if we can use that same thing in capture somehow. I guess we'll learn more in this project.

tevanoff

Nice, looks good to me!

thafryer · 2023-09-14T14:18:30Z

🚀 PR was released in v0.0.25 🚀

jwir3 requested a review from tmeasday September 7, 2023 20:56

tmeasday requested changes Sep 7, 2023

View reviewed changes

jwir3 force-pushed the jwir3/cap-1082-wait-for-all-resources-to-load branch from 6ef64af to 0517b98 Compare September 8, 2023 18:47

jwir3 force-pushed the jwir3/cap-1082-wait-for-all-resources-to-load branch 2 times, most recently from 1a5e701 to 8820447 Compare September 13, 2023 16:14

jwir3 requested review from tevanoff and skitterm September 13, 2023 16:16

jwir3 force-pushed the jwir3/cap-1082-wait-for-all-resources-to-load branch 2 times, most recently from fe63176 to 941a61b Compare September 13, 2023 16:34

jwir3 added 3 commits September 13, 2023 11:35

♻️ Rename takeSnapshot to takeArchive.

c2fdfd2

This was previously aliased instead of being actually renamed. This renames both the file in which the function is declared, as well as the function itself.

✅ Add a test for global network timeout.

982fd10

This adds a jest test for when the archiver times out (i.e. the resources requested do not load in the total time allotted), which, by default, is 2s. Refs CAP-1082.

jwir3 force-pushed the jwir3/cap-1082-wait-for-all-resources-to-load branch from 941a61b to 982fd10 Compare September 13, 2023 16:35

tmeasday approved these changes Sep 13, 2023

View reviewed changes

tevanoff approved these changes Sep 13, 2023

View reviewed changes

jwir3 merged commit 3fbbf9a into main Sep 14, 2023
2 checks passed

thafryer added the released label Sep 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wait for all resources to load before indicating Watcher is idle #16

Wait for all resources to load before indicating Watcher is idle #16

jwir3 commented Sep 7, 2023 •

edited by thafryer

linear bot commented Sep 7, 2023

tmeasday left a comment

tmeasday Sep 7, 2023

jwir3 Sep 8, 2023

tmeasday Sep 7, 2023

jwir3 Sep 8, 2023

tmeasday Sep 7, 2023 •

edited

jwir3 Sep 8, 2023

tmeasday Sep 7, 2023

jwir3 commented Sep 8, 2023

codykaup commented Sep 11, 2023

tmeasday commented Sep 11, 2023 •

edited

jwir3 commented Sep 13, 2023 •

edited

tmeasday left a comment

tevanoff left a comment

thafryer commented Sep 14, 2023

		private unfulfilledRequests: Map<string, Promise<any>>;

		private unsettledResolvers: Map<string, any>;

Wait for all resources to load before indicating Watcher is idle #16

Wait for all resources to load before indicating Watcher is idle #16

Conversation

jwir3 commented Sep 7, 2023 • edited by thafryer

What Changed

How to test

Change Type

linear bot commented Sep 7, 2023

tmeasday left a comment

Choose a reason for hiding this comment

tmeasday Sep 7, 2023

Choose a reason for hiding this comment

jwir3 Sep 8, 2023

Choose a reason for hiding this comment

tmeasday Sep 7, 2023

Choose a reason for hiding this comment

jwir3 Sep 8, 2023

Choose a reason for hiding this comment

tmeasday Sep 7, 2023 • edited

Choose a reason for hiding this comment

jwir3 Sep 8, 2023

Choose a reason for hiding this comment

tmeasday Sep 7, 2023

Choose a reason for hiding this comment

jwir3 commented Sep 8, 2023

codykaup commented Sep 11, 2023

tmeasday commented Sep 11, 2023 • edited

jwir3 commented Sep 13, 2023 • edited

tmeasday left a comment

Choose a reason for hiding this comment

tevanoff left a comment

Choose a reason for hiding this comment

thafryer commented Sep 14, 2023

jwir3 commented Sep 7, 2023 •

edited by thafryer

tmeasday Sep 7, 2023 •

edited

tmeasday commented Sep 11, 2023 •

edited

jwir3 commented Sep 13, 2023 •

edited