ref(replay): refactor replayerStepper calls to be memoized #74606

michellewzhang · 2024-07-19T22:30:22Z

closes https://github.com/getsentry/team-replay/issues/450
this is the branch off from ref(replay): refactor replayerStepper to be called less #74540, the experimental refactor of replayerStepper
TLDR: memoize the replayerStepper calls so that we remember the results and don't have to reload the data every time.

current state of things:
we have distinct replayerStepper calls in the breadcrumbs + memory tabs. in total, we could be calling it 3 times because it's used by
(1) the breadcrumbs tab to render the HTML snippet,
(2) for hydration error text diffs,
(3) and the DOM nodes chart.

the plan:
improve things by calling replayerStepper once (or at least less) and memoize the results. side note: it was hard to combine the hydration error text diff replayerStepper call with the others due to the types, so we had to keep that call separate.

the question:
is it faster to run one replayerStepper instance, and collect all the data at one time (iterate over the frames exactly once)? this means the speed of any tab will always be limited by the slower call, since we're going through ALL the data at once, even some that we might not need yet. like this:

sentry/static/app/utils/replays/replayReader.tsx

Lines 363 to 366 in 70c9054

    
           visitFrameCallbacks: { 
        
             extractDomNodes, 
        
             countDomNodes: countDomNodes(this.getRRWebMutations()), 
        
           },

or is it better to continue to have 2 stepper instances that iterate over 2 different sets of frames (breadcrumbs loops over hundreds of frames, and DOM node count iterates over thousands, which means the speeds could be drastically different). in this situation, each tab would handle their own array of necessary frames.

the experiments:
the first PR (#74540) tried to explore a refactor where replayerStepper is called once for the two DOM node actions (counting, for the memory chart, and extracting, for the breadcrumbs HTML). changes were made to the stepper so it could accept a list of callbacks, and return all types of data in one shot, so we only need one hidden replayer instance on the page for the DOM node functions.

unscientifically, it seemed like loading breadcrumbs + memory took the same amount of time as loading the memory chart. (see video below -- as soon as the breadcrumbs tab is done loading, the memory tab is already done).

Screen.Recording.2024-07-19.at.11.49.24.AM.mov

this makes sense because we’re iterating over thousands of frames for the DOM nodes chart, so that’s the bottleneck. this means the breadcrumbs tab loads slower.

this PR:
compared to that is the approach where we have 1 stepper instance for breadcrumbs, and one for DOM node count, each iterating over their own lists (which is what i'm doing in this PR). this approach showed breadcrumb data on the screen about 2x as fast as the approach above. therefore 2 stepper instances on the screen is better for users, especially since breadcrumbs tab is more popular than the memory tab.

the video below demonstrates the memoization in action. once the breadcrumbs or memory tabs are loaded, switching back to them does not cause re-loading, because the results are cached. notice that the breadcrumbs tab loading is not dependent on the loading of the DOM tab, unlike the video above where the breadcrumbs tab has to "wait" for the memory tab. (i'm also on slower wifi in this clip below)

Screen.Recording.2024-07-19.at.3.28.54.PM.mov

michellewzhang · 2024-07-19T22:35:31Z

static/app/views/replays/detail/memoryPanel/index.tsx

 import DomNodesChart from 'sentry/views/replays/detail/memoryPanel/domNodesChart';
 import MemoryChart from 'sentry/views/replays/detail/memoryPanel/memoryChart';

-function useCountDomNodes({replay}: {replay: null | ReplayReader}) {


moved to its own file, useCountDomNodes.tsx

michellewzhang · 2024-07-19T22:36:12Z

static/app/utils/replays/hooks/useExtractedPageHtml.tsx

renamed to useExtractPageHtml.tsx

michellewzhang · 2024-07-19T22:36:25Z

static/app/utils/replays/hooks/useExtractedDomNodes.tsx

renamed to useExtractDomNodes.tsx

michellewzhang · 2024-07-19T22:37:48Z

static/app/utils/replays/extractHtml.tsx

-export default function extractDomNodes({
-  frames,
-  rrwebEvents,
-  startTimestampMs,
-}: Args): Promise<Map<ReplayFrame, Extraction>> {
-  return replayerStepper({
-    frames,
-    rrwebEvents,
-    startTimestampMs,
-    shouldVisitFrame: frame => {
-      const nodeId = getNodeId(frame);
-      return nodeId !== undefined && nodeId !== -1;
-    },
-    onVisitFrame: (frame, collection, replayer) => {
-      const mirror = replayer.getMirror();
-      const nodeId = getNodeId(frame);
-      const html = extractHtml(nodeId as number, mirror);
-      collection.set(frame as ReplayFrame, {
-        frame,
-        html,
-        timestamp: frame.timestampMs,
-      });
-    },
-  });
-}


moved to replayReader.tsx, so i renamed this file

michellewzhang · 2024-07-19T22:38:14Z

static/app/utils/replays/countDomNodes.tsx

moved to replayReader.tsx, exported type was moved into the corresponding hook

billyvg

What a great PR write-up!

billyvg · 2024-07-23T18:59:12Z

It might be nice to have a loading indicator for the dom node chart

michellewzhang · 2024-07-23T19:11:54Z

It might be nice to have a loading indicator for the dom node chart

@billyvg we had a placeholder before but seems our isFetching check no longer works. pushing a fix now! lmk if you have a strong preference for loading indicator vs placeholder

Screen.Recording.2024-07-23.at.12.12.23.PM.mov

billyvg · 2024-07-23T19:31:31Z

static/app/utils/replays/replayerStepper.tsx

        window.setTimeout(() => {
          const timestamp =
            'offsetMs' in frame ? frame.offsetMs : frame.timestamp - startTimestampMs;
          replayer.pause(timestamp);
        }, 0);


would RAF be better here @ryan953?

we could try it, and see if the memory chart renders the same as before.

also, we could experiment with different granularity in the memory chart.

Here's an answer i found on the differences: https://stackoverflow.com/questions/38709923/why-is-requestanimationframe-better-than-setinterval-or-settimeout

seems to be the same behavior after using RAF so i pushed up the change

billyvg · 2024-07-23T19:32:30Z

It might be nice to have a loading indicator for the dom node chart

@billyvg we had a placeholder before but seems our isFetching check no longer works. pushing a fix now! lmk if you have a strong preference for loading indicator vs placeholder

Screen.Recording.2024-07-23.at.12.12.23.PM.mov

Placeholder looks great!

michellewzhang added 10 commits July 18, 2024 16:03

ref(replay): refactor replayerStepper to be called once

7782d24

♻️ ref

e31f109

♻️ consolidate & standardize hooks

ec15b19

🚚 rename files

f3154c6

♻️ some types

af73b5a

🏷️ types

873e5c9

♻️ ref so it works

38867b2

♻️ clean up

70c9054

ref(replay): refactor replayerStepper calls to be memoized

d4190ce

♻️ types

c0f6811

michellewzhang requested a review from a team as a code owner July 19, 2024 22:30

github-actions bot added the Scope: Frontend Automatically applied to PRs that change frontend components label Jul 19, 2024

michellewzhang mentioned this pull request Jul 19, 2024

ref(replay): refactor replayerStepper to be called less #74540

Closed

michellewzhang requested a review from billyvg July 19, 2024 22:35

michellewzhang commented Jul 19, 2024

View reviewed changes

static/app/utils/replays/hooks/useExtractedPageHtml.tsx

Copy link

Member Author

michellewzhang Jul 19, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

renamed to useExtractPageHtml.tsx

michellewzhang commented Jul 19, 2024

View reviewed changes

static/app/utils/replays/hooks/useExtractedDomNodes.tsx

Copy link

Member Author

michellewzhang Jul 19, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

renamed to useExtractDomNodes.tsx

michellewzhang commented Jul 19, 2024

View reviewed changes

ryan953 approved these changes Jul 19, 2024

View reviewed changes

🏷️ TYPO

8f160ef

vercel bot deployed to Preview July 19, 2024 22:42 View deployment

billyvg approved these changes Jul 23, 2024

View reviewed changes

♻️ new isLoading

fb68487

michellewzhang force-pushed the mz/replayer-ref-v2 branch from b382322 to fb68487 Compare July 23, 2024 19:14

vercel bot deployed to Preview July 23, 2024 19:17 View deployment

billyvg reviewed Jul 23, 2024

View reviewed changes

♻️ use RAF

e27a72d

vercel bot deployed to Preview July 23, 2024 20:22 View deployment

michellewzhang merged commit 11e8424 into master Jul 23, 2024

michellewzhang deleted the mz/replayer-ref-v2 branch July 23, 2024 20:39

github-actions bot locked and limited conversation to collaborators Aug 8, 2024

michellewzhang restored the mz/replayer-ref-v2 branch August 22, 2024 17:45

michellewzhang deleted the mz/replayer-ref-v2 branch August 29, 2024 20:44

	visitFrameCallbacks: {
	extractDomNodes,
	countDomNodes: countDomNodes(this.getRRWebMutations()),
	},

Uh oh!

ref(replay): refactor replayerStepper calls to be memoized #74606

ref(replay): refactor replayerStepper calls to be memoized #74606

Uh oh!

Conversation

michellewzhang commented Jul 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

michellewzhang Jul 19, 2024

Choose a reason for hiding this comment

Uh oh!

michellewzhang Jul 19, 2024

Choose a reason for hiding this comment

Uh oh!

michellewzhang Jul 19, 2024

Choose a reason for hiding this comment

Uh oh!

michellewzhang Jul 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

michellewzhang Jul 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

billyvg left a comment

Choose a reason for hiding this comment

Uh oh!

billyvg commented Jul 23, 2024

Uh oh!

michellewzhang commented Jul 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

billyvg Jul 23, 2024

Choose a reason for hiding this comment

Uh oh!

ryan953 Jul 23, 2024

Choose a reason for hiding this comment

Uh oh!

ryan953 Jul 23, 2024

Choose a reason for hiding this comment

Uh oh!

michellewzhang Jul 23, 2024

Choose a reason for hiding this comment

Uh oh!

billyvg commented Jul 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

michellewzhang commented Jul 19, 2024 •

edited

Loading

michellewzhang Jul 19, 2024 •

edited

Loading

michellewzhang Jul 19, 2024 •

edited

Loading

michellewzhang commented Jul 23, 2024 •

edited

Loading