Add support for inline stacks #3556

mstange · 2021-09-16T02:04:47Z

Firefox profile: Without inline stacks / With inline stacks
Rust profile: Without inline stacks / With inline stacks

To try this out, you'll need to use profiler-symbol-server at the moment - inline stacks don't work yet with the Mozilla symbolication API or when profiling local builds. Here's how you can try it out:

Make sure profiler-symbol-server is installed, or run cargo install profiler-symbol-server.
In Firefox, start the profiler, and record for a bit.
Open the browser console and execute await Services.profiler.dumpProfileToFileAsync("/Users/mstange/Desktop/gfx-profile.json") (with an appropriate path)
Now you have a raw, unsymbolicated gecko profile.
To see what happens with today's profiler.firefox.com, run profiler-symbol-server path/to/profile.json. Then stop profiler-symbol-server with Ctrl+C.
To see what happens with the deploy proview, run PROFILER_URL="https://deploy-preview-3556--perf-html.netlify.app" profiler-symbol-server path/to/profile.json.

Overview

This PR consists of three big pieces:

The profile format change which lets us represent inline frames in the profile.
The symbolication change to accept a new inlines array per address from the symbolication API and to take this information into account during symbolication.
A visual annotation in the call tree, to differentiate inlined function calls from non-inlined function calls.

The format

The following is true today, before this PR:

A single code address (e.g. 0x1234) maps to a single function.
For native code, for each library, every address corresponds to a single frame in the frameTable.
Multiple frames from the same function share the same func, but they'll all have different addresses. A frame also has a line number - the line in the function that generated the instruction at the frame's address.

But with inline stacks, a single address can now map to more than one function, and more than one line number. Should we still have a single frame per address? If we want to keep a single frame per address, where would we store the list of functions?

Alternative 1 (unimplemented): Keep one frame per address, add side-table for inline information

This is not the format I chose to implement. I'm describing it here anyway because it's an alternative I considered at one point during development, and I think it's valuable to record for posterity.

We could replace the frameTable's func column with an inlineStackInfo column, which would have an index into a new inlineStackInfoTable. This new table would have the following fields:

parentInlineStackInfo
func
line

This would mean that symbolication just needs to create the inlineStackInfoTable and can keep the frameTable mostly unchanged, just updating each frame's inlineStackInfo field. (Today it just updates the frame's func field.) And it could leave the stackTable unchanged, too. And then the actual expansion into multiple funcs would happen whenever we convert from a "stack" to a "call node".

One place where this expansion would need to happen would be when we create the CallNodeTable for the call tree. At the moment, we create the call tree by collapsing sibling "stack" nodes into the same "call" node, if those "stack" nodes share the same "func". With this extra table, creating the call tree would become a fair bit more complicated: Each frame would need to be expanded into its set of funcs with the help of the inlineStackInfoTable, and then call nodes would need to be created based on those expanded funcs.

Furthermore, all of our call tree transforms would need to be updated: For example, some of them have "call paths", but they operate on the stackTable, and matching the combination of stackTable + inlineStackInfoTable against a call path might be complicated. Furthermore, we have some transforms that change the tree structure of the stack table, for example the "Merge node" transform. This transform would now need to be able to operate both on the stack table and on the inlineStackInfoTable. This seems hard.

In general, we probably have a ton of places that make use of the frameTable's func column; all of them would need to be updated. This might be doable but would be rather invasive.

I'm also not sure if it would be a good format. The tree structure of the call tree is now in two places: In the stack table and the inlineStackInfoTable. It might be harder to understand during debugging.

I was too afraid to do this, so I chose a simpler implementation.

Alternative 2 (this is the one I chose): Have multiple frames per address, add an `inlineDepth` column to the `frameTable`

This is the approach I chose. Rather than insisting on only having one frame per address, I am now creating multiple frames for the same address: One frame for each "inline depth". Frames at the outer level have an inlineDepth of zero.

For example, let's say you have a frame with address 0x123. We look up 0x123 with the symbolication API, and get the following result:

{
  "inlines": [
    {
      "function": "profiler_thread_is_being_profiled_for_markers",
      "file": "ProfilerMarkers.h"
      "line": 129
    },
    {
      "function": "profiler_add_marker",
      "file": "ProfilerMarkers.h"
      "line": 149
    }
  ],
  "function": "BackgroundHangThread::NotifyActivity",
  "file": "toolkit/components/backgroundhangmonitor/BackgroundHangMonitor.cpp",
  "line": 255
}

There will be three funcs: "BackgroundHangThread::NotifyActivity @ toolkit/components/backgroundhangmonitor/BackgroundHangMonitor.cpp", "profiler_add_marker @ ProfilerMarkers.h", and "profiler_thread_is_being_profiled_for_markers @ ProfilerMarkers.h".
Now we will create three frames:

address: 0x123, inlineDepth: 0, line: 255, func: <"BackgroundHangThread::NotifyActivity">
address: 0x123, inlineDepth: 1, line: 149, func: <"profiler_add_marker">
address: 0x123, inlineDepth: 2, line: 129, func: <"profiler_thread_is_being_profiled_for_markers">

And the stackTable gets new stack nodes to connect these new frames up in the right ways.

And the samples table is updated to point to the "inner" stack nodes. Same for any other references to stacks elsewhere in the thread.

Comparison

Compared to the first alternative, this has the following advantages:

The format is much simpler.
A ton of existing code can stay the way it is, because the stack tree and the call node tree keep the same structure.

These advantages come at the expense of the simplicity of the symbolication code:

Symbolication is more complicated. Symbolication now needs to create new frames and change the tree structure in the stack table. It also needs to replace any stack references in the rest of the thread to be valid with the updated stack table.

I think this is the right trade-off.

Other challenges

There are a few other things that I needed to get right.

One of them is re-symbolication. Re-symbolication is a rare scenario, but it's worth getting right anyway. In this scenario, we want to throw out all the old information because it might have bad symbols and bad inlining data, and start from scratch based on the frame addresses. To do this, I'm throwing out all frames with inlineDepth > 0 at the start of symbolication, and I'm creating a "flattened" stack table that has the inline frames removed. This isn't very pretty, and it's necessary because symbolication now permanently mutates the structure of the stack table (but in a somewhat straightforward and non-destructive way).

Another challenge was about dealing with call paths in the redux state. The selected node, and the right-clicked node, and the expanded nodes are all stored as call paths, i.e. arrays of func indexes from the root. Once symbolication completes, some of the nodes in the call tree might be replaced by multiple functions. This means that the stored call paths need to be modified to take this into account. To avoid any call tree nodes from closing, we need to add some of the new nodes to the set of expanded nodes, if any of their descendant nodes are expanded.

All right, that's all I can think of for now.

Happy reviewing! Please review each commit individually.

codecov · 2021-09-16T02:08:41Z

Codecov Report

Merging #3556 (8607bb5) into main (aba7879) will decrease coverage by 0.12%.
The diff coverage is 79.18%.

❗ Current head 8607bb5 differs from pull request most recent head 02e1fad. Consider uploading reports for the commit 02e1fad to get more accurate results

@@            Coverage Diff             @@
##             main    #3556      +/-   ##
==========================================
- Coverage   88.81%   88.68%   -0.13%     
==========================================
  Files         261      259       -2     
  Lines       21745    21905     +160     
  Branches     5568     5598      +30     
==========================================
+ Hits        19313    19427     +114     
- Misses       2254     2294      +40     
- Partials      178      184       +6

Impacted Files	Coverage Δ
src/profile-logic/data-structures.js	`95.45% <ø> (ø)`
src/profile-logic/mozilla-symbolication-api.js	`21.68% <0.00%> (-4.41%)`	⬇️
src/profile-logic/process-profile.js	`91.22% <ø> (ø)`
src/profile-logic/symbol-store.js	`93.18% <ø> (ø)`
src/test/fixtures/profiles/call-nodes.js	`100.00% <ø> (ø)`
src/components/shared/TreeView.js	`77.63% <50.00%> (-0.24%)`	⬇️
src/profile-logic/profile-data.js	`90.48% <69.23%> (-0.08%)`	⬇️
src/reducers/profile-view.js	`95.08% <75.00%> (+0.18%)`	⬆️
src/profile-logic/symbolication.js	`87.27% <85.03%> (-3.79%)`	⬇️
src/profile-logic/call-tree.js	`93.30% <90.90%> (-0.12%)`	⬇️
... and 36 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update aba7879...02e1fad. Read the comment docs.

locales/en-US/app.ftl

canova

Wow, that's some solid work! Thanks for the PR, I like the new inline badge and it's great to have inline stacks!
I added some comments but I don't think there is any major blockers, they are mostly nits and some small things.

Also sorry about the delay in the reviews, it took me some time to concentrate and finish it.

One of them is re-symbolication. Re-symbolication is a rare scenario, but it's worth getting right anyway. In this scenario, we want to throw out all the old information because it might have bad symbols and bad inlining data, and start from scratch based on the frame addresses. To do this, I'm throwing out all frames with inlineDepth > 0 at the start of symbolication, and I'm creating a "flattened" stack table that has the inline frames removed. This isn't very pretty, and it's necessary because symbolication now permanently mutates the structure of the stack table (but in a somewhat straightforward and non-destructive way).

It's unclear to me how symbols server could return bad inlining data. Do you have anything specific in your mind or is this to mostly make sure (since it's new addition)?

src/types/profile.js

src/profile-logic/mozilla-symbolication-api.js

src/actions/receive-profile.js

src/reducers/profile-view.js

locales/en-US/app.ftl

res/img/svg/inlined-icon.svg

src/components/shared/TreeView.js

src/profile-logic/call-tree.js

src/test/store/symbolication.test.js

mstange · 2021-11-30T00:36:17Z

It's unclear to me how symbols server could return bad inlining data. Do you have anything specific in your mind or is this to mostly make sure (since it's new addition)?

"Bugs" :)

For example, there could be a bug in profiler-get-symbols where it returns incorrect filenames for some inlined frames. And then we fix the bug and want to see the same profile again with correct filenames, so we re-symbolicate.

Or we could have correct inlining data for most functions, but we might be missing function names for some addresses. Then we plug that hole so that we get function names for those addresses. If we want to re-symbolicate to pick up those new function names, we always re-symbolicate all addresses of that library, so the functions with inline information will be re-symbolicated, too. In that case we don't want to end up with duplicated inline frames.

mstange · 2021-11-30T00:37:35Z

Thanks for the review!

canova · 2021-11-30T11:58:31Z

"Bugs" :)

For example, there could be a bug in profiler-get-symbols where it returns incorrect filenames for some inlined frames. And then we fix the bug and want to see the same profile again with correct filenames, so we re-symbolicate.

Or we could have correct inlining data for most functions, but we might be missing function names for some addresses. Then we plug that hole so that we get function names for those addresses. If we want to re-symbolicate to pick up those new function names, we always re-symbolicate all addresses of that library, so the functions with inline information will be re-symbolicated, too. In that case we don't want to end up with duplicated inline frames.

I see, thanks :)

This column will be needed for inline call stack symbolication.

…ny call paths that are stored in the redux state.

…olication.

This patch is the last symbolication-related change in this series. With this patch, inline stacks are now fully supported.

This answers the question "Why wasn't this inlined!?" which some people who are used to the old inline-less call tree might have.

mstange self-assigned this Sep 16, 2021

mstange force-pushed the inline-stacks branch from 132159d to ee6a31a Compare October 6, 2021 16:21

mstange force-pushed the inline-stacks branch 5 times, most recently from 8ecea2d to eb6c2d4 Compare November 23, 2021 03:12

mstange requested a review from canova November 23, 2021 03:12

mstange marked this pull request as ready for review November 23, 2021 03:12

mstange requested a review from a team as a code owner November 23, 2021 03:12

flodolo reviewed Nov 23, 2021

View reviewed changes

locales/en-US/app.ftl Outdated Show resolved Hide resolved

mstange force-pushed the inline-stacks branch from eb6c2d4 to 1cab786 Compare November 23, 2021 15:50

mstange requested a review from flodolo November 23, 2021 15:52

flodolo approved these changes Nov 23, 2021

View reviewed changes

mstange force-pushed the inline-stacks branch 3 times, most recently from 456fcea to 8607bb5 Compare November 23, 2021 19:25

julienw mentioned this pull request Nov 26, 2021

Streamline from-browser initialization #3649

Merged

canova approved these changes Nov 29, 2021

View reviewed changes

mstange added 7 commits November 30, 2021 16:30

Add an inlineDepth column to the frame table.

d8b77c2

This column will be needed for inline call stack symbolication.

Add inlines field to response JSON definition.

c18e99d

Propagate inline information to symbolication.

b4fe808

Allow symbolication to replace a single func with multiple funcs in a…

09873e7

…ny call paths that are stored in the redux state.

Remove any existing inline frame stack nodes at the beginning of symb…

f3d7100

…olication.

Create the new frameTable earlier.

c902564

Resolve inline frames during symbolication.

adceb90

This patch is the last symbolication-related change in this series. With this patch, inline stacks are now fully supported.

mstange added 3 commits November 30, 2021 16:30

Add a sourceFramesInlinedIntoSymbol column to the CallNodeTable.

5c678c4

Add a badge to the call tree for inlined functions.

f754f68

This answers the question "Why wasn't this inlined!?" which some people who are used to the old inline-less call tree might have.

Add some test for inline symbolication.

02e1fad

mstange force-pushed the inline-stacks branch from 8607bb5 to 02e1fad Compare November 30, 2021 21:31

mstange enabled auto-merge November 30, 2021 21:35

mstange merged commit a7edd3d into firefox-devtools:main Nov 30, 2021

mstange mentioned this pull request Dec 7, 2021

[Deploy preview] Inline callstacks demo #2556

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for inline stacks #3556

Add support for inline stacks #3556

mstange commented Sep 16, 2021 •

edited

codecov bot commented Sep 16, 2021 •

edited

canova left a comment

mstange commented Nov 30, 2021

mstange commented Nov 30, 2021

canova commented Nov 30, 2021

Add support for inline stacks #3556

Add support for inline stacks #3556

Conversation

mstange commented Sep 16, 2021 • edited

Overview

The format

Alternative 1 (unimplemented): Keep one frame per address, add side-table for inline information

Alternative 2 (this is the one I chose): Have multiple frames per address, add an inlineDepth column to the frameTable

Comparison

Other challenges

codecov bot commented Sep 16, 2021 • edited

Codecov Report

canova left a comment

Choose a reason for hiding this comment

mstange commented Nov 30, 2021

mstange commented Nov 30, 2021

canova commented Nov 30, 2021

mstange commented Sep 16, 2021 •

edited

Alternative 2 (this is the one I chose): Have multiple frames per address, add an `inlineDepth` column to the `frameTable`

codecov bot commented Sep 16, 2021 •

edited