Improve Callback Graph #1179

jjaraalm · 2020-04-04T16:52:08Z

Attempt to improve and significantly expand the features available in the callback graph to improve its usefulness in debugging and testing. Will try to cover the ideas given in #1176. Comments/suggestions would be welcome, especially for styling.

Current Status

Changes

Replaced viz.js with cytoscape.js via react-cytoscape. This results in some minor visual and layout changes.
CallbackGraphContainer now subscribes to layout and paths so it can introspect the current state.
Added new reducers to compute profiling information and report all changed props (not just those that are inputs).

Issues

State layout is finicky. dagre was giving horrible results even when cycles were pruned away. Using breadthfirst for now, but layout properties may need to be tailored to viewport size instead of being fixed.
Outputs that return no_update are still being flashed during execution highlighting. No way to detect them right now.

Contributor Checklist

optionals

I have added entry in the CHANGELOG.md
If this PR needs a follow-up in dash docs, community thread, I have mentioned the relevant URLS as follow
- this github #PR number updates the dash docs
- here is the show and tell thread in plotly dash community

chriddyp · 2020-04-04T18:19:03Z

Love where this is going @jjaraalm !

nicolaskruchten · 2020-04-05T13:04:27Z

Amazing work! I’m happy to see GraphVis go in favour of a more modern alternative that enables rich interactions :) thanks for doing this!

jjaraalm · 2020-04-05T18:05:45Z

@chriddyp @nicolaskruchten Thanks! It's a fun project, and I think it could be useful too!

chriddyp · 2020-04-06T03:21:06Z

That profiling screenshot is so good!

It would be cool if we could time the callback execution on the backend and pass that time delta up with the _update-component API call. Then, we could distinguish between "total time", "time processing backend", "time over network", "time to render".Then, folks could see if simple callbacks on the backend are causing a delay because they're sending a tremendous amount of data over the wire (which would be a great candidate for a clientside callback!) vs a complex backend task that sends a simple payload. I think we'd have to expand the _update-component API response to include extra meta information beyond just the component props to do this.

jjaraalm · 2020-04-09T13:04:56Z

Agreed, a better breakdown of the various timings would be helpful. Especially since right now, I'm only looking at the delay between request and response. Even simple callbacks (repeatedly clicking a button) can look expensive if they happen often and the queue gets backed up. This shouldn't be too hard to do and I'll look into it. Glad you all are interested!

chriddyp · 2020-04-09T23:48:05Z

Awesome stuff @jjaraalm ! Feel free to ping me if you have any questions 👍

chriddyp · 2020-04-09T23:48:51Z

(FYI I am assigning myself just to denote to the rest of our team that I'll be your ambassador for this pull request)

jjaraalm · 2020-04-11T16:48:35Z

@chriddyp sounds good.

I've added support for reporting server-side timings using Server-Timing headers. This way (in modern browsers) we get timing reported both in dev-tools on a per-request basis and through introspection as an aggregate total.

Users can tag and report timings for resources using a new API like:

@app.callback(Output('slow-store', 'data'), [Input('style-output', 'style'), Input('out', 'children')])
def combine_data(style, value):

    time.sleep(0.1)
    dash.callback_context.record_timing('task_1', 0.1, 'The first task.')

    time.sleep(0.7)
    dash.callback_context.record_timing('task_2', 0.7)

    time.sleep(0.2)
    dash.callback_context.record_timing('task_3', 0.2, 'Cleanup task.')
    
    return dict(style=style, value=value)

which translates into the HTTP headers:

Server-Timing: dash_total;dur=1002
Server-Timing: task_1;desc="The first task.";dur=100
Server-Timing: task_2;dur=700
Server-Timing: task_3;desc="Cleanup task.";dur=200

The dash_total resource is automatically computed on all requests.

Distinguishing between "queue time" and "network time" seems difficult and I'm not quite sure how to do it. The difference between totalTime - totalComputeTime should be the combination of both though. I'll take a look into rendering time next.

chriddyp · 2020-04-14T20:50:57Z

I've added support for reporting server-side timings using Server-Timing headers.

Brilliant idea! I was not aware of these headers before now, this is very nice.

Re dash_total: this is very nice! Let's make this configurable via an argument like dev_tools_clientside_profiling so that it is only included if debug=True and then turned off when deployed with e.g. gunicorn. This is for security reasons as it's unadvised to share any information about the backend to the front-end in production.

Distinguishing between "queue time" and "network time" seems difficult and I'm not quite sure how to do it. The difference between totalTime - totalComputeTime should be the combination of both though.

Yeah good point, this is a tricky one to record but also to communicate to our users.

I suppose that there are two queues:

Browser queue: The browser will only send around 6 (https://stackoverflow.com/questions/561046/) requests to the server at a time, so if a user has a Dash app with 60 charts, 10 batches of requests will be made.
The server itself will queue things up, whether that's being done with flask or gunicorn or a load balancer.

So, when we display totalTime - totalComputeTime to our users, I suppose we label this like "Network & Request Queue Time".

In some cases, the queuing time will dominate the time. In most cases, I think the payload size itself will dominate. So, another metric we could display could be the size of the request. We can get this from the content-length header. It would be nice to display this in shorthand like kb or mb

I'll take a look into rendering time next.

Thinking a bit about this now, this one actually might be pretty tricky. I think the main components that a user might care about is the graph component as will handle the most amount of data on the page. We do asynchronous rendering here, so even if we knew when React "finished" rendering the component, we wouldn't be able to capture the actual rendering time.

This might be a rabbit hole, so feel free to stay out of it for now, I think the other performance breakdowns will be the most valuable for our users!

Playing around with this locally, another thing that comes to mind is conditionally re-rendering the callback tree based off of which components are visible on the page. So, if you have a large multi-page application, you would only see the callbacks that are associated with outputs or inputs that are currently rendered on the page. This data is available in the paths object in the store.

We merged & released some deeper changes to the dash-renderer code in #1103. It looks like this introduced some conflicts in this PR, sorry about that!

This is looking really great otherwise and I know the community will love it. Are you interested in continuing to add new features to this or would you be looking to try to merge this in soon and follow up with more features? Either way is fine!

jjaraalm · 2020-04-16T17:04:15Z

@chriddyp I'm in no rush to get this merged in. I'd love to add more features, but just need to find time to work on it at some point. I didn't realize that Graphs were async! Good point, I was just going to try and monitor to see when react reported everything complete. It would be nice to know, but sounds like we should postpone it for now.

Re: #1103, I've been waiting on that so I'm more than happy to accommodate. Interestingly, I noticed that #1103 (or some other change in dev) messes with the viz.js callback graph and makes it very unstable for me. Looks like it's being continually rebuilt and there's some non-deterministic code (probably in viz.js) that jiggles the layout. Not sure if anyone's opened a bug report on it.

alexcjohnson · 2020-04-16T21:50:47Z

Yep, that was me in #1103 mucking with the existing callback graph implementation - had to tweak it a bit in order to not totally choke on the dict IDs. Sorry about the merge conflicts! Feel free to file a bug report about the instability you're seeing, but my personal feeling is what you have here is already such an improvement that we should work to get it merged without adding any more features than it already has; then we can add all these other ideas in future PRs.

chriddyp · 2020-04-17T05:21:02Z

but my personal feeling is what you have here is already such an improvement that we should work to get it merged without adding any more features than it already has; then we can add all these other ideas in future PRs.

I agree! The work done in this PR already improves things 10x, I would love to see the community start to use this new version 👍

nicolaskruchten · 2020-04-17T13:35:57Z

I'd love to see a screenshot of what this looks like when multiple props are involved in a single component, similar to the cities-dropdown here: https://github.com/nicolaskruchten/dash_callback_chain/blob/master/example.png

rpkyle · 2020-08-15T16:37:51Z

@chriddyp sounds good.

I've added support for reporting server-side timings using Server-Timing headers. This way (in modern browsers) we get timing reported both in dev-tools on a per-request basis and through introspection as an aggregate total.
Server-Timing: dash_total;dur=1002
Server-Timing: task_1;desc="The first task.";dur=100
Server-Timing: task_2;dur=700
Server-Timing: task_3;desc="Cleanup task.";dur=200
The dash_total resource is automatically computed on all requests.

@jjaraalm Thanks for all your efforts to improve the callback graph implementation -- the enhanced context provided by this PR will be very helpful for debugging purposes.

One brief question about dash_total; I see it in Server-Timing above, but I'm unsure where it's added to the headers. Will this happen elsewhere, or might we want to include something like the following just before for name, info ...?

response.headers.add("Server-Timing", 'dash_total;dur={}'.format(dash_total["dur"]))

alexcjohnson · 2020-08-15T18:53:35Z

I see it in Server-Timing above, but I'm unsure where it's added to the headers.

@rpkyle the current implementation actually has this called __dash_server - it's created here, finalized here, and processed along with the custom times here.

Marc-Andre-Rivet

Seems good except for the linting failure 💃

alexcjohnson · 2020-08-30T03:45:23Z

dash/testing/browser.py

+                    c.parentElement.removeChild(c);
+                });
+            """
+            )


New feature for dash.testing - percy_snapshot adds a kwarg convert_canvases that turns every <canvas> into an <img> with matching contents, then puts the canvases back afterward. This lets us capture Cytoscape, among other things (cc @xhlulu)

Sizing isn't quite as expected, but I assume that's just Percy rendering as a different screen size from what's used in circleci.

alexcjohnson · 2020-08-30T13:47:43Z

dash-renderer/src/components/error/CallbackGraph/CallbackGraphContainer.react.js

+        name: 'cose',
+        padding: 10,
+        animate: false
+    }


@nicolaskruchten I couldn't come up with a single layout that always seemed better than the others, so made a dropdown to choose among three. Another side benefit of this is if you drag nodes around to tweak the layout, and you want to get back to the original, you can change to a different algo and then back to the original.

alexcjohnson · 2020-08-30T13:51:38Z

tests/integration/devtools/test_callback_timing.py

+from dash.dependencies import Output, Input
+
+
+def test_dvct001_callback_timing(dash_thread_server):


@rpkyle this test should cover parity between Python and R for the Server-Timing header, including record_timing.

Jonathan Jara-Almonte added 2 commits April 3, 2020 20:41

Move callback graph to cytoscape and add clientside coloring

6d9cdfd

Update styling and add live component and callback introspection

18539a2

jjaraalm requested review from alexcjohnson and Marc-Andre-Rivet as code owners April 4, 2020 16:52

Jonathan Jara-Almonte added 2 commits April 4, 2020 13:22

Add State to the callback graph with dashed lines

45c3b98

Remove redundant value fields in introspection data

5d99381

Change to BFS layout for better State support

42c7664

Jonathan Jara-Almonte added 3 commits April 5, 2020 14:01

Add store and reducers for callback profiling and change notifications

d4652ca

Connect CallbackGraph to profiling and change notifications

5a4b996

Add basic profiling and execution highlighting

221d4ba

chriddyp self-requested a review April 9, 2020 23:47

chriddyp self-assigned this Apr 9, 2020

Jonathan Jara-Almonte added 3 commits April 10, 2020 19:58

Cleanup effects and highlight the sub-callback graph on select.

a89994b

Add support for Server-Timing headers and resource timing API

1fbd48c

Fix error when callback selected before first run.

147d3be

Switch to react-json-tree because of bugs

2e2215c

Update JSON styling

6390f01

Merge branch 'dev' into callback_graph

fe759e8

rpkyle self-requested a review August 15, 2020 16:27

rpkyle mentioned this pull request Aug 15, 2020

Add support for callback graph improvements and timing plotly/dashR#224

Merged

1 task

black

f6ac62b

Marc-Andre-Rivet approved these changes Aug 18, 2020

View reviewed changes

alexcjohnson added 5 commits August 19, 2020 19:58

fiddling with cb graph layout

ef7b60c

Merge branch 'dev' into callback_graph

75160ce

user-selectable callback graph layout algos

65e14c5

callback graph tests

6ee83d5

Merge branch 'dev' into callback_graph

18035a7

alexcjohnson reviewed Aug 30, 2020

View reviewed changes

py2 test fix?

4b10e65

alexcjohnson reviewed Aug 30, 2020

View reviewed changes

alexcjohnson and others added 8 commits September 2, 2020 23:12

Merge branch 'dev' into callback_graph

02bbe5f

fix changelog

fd28d05

Update CHANGELOG.md

ec345d3

fix bad merge on requestedCallbacks.ts

8af36d5

trigger build

fdc662e

update dash-test-components lock file

a710e5f

trigger build

607773d

trigger build

c065fd2

Marc-Andre-Rivet merged commit 5e717f1 into plotly:dev Sep 3, 2020

Marc-Andre-Rivet added feature size: 2 labels Sep 14, 2020

Marc-Andre-Rivet added this to the OSS milestone Sep 14, 2020

almarklein mentioned this pull request Nov 4, 2020

Reduce noise in the callback graph plotly/dash-slicer#9

Closed

alexcjohnson mentioned this pull request Dec 8, 2020

Callback map sharing #789

Closed

9 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Callback Graph #1179

Improve Callback Graph #1179

jjaraalm commented Apr 4, 2020 •

edited by alexcjohnson

Loading

chriddyp commented Apr 4, 2020

nicolaskruchten commented Apr 5, 2020

jjaraalm commented Apr 5, 2020

chriddyp commented Apr 6, 2020

jjaraalm commented Apr 9, 2020

chriddyp commented Apr 9, 2020

chriddyp commented Apr 9, 2020

jjaraalm commented Apr 11, 2020

chriddyp commented Apr 14, 2020

jjaraalm commented Apr 16, 2020

alexcjohnson commented Apr 16, 2020

chriddyp commented Apr 17, 2020

nicolaskruchten commented Apr 17, 2020

rpkyle commented Aug 15, 2020

alexcjohnson commented Aug 15, 2020

Marc-Andre-Rivet left a comment

alexcjohnson Aug 30, 2020

alexcjohnson Aug 30, 2020

alexcjohnson Aug 30, 2020

		from dash.dependencies import Output, Input


		def test_dvct001_callback_timing(dash_thread_server):

Improve Callback Graph #1179

Improve Callback Graph #1179

Conversation

jjaraalm commented Apr 4, 2020 • edited by alexcjohnson Loading

Current Status

Changes

Issues

Contributor Checklist

optionals

chriddyp commented Apr 4, 2020

nicolaskruchten commented Apr 5, 2020

jjaraalm commented Apr 5, 2020

chriddyp commented Apr 6, 2020

jjaraalm commented Apr 9, 2020

chriddyp commented Apr 9, 2020

chriddyp commented Apr 9, 2020

jjaraalm commented Apr 11, 2020

chriddyp commented Apr 14, 2020

jjaraalm commented Apr 16, 2020

alexcjohnson commented Apr 16, 2020

chriddyp commented Apr 17, 2020

nicolaskruchten commented Apr 17, 2020

rpkyle commented Aug 15, 2020

alexcjohnson commented Aug 15, 2020

Marc-Andre-Rivet left a comment

Choose a reason for hiding this comment

alexcjohnson Aug 30, 2020

Choose a reason for hiding this comment

alexcjohnson Aug 30, 2020

Choose a reason for hiding this comment

alexcjohnson Aug 30, 2020

Choose a reason for hiding this comment

jjaraalm commented Apr 4, 2020 •

edited by alexcjohnson

Loading