dataflow: add dataflow rendering to EXPLAIN plans #7301

maddyblue · 2021-07-03T20:59:36Z · edited by uce

If a view has an index on a column, a point select on that column is significantly faster, but EXPLAIN does not surface the use of the index. We should add this information to EXPLAIN: in this case, the rendered dataflow would show an index instead of a scan. With this information, users could improve their queries on their own.
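A minimal sketch of the scenario described above. The table, view, index, and column names are hypothetical and chosen only for illustration; no plan output is shown, since the exact output is not given in this issue.

```sql
-- Hypothetical schema for illustration only.
CREATE TABLE orders (customer_id int, amount int);
CREATE VIEW orders_view AS SELECT customer_id, amount FROM orders;

-- Index the view on the column used by the point select.
CREATE INDEX orders_view_customer_idx ON orders_view (customer_id);

-- Point select on the indexed column; this is the query that benefits from the index.
SELECT * FROM orders_view WHERE customer_id = 42;

-- Plan for the same query. Per this issue, the plan does not indicate
-- whether the index is used instead of a scan.
EXPLAIN SELECT * FROM orders_view WHERE customer_id = 42;
```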

Update: 2022/02/08

We added a rudimentary version of this in #8515. We held off on documenting it because it is very verbose and we expected to follow up sooner. We didn't follow up and it's still verbose, but it has already come in handy a few times, so I'm proposing the following sub-tasks:

- Rename EXPLAIN PHYSICAL PLAN to EXPLAIN DATAFLOW PLAN
- Add docs under docs/sql/explain following the existing structure (sections "Reading ...", "Operators in", and an extra warning that it's experimental)

There were various discussions in the past about taking a holistic look at EXPLAIN output, but we didn't find a proper owner (cc @JLDLaughlin @andrioni). I think it's fine to do the minimal things above even if we expect things to change in the future.

uce · 2021-08-05T13:22:18Z

This is mostly ready on the dataflow side but requires adding the SQL commands and pretty printing (we probably need to discuss what we actually want in there).

uce · 2021-11-03T14:41:00Z

The initial work has been done, but we'll keep this open to scope out the follow-up tasks.

aalexandrov · 2021-11-04T16:45:54Z

@uce: can you add labels for the areas where the outstanding tasks fall? I've tentatively added A-optimization here, but I'm not sure whether there is anything more to be done there.

aalexandrov · 2022-10-04T17:36:26Z

A variant of this was done in #13137.

maddyblue added the C-feature Category: new feature or request label Jul 3, 2021

uce added this to Needs Triage in Compute Aug 5, 2021

uce moved this from Needs Triage to Icebox in Compute Aug 5, 2021

uce added this to the 1.0 milestone Aug 5, 2021

uce mentioned this issue Aug 24, 2021

[dataflow] Choose most expressions as key for Get #8013

Merged

uce moved this from Icebox to To do in Compute Sep 10, 2021

uce assigned asenac Sep 10, 2021

asenac mentioned this issue Oct 4, 2021

Explain physical plan #8515

Merged

2 tasks

aalexandrov added the A-optimization Area: query optimization and transformation label Nov 4, 2021

asenac removed their assignment Nov 19, 2021

uce mentioned this issue Nov 29, 2021

Have EXPLAIN distinguish rendering difference between temporal and non-temporal filters #6201

Open

heeringa removed this from the 1.0 milestone Mar 29, 2022

aalexandrov mentioned this issue Jun 17, 2022

[Epic] Add more user-relevant information to explain plans #13138

Closed

12 tasks

aalexandrov mentioned this issue Sep 2, 2022

Cleanup tasks after the new EXPLAIN code is complete #13299

Closed

1 task

antiguru removed this from To do in Compute Sep 15, 2022

aalexandrov closed this as completed Oct 4, 2022
