[SPARK-28935][SQL][DOCS] Document SQL metrics for Details for Query Plan #25658
Test build #110033 has finished for PR 25658 at commit
docs/web-ui.md
Outdated
The metrics of SQL operators show in the block of operators. The SQL metrics can be useful when
we want to dive into the execution details of each operator, for example, how many rows are output
after a Filter operator. The related metrics are different for each type of operator, for example
Exchange has the metrics called "shuffle bytes writte total" which shows the number of bytes written
Nit: written ?
docs/web-ui.md
Outdated
<tr><td> <code>metadata time</code> </td><td> the time spent on getting metadata like number of partitions, number of files </td><td> FileSourceScan </td></tr>
<tr><td> <code>shuffle bytes written</code> </td><td> number of bytes written </td><td> CollectLimit, TakeOrderedAndProject, ShuffleExchange </td></tr>
<tr><td> <code>shuffle records written</code> </td><td> number of records written </td><td> CollectLimit, TakeOrderedAndProject, ShuffleExchange </td></tr>
<tr><td> <code>shuffle write time</code> </td><td> the time on shuffle writing </td><td> CollectLimit, TakeOrderedAndProject, ShuffleExchange </td></tr>
Nit: the time spent on writing shuffle data ?
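Each row in the hunk above pairs a metric name with its meaning and the operators that report it. As a rough illustration of that structure only, here is a minimal Python sketch that reads such rows into a lookup from metric to operators. The names `ROWS`, `MetricRows`, and `operators_by_metric` are hypothetical helpers invented for this example and are not part of Spark or of this PR; the row text is copied from the hunk as it stood at this point in the review.

```python
from html.parser import HTMLParser

# Rows copied from the reviewed hunk of docs/web-ui.md (wording as of this review round).
ROWS = """
<tr><td> <code>metadata time</code> </td><td> the time spent on getting metadata like number of partitions, number of files </td><td> FileSourceScan </td></tr>
<tr><td> <code>shuffle bytes written</code> </td><td> number of bytes written </td><td> CollectLimit, TakeOrderedAndProject, ShuffleExchange </td></tr>
<tr><td> <code>shuffle records written</code> </td><td> number of records written </td><td> CollectLimit, TakeOrderedAndProject, ShuffleExchange </td></tr>
<tr><td> <code>shuffle write time</code> </td><td> the time on shuffle writing </td><td> CollectLimit, TakeOrderedAndProject, ShuffleExchange </td></tr>
"""


class MetricRows(HTMLParser):
    """Hypothetical helper: collects the text of every <td> cell, grouped by <tr>."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._cells = None  # cells of the row currently being read
        self._text = None   # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._cells = []
        elif tag == "td":
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside a <td> cell.
        if self._text is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._text is not None and self._cells is not None:
            # Collapse runs of whitespace so " metadata time " becomes "metadata time".
            self._cells.append(" ".join("".join(self._text).split()))
            self._text = None
        elif tag == "tr" and self._cells is not None:
            self.rows.append(self._cells)
            self._cells = None


def operators_by_metric(html):
    """Map each metric name to the list of operators named in its third column."""
    parser = MetricRows()
    parser.feed(html)
    parser.close()
    return {
        name: [op.strip() for op in ops.split(",")]
        for name, _meaning, ops in parser.rows
    }


if __name__ == "__main__":
    for metric, ops in operators_by_metric(ROWS).items():
        print(f"{metric}: {', '.join(ops)}")
```

A lookup like this makes it easy to check that every operator listed for a metric is spelled consistently across rows, which is the kind of detail the review nits here are about.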
docs/web-ui.md
Outdated
@@ -363,6 +363,42 @@ number of written shuffle records, total data size, etc.
Clicking the 'Details' link on the bottom displays the logical plans and the physical plan, which
illustrate how Spark parses, analyzes, optimizes and performs the query.

### SQL metrics

The metrics of SQL operators show in the block of operators. The SQL metrics can be useful when
@viirya I'm a little confused by "show in the block of operators". Is there a way to reword this?
@viirya Looks good to me. I have some minor comments :-)
LGTM after a few minor updates!
thanks for updating! @gatorsmile
Test build #110263 has finished for PR 25658 at commit
Thanks! Merged to master.
### What changes were proposed in this pull request?

This patch adds descriptions of common SQL metrics to the web UI document.

### Why are the changes needed?

The current web UI document describes the query plan but does not describe the meaning of the SQL metrics, so end users might not understand what the metrics mean.

### Does this PR introduce any user-facing change?

No. This is just a documentation change.

### How was this patch tested?

Built the docs locally.

![image](https://user-images.githubusercontent.com/11567269/64463485-1583d800-d0b9-11e9-9916-141f5c09f009.png)

Closes apache#25658 from viirya/SPARK-28935.

Lead-authored-by: Liang-Chi Hsieh <viirya@gmail.com>
Co-authored-by: Xiao Li <gatorsmile@gmail.com>
Signed-off-by: Xiao Li <gatorsmile@gmail.com>