[multistage] add maxRowsInJoin, maxRowsInWindow, numGroups to query response#17784
[multistage] add maxRowsInJoin, maxRowsInWindow, numGroups to query response#17784dang-stripe wants to merge 1 commit intoapache:masterfrom
Conversation
Codecov Report❌ Patch coverage is Additional details and impacted files@@ Coverage Diff @@
## master #17784 +/- ##
============================================
+ Coverage 63.17% 63.22% +0.04%
Complexity 1454 1454
============================================
Files 3176 3183 +7
Lines 191025 191459 +434
Branches 29206 29273 +67
============================================
+ Hits 120688 121044 +356
- Misses 60920 60979 +59
- Partials 9417 9436 +19
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
I would like to look for other approaches instead of this one. I have the feeling we are bloating up the response with too many attributes that are either circumstancial or derivable from the query stats. I understand this means to move the processing to the client side, which may not be the best, but we can have other alternatives like some optional response decorator that takes tle stageStats and create new attributes based on that. With a system like this you can create your own decorators without increasing the already too large metadata response |
|
@gortiz replicating the logic for how these limits are enforced on the client seems like a significant burden. i believe stage stats is not a stable API yet either meaning this can break across releases. if that is the expectation, ideally there should be a canonical library for processing stage stats. i feel that the current approach supplements the existing fields in the query response ( |
summary
this addresses #17565
maxRowsInJoin,maxRowsInWindow,numGroupsto the query response so we can monitor how close queries are to their respective limits before queries start failing (throw overflow mode) or returning partial results (break overflow mode)numGroupsfor single stage.cc @Jackie-Jiang @gortiz @suvodeep-pyne
testing
exchange_ratesis a dim table.