feat(MEP): Implement the run_query method by wmak · Pull Request #31652 · getsentry/sentry

wmak · 2022-02-08T00:32:41Z

This checks the orderby for each entity, and orders the queries so the entity with the most orderbys are first.
- If more than one orderby contains functions throw an error since the query is now impossible
This then uses the groupby to group the results of the up to 3 queries back together
This also implements the count_unique function so that we can have a test across 2 entities
Part 1 here

- This adds a MetricsQueryBuilder, which works very similarily to our QueryBuilder, but with specific handlers for how metrics construct queries - This MetricsQueryBuilder does not yet construct snql queries, and will not because table queries will require multiple queries to construct similar table data - that is, if we want [transaction, p95, count_unique(user)], we need a query against distributions with [transaction, p95] followed by a second query for [transaction, count_unique(user)] against the sets table - This is so we can maintain a sortby

- This checks the orderby for each entity, and orders the queries so the entity with the most orderbys are first. - If more than one orderby contains functions throw an error since the query is now impossible - This then uses the groupby to group the results of the up to 3 queries back together

wmak · 2022-02-10T19:02:51Z

Moving to draft, need to rework the conditions for 2nd+ queries

- This also adds better tests since the previous ones were not sufficient

k-fish

This is fine, but some of the code should be split into functions for clarity either now or in the future, and it could use more tests. Should also get SnS review for how the query is being built if they have a chance.

evanh

This idea makes sense. Couple questions. Also I don't know if you planned to add this now or were going to do it later, but I would think about how to instrument this, particularly the results aggregation.

evanh · 2022-02-14T20:57:33Z

+                                ],
+                            ),
+                            Op.IN,
+                            Function("tuple", groupby_values),


It seems like groupby_values is unbounded. Is there a limit to how big this list can be?

Good point, We probably don't want to allow doing the 10k limit on these table queries. I'll enforce a 51 limit on this builder for now since we only plan to use this for the performance table 👍

Also keep in mind there is a maximum size of the actual query you can build. So if you have 100k conditions, the SQL string will be too large for Clickhouse to process.

evanh · 2022-02-14T21:04:50Z

+                        groupby_values.append(groupby_key)
+                    value_map[value_map_key].update(row)
+                result["meta"] += current_result["meta"]
+        result["data"] = list(value_map.values())


Something I thought of: metrics makes no guarantees that there will be data for every aggregation for every group by. This is a contrived example, but there could be a p95 for transaction x but not unique users. Is your code supposed to handle that case?

yes 👍 I'll add an explicit test for this, but what should happen is that we'll get p95 for x, and nothing for count_unique.

Ah, very good point, i just realized for this scenario if we sort by unique users we'll get an unexpected result (x just won't show up at all), and I don't think there's a workaround either 🤔 Going to document this as a known deficiency for now.

wmak · 2022-02-14T22:21:21Z

re: instrumenting, I'll tackle that when its hooked up to an endpoint. I find it easier to do at that point since I'll be able to see the spans in context, thanks for the reminder! 🙏

- Adding a max limit to the metric query builder since we don't ever want to do the group by filtering on more than a reasonable number - Picking 51 for now since that aligns with tables in the UI (+1 for pagination) - Adding a test & skipped test with note for the case where the data is sparse between the tables (ie. only in distribution but not set)

Zylphrex · 2022-02-15T19:07:49Z

+        result = query.run_query("test_query")
+        assert len(result["data"]) == 2
+        assert result["data"][0] == {
+            "transaction": indexer.resolve("foo_transaction"),


The result here is an integer, right? Should it be mapped back to the string for the response?

TBH, still haven't decided if that makes more sense in a post-processing step, or here in the run_query function. Going to leave this for now and revisit when I connect this to an endpoint.

evanh · 2022-02-15T19:19:37Z

+                )
+                current_result = raw_snql_query(
+                    query,
+                    referrer,


I thought about this and I think I would do a separate referrer to delineate between the "primary" and secondary queries, since the conditions will change between those two types of queries.

👍 Good idea

wmak and others added 5 commits February 7, 2022 17:48

style(lint): Auto commit lint changes

25878ae

fix: typing

6df14e4

ref: Implement count_unique(user)

1349785

wmak requested a review from a team February 8, 2022 00:32

wmak mentioned this pull request Feb 8, 2022

feat(MEP): Add initial framework for metric queries #31649

Merged

wmak commented Feb 10, 2022

View reviewed changes

Comment thread src/sentry/search/events/builder.py

Base automatically changed from wmak/feat/metrics-query-builder to master February 10, 2022 18:57

visual-snapshot Bot requested a review from a team as a code owner February 10, 2022 18:57

wmak marked this pull request as draft February 10, 2022 19:02

fix: Need to group all the values by the group by not just txn

c801642

- This also adds better tests since the previous ones were not sufficient

vercel Bot deployed to Preview – storybook February 11, 2022 22:33 View deployment

vercel Bot deployed to Preview – sentry February 11, 2022 22:33 View deployment

Merge branch 'master' into wmak/feat/run-metrics-query

6411607

vercel Bot deployed to Preview – storybook February 11, 2022 22:40 View deployment

vercel Bot deployed to Preview – sentry February 11, 2022 22:40 View deployment

ref: Woops leftover from merge conflict

3d0924e

vercel Bot deployed to Preview – sentry February 11, 2022 22:44 View deployment

vercel Bot deployed to Preview – storybook February 11, 2022 22:44 View deployment

wmak marked this pull request as ready for review February 11, 2022 22:45

k-fish approved these changes Feb 11, 2022

View reviewed changes

Comment thread tests/sentry/search/events/test_builder.py Outdated

Comment thread src/sentry/search/events/builder.py Outdated

Comment thread src/sentry/search/events/builder.py

Comment thread src/sentry/search/events/builder.py

Comment thread tests/sentry/search/events/test_builder.py

ref: Addressing PR comments

9be718a

vercel Bot deployed to Preview – sentry February 14, 2022 18:43 View deployment

vercel Bot deployed to Preview – storybook February 14, 2022 18:43 View deployment

evanh reviewed Feb 14, 2022

View reviewed changes

vercel Bot deployed to Preview – storybook February 15, 2022 02:19 View deployment

vercel Bot deployed to Preview – sentry February 15, 2022 02:19 View deployment

wmak requested a review from evanh February 15, 2022 02:20

Zylphrex reviewed Feb 15, 2022

View reviewed changes

evanh reviewed Feb 15, 2022

View reviewed changes

ref: Updating referrer, removing accidental assert False

f7a327d

vercel Bot deployed to Preview – sentry February 17, 2022 18:41 View deployment

vercel Bot deployed to Preview – storybook February 17, 2022 18:41 View deployment

wmak enabled auto-merge (squash) February 17, 2022 18:45

Merge branch 'master' into wmak/feat/run-metrics-query

f55c81d

vercel Bot deployed to Preview – storybook February 18, 2022 16:26 View deployment

vercel Bot deployed to Preview – sentry February 18, 2022 16:26 View deployment

wmak merged commit 28405b1 into master Feb 18, 2022

wmak deleted the wmak/feat/run-metrics-query branch February 18, 2022 19:12

github-actions Bot locked and limited conversation to collaborators Mar 6, 2022

Uh oh!

Conversation

wmak commented Feb 8, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

wmak commented Feb 10, 2022

Uh oh!

k-fish left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

evanh left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wmak commented Feb 14, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

wmak commented Feb 8, 2022 •

edited

Loading

wmak commented Feb 14, 2022 •

edited

Loading