feat(ddm): Allow multiple use case ids to be queried at once and parallelize #66298

iambriccardo · 2024-03-05T09:44:06Z

This PR adds a new, fully parallelized implementation of fetching metrics meta. It was needed for two reasons:

- The previous implementation was slow.
- The previous implementation didn't support fetching multiple use case ids in a single request.

The new implementation parallelizes all data-fetching queries across entities and use case ids. It also maximizes parallelization when reverse-resolving metric ids.

Closes: #66126
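
As a rough illustration of the fan-out described above, here is a minimal sketch that runs one query per (use case id, entity) pair concurrently and groups the results by use case id. It is not the code from this PR: `run_query`, `fetch_all`, and the thread pool are assumptions made for the example, and the real implementation builds Snuba requests rather than calling a local function.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import product


def run_query(use_case_id: str, entity_key: str) -> tuple[str, str, list[int]]:
    """Hypothetical stand-in for one backend query; returns fake metric ids."""
    return use_case_id, entity_key, [len(use_case_id) + len(entity_key)]


def fetch_all(use_case_ids: list[str], entity_keys: list[str]) -> dict[str, list[int]]:
    """Run one query per (use case id, entity) pair concurrently, grouped by use case id."""
    pairs = list(product(use_case_ids, entity_keys))
    results: dict[str, list[int]] = {use_case_id: [] for use_case_id in use_case_ids}
    # max_workers must be at least 1, even when there is nothing to run.
    with ThreadPoolExecutor(max_workers=len(pairs) or 1) as pool:
        # Executor.map() yields results in input order, so they line up with `pairs`.
        for use_case_id, _entity_key, metric_ids in pool.map(lambda p: run_query(*p), pairs):
            results[use_case_id].extend(metric_ids)
    return results
```

The point of the sketch is only the shape: every combination becomes an independent unit of work, so total latency is bounded by the slowest query rather than the sum of all of them.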

iambriccardo · 2024-03-05T09:47:00Z

src/sentry/snuba/metrics/fields/base.py

    request = Request(
-        dataset=Dataset.Metrics.value,
+        dataset=Dataset.Metrics.value


For some reason, the wrong dataset was used but we still got data back, interesting.

iambriccardo · 2024-03-05T09:47:42Z

src/sentry/snuba/metrics/datasource.py

+            break
+
+    stored_metrics = get_stored_metrics_of_projects(projects, use_case_ids, start, end)
+    metrics_blocking_state = (


Decided to move the use case check out of get_metrics_blocking_state_of_projects: that function should be use case agnostic, so the check is not its responsibility.
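
A minimal sketch of that separation, assuming the check in question is whether the custom use case was requested. The `UseCaseID` enum, `get_blocking_state`, and `blocking_state_for` below are hypothetical stand-ins, not Sentry's actual definitions.

```python
from enum import Enum


class UseCaseID(Enum):  # hypothetical stand-in for the real enum
    TRANSACTIONS = "transactions"
    CUSTOM = "custom"


def get_blocking_state(project_ids: list[int]) -> dict[int, list[str]]:
    """Stand-in for a use-case-agnostic lookup; note it takes no use case argument."""
    return {project_id: [] for project_id in project_ids}


def blocking_state_for(
    project_ids: list[int], use_case_ids: list[UseCaseID]
) -> dict[int, list[str]]:
    # The caller, not the lookup, decides whether blocking state is relevant.
    if UseCaseID.CUSTOM not in use_case_ids:
        return {}
    return get_blocking_state(project_ids)
```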

codecov · 2024-03-05T10:31:33Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 84.29%. Comparing base (e4571d1) to head (abcd6c3).
Report is 3 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #66298      +/-   ##
==========================================
- Coverage   84.29%   84.29%   -0.01%     
==========================================
  Files        5306     5306              
  Lines      237101   237095       -6     
  Branches    41031    41033       +2     
==========================================
- Hits       199866   199859       -7     
- Misses      37016    37017       +1     
  Partials      219      219

Files	Coverage Δ
src/sentry/api/endpoints/organization_metrics.py	`91.37% <100.00%> (+0.15%)`	⬆️
src/sentry/snuba/metrics/datasource.py	`97.61% <100.00%> (-0.18%)`	⬇️
src/sentry/snuba/metrics/fields/base.py	`94.55% <100.00%> (+0.03%)`	⬆️

... and 6 files with indirect coverage changes

obostjancic · 2024-03-05T10:40:26Z

src/sentry/snuba/metrics/datasource.py

    start: datetime | None = None,
    end: datetime | None = None,
 ) -> Sequence[MetricMeta]:
    if not projects:
        return []

-    stored_metrics = get_stored_metrics_of_projects(projects, use_case_id, start, end)
-    metrics_blocking_state = get_metrics_blocking_state_of_projects(projects, use_case_id)
+    has_custom_use_case_id = False


Suggested change

has_custom_use_case_id = False

has_custom_use_case_id = UseCaseID.CUSTOM in use_case_ids

Does it work for lists? I will check now

It works, nice!

ofc, python is pseudo code 😅
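
For reference, a small self-contained check of the point discussed above: the `in` operator works on any container, including lists, tuples, and sets of enum members. The `UseCaseID` enum here is a stand-in defined for the example, not the one from the Sentry codebase.

```python
from enum import Enum


class UseCaseID(Enum):  # stand-in enum for illustration only
    SESSIONS = "sessions"
    TRANSACTIONS = "transactions"
    CUSTOM = "custom"


def has_custom(use_case_ids) -> bool:
    """Membership test; accepts a list, tuple, set, or any other container."""
    return UseCaseID.CUSTOM in use_case_ids
```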

obostjancic · 2024-03-05T10:41:33Z

src/sentry/snuba/metrics/datasource.py

+    entity_keys = defaultdict(set)
+    for use_case_id in use_case_ids:
+        entity_keys[use_case_id] = entity_keys[use_case_id].union(
+            get_entity_keys_of_use_case_id(use_case_id=use_case_id)
        )

-    grouped_stored_metrics = {}
-    for stored_metric in stored_metrics:
-        grouped_stored_metrics.setdefault(stored_metric["metric_id"], []).append(
-            stored_metric["project_id"]
-        )
+    # We compute a list of all the queries that we want to run in parallel across entities and use cases.
+    requests = []
+    use_case_id_to_index = defaultdict(list)
+    for use_case_id, entity_keys in entity_keys.items():
+        for entity_key in entity_keys:
+            requests.append(
+                _get_metrics_by_project_for_entity_query(
+                    entity_key=entity_key,
+                    project_ids=project_ids,
+                    org_id=org_id,
+                    use_case_id=use_case_id,
+                    start=start,
+                    end=end,
+                )
+            )
+            use_case_id_to_index[use_case_id].append(len(requests) - 1)


nit: i would do these two things in the same nested loops

Which two things?

computing entity_keys dict and then computing a list of requests

I kept them separate so the implementation works even if you pass two use case ids whose entity keys differ, but it's a bit overengineered. I'll do as you said and simplify. Thanks for the suggestion!
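
One way the merged loop could look, as a sketch only. `get_entity_keys` and the tuple "requests" are placeholders invented for the example; the actual code calls get_entity_keys_of_use_case_id and builds query objects.

```python
from collections import defaultdict


def get_entity_keys(use_case_id: str) -> set[str]:
    """Hypothetical lookup of the entities that store a use case's metrics."""
    known = {"sessions": {"counters", "distributions", "sets"}}
    return known.get(use_case_id, {"generic_counters"})


def build_requests(use_case_ids: list[str]) -> tuple[list[tuple[str, str]], dict[str, list[int]]]:
    """Build the request list and the use-case-to-index map in one nested loop."""
    requests: list[tuple[str, str]] = []
    use_case_id_to_index: dict[str, list[int]] = defaultdict(list)
    for use_case_id in use_case_ids:
        # Sorting is only here to make the order deterministic for the example.
        for entity_key in sorted(get_entity_keys(use_case_id)):
            requests.append((use_case_id, entity_key))
            # Remember which result positions belong to this use case id.
            use_case_id_to_index[use_case_id].append(len(requests) - 1)
    return requests, dict(use_case_id_to_index)
```

Folding both steps into one loop also avoids rebinding the name `entity_keys` inside the loop over `entity_keys.items()`, which works in the quoted diff but is easy to misread.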

obostjancic · 2024-03-05T10:42:20Z

tests/sentry/snuba/metrics/test_datasource.py

+        # mris = get_stored_metrics_of_projects([self.project], [UseCaseID.TRANSACTIONS])
+        # assert mris == {
+        #     "d:transactions/duration@millisecond": [self.project.id],
+        # }
+        #
+        # mris = get_stored_metrics_of_projects([self.project], [UseCaseID.SESSIONS])
+        # assert mris == {
+        #     "d:sessions/duration@second": [self.project.id],
+        #     "c:sessions/session@none": [self.project.id],
+        #     "s:sessions/user@none": [self.project.id],
+        # }
+        #
+        # mris = get_stored_metrics_of_projects([self.project], [UseCaseID.CUSTOM])
+        # assert mris == {
+        #     custom_mri: [self.project.id],
+        # }


Suggested change

        # mris = get_stored_metrics_of_projects([self.project], [UseCaseID.TRANSACTIONS])
        # assert mris == {
        #     "d:transactions/duration@millisecond": [self.project.id],
        # }
        #
        # mris = get_stored_metrics_of_projects([self.project], [UseCaseID.SESSIONS])
        # assert mris == {
        #     "d:sessions/duration@second": [self.project.id],
        #     "c:sessions/session@none": [self.project.id],
        #     "s:sessions/user@none": [self.project.id],
        # }
        #
        # mris = get_stored_metrics_of_projects([self.project], [UseCaseID.CUSTOM])
        # assert mris == {
        #     custom_mri: [self.project.id],
        # }

sentry-io · 2024-03-05T14:37:40Z

Suspect Issues

This pull request was deployed and Sentry observed the following issues:

‼️ QueryExecutionTimeMaximum: DB::Exception: Estimated query execution time (71.86432810202771 seconds) is too long. Maximum: 3... /api/0/organizations/{organization_slug}/metric... View Issue
‼️ TypeError: 'NoneType' object is not iterable /api/0/organizations/{organization_slug}/metric... View Issue
‼️ SnubaError: HTTPConnectionPool(host='127.0.0.1', port=10006): Read timed out. (read timeout=30) /api/0/organizations/{organization_slug}/metric... View Issue


…llelize (#66298)

feat(ddm): Allow multiple use case ids to be queried at once and para…

926ebad

…llelize

github-actions bot added the Scope: Backend Automatically applied to PRs that change backend components label Mar 5, 2024

Fix

7154ccd

vercel bot deployed to Preview March 5, 2024 09:46 View deployment

iambriccardo commented Mar 5, 2024

View reviewed changes

vercel bot deployed to Preview March 5, 2024 09:48 View deployment

iambriccardo marked this pull request as ready for review March 5, 2024 09:51

iambriccardo requested a review from a team as a code owner March 5, 2024 09:51

Fix

a8b63d8

vercel bot deployed to Preview March 5, 2024 10:01 View deployment

obostjancic approved these changes Mar 5, 2024

View reviewed changes

Fix

a1f8dab

vercel bot deployed to Preview March 5, 2024 10:48 View deployment

Fix

abcd6c3

vercel bot deployed to Preview March 5, 2024 10:53 View deployment

iambriccardo merged commit ebfdc02 into master Mar 5, 2024
49 checks passed

iambriccardo deleted the riccardo/feat/multi-use-case-endpoint branch March 5, 2024 11:46

aliu3ntry pushed a commit that referenced this pull request Mar 6, 2024

feat(ddm): Allow multiple use case ids to be queried at once and para…

23a731e

…llelize (#66298)

github-actions bot locked and limited conversation to collaborators Mar 30, 2024
