
Conversation

@ceorourke (Member)

When a static or percent-based detector is changed to a dynamic detector, we need to send Seer historical data for that detector so it can detect anomalies. Likewise, when the query or aggregate on a dynamic detector's snuba query changes, we need to update the data Seer has so it's detecting anomalies on the correct data. This PR also ensures the existing data is not updated if the call to Seer fails for any reason.
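A rough sketch of that flow, with hypothetical helper names (`send_to_seer` stands in for whatever actually ships the historical data); this illustrates the approach described above, not the merged implementation:

```python
from rest_framework.exceptions import ValidationError


def send_to_seer(detector_id: int, query: str, aggregate: str) -> None:
    """Hypothetical stand-in for the call that ships historical data to Seer."""
    raise NotImplementedError


def update_detector(instance, validated_data: dict):
    old_type = instance.config.get("detection_type")
    new_type = validated_data.get("config", {}).get("detection_type", old_type)
    snuba_query = instance.snuba_query
    new_query = validated_data.get("query", snuba_query.query)
    new_aggregate = validated_data.get("aggregate", snuba_query.aggregate)

    became_dynamic = old_type != "dynamic" and new_type == "dynamic"
    query_changed = (
        new_query != snuba_query.query or new_aggregate != snuba_query.aggregate
    )

    if became_dynamic or (new_type == "dynamic" and query_changed):
        try:
            # Call Seer *before* saving anything; a failure aborts the update
            # so the detector's existing data is never left inconsistent.
            send_to_seer(instance.id, new_query, new_aggregate)
        except Exception:
            raise ValidationError("Couldn't send data to Seer, unable to update detector")

    snuba_query.query = new_query
    snuba_query.aggregate = new_aggregate
    snuba_query.save()
    return instance
```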

@github-actions bot added the Scope: Backend label (automatically applied to PRs that change backend components) Nov 7, 2025
Comment on lines -154 to -156
"id": self.data_condition_group.id,
"organizationId": self.organization.id,
@ceorourke (Member Author)


These aren't sent by the front end, so I wanted this to match. For the ids especially, it doesn't make sense that we'd be sending them on creation.

@@ -553,6 +551,379 @@ def test_transaction_dataset_deprecation_multiple_data_sources(self) -> None:
):
validator.save()


class TestMetricAlertsUpdateDetectorValidator(TestMetricAlertsDetectorValidator):
def test_update_with_valid_data(self) -> None:
@ceorourke (Member Author)


We didn't have a simple update test case, so I added one.

raise DetectorException(
f"Could not create detector, data condition {dcg_id} not found or too many found."
)
# use setattr to avoid saving the models until the Seer call has successfully finished,
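The inline comment above describes deferring model saves until Seer has accepted the data. A minimal sketch of that pattern, with illustrative names (`send_to_seer` is a placeholder callable):

```python
def apply_update(snuba_query, updates: dict, send_to_seer) -> None:
    # Mutate the model in memory only; nothing is written to the database yet.
    for field, value in updates.items():
        setattr(snuba_query, field, value)

    # If this raises, we never reach save(), so the stored row is unchanged.
    send_to_seer(snuba_query)

    snuba_query.save()
```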

@codecov

codecov bot commented Nov 7, 2025

Codecov Report

❌ Patch coverage is 86.53846% with 7 lines in your changes missing coverage. Please review.
✅ All tests successful. No failed tests found.

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| ...er/anomaly_detection/store_data_workflow_engine.py | 81.57% | 7 Missing ⚠️ |
Additional details and impacted files
@@             Coverage Diff             @@
##           master   #102934      +/-   ##
===========================================
+ Coverage   79.08%    80.70%   +1.61%     
===========================================
  Files        9224      9226       +2     
  Lines      393906    394080     +174     
  Branches    25109     25109              
===========================================
+ Hits       311527    318032    +6505     
+ Misses      81931     75600    -6331     
  Partials      448       448              

@ceorourke ceorourke marked this pull request as ready for review November 7, 2025 21:18
@ceorourke ceorourke requested review from a team as code owners November 7, 2025 21:18
resolution=timedelta(seconds=data_source.get("resolution", snuba_query.resolution)),
environment=data_source.get("environment", snuba_query.environment),
event_types=data_source.get("event_types", [event_type for event_type in event_types]),
)

Bug: update_detector_data receives a single data source dict instead of a {"data_sources": [...]} structure, preventing snuba_query updates.
Severity: CRITICAL | Confidence: 0.95

🔍 Detailed Analysis

When update_detector_data is invoked at src/sentry/incidents/metric_issue_detector.py:249, it receives validated_data_source, which is a single dictionary. However, the update_detector_data function expects a dictionary containing a "data_sources" key with a list of data sources. This mismatch causes the internal logic to skip updating the snuba_query object's fields. Consequently, when a dynamic detector's snuba query is updated, the old query, aggregate, and event types are sent to Seer instead of the new values, leading to anomaly detection operating on incorrect metrics.

💡 Suggested Fix

Modify the call to update_detector_data at src/sentry/incidents/metric_issue_detector.py:249 to pass {"data_sources": [validated_data_source]} instead of validated_data_source directly, aligning with the expected input structure.
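Sketching the mismatch the bot describes (the function body and sample values are illustrative, following the report rather than the merged code):

```python
def update_detector_data(validated_data: dict) -> None:
    # Expects {"data_sources": [...]}. Passed a bare data-source dict instead,
    # this lookup yields no items and the snuba_query update is silently skipped.
    for data_source in validated_data.get("data_sources") or []:
        ...


validated_data_source = {"query": "is:unresolved", "aggregate": "count()"}

# Buggy call: a single dict, so nothing is updated.
update_detector_data(validated_data_source)

# Suggested fix: wrap it in the expected structure.
update_detector_data({"data_sources": [validated_data_source]})
```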

@mifu67 mifu67 self-requested a review November 11, 2025 19:00
@mifu67 (Contributor) left a comment


The update logic itself looks good; just a question about when we should be updating.


# Handle a dynamic detector's snuba query changing
if instance.config.get("detection_type") == AlertRuleDetectionType.DYNAMIC:
if snuba_query.query != data_source.get(
@mifu67 (Contributor)


Do we need to check other snuba query fields as well (like timeWindow)? Should we just resend the data every time we update a dynamic detector?

@ceorourke (Member Author)


Good question, I will look into this for a follow up. I don't think we need to resend on every change (e.g. it could just be the detector's name changing), but we might be missing some cases where we should be updating.

event_types,
)
except (TimeoutError, MaxRetryError, ParseError, ValidationError):
raise ValidationError("Couldn't send data to Seer, unable to update detector")
Member


Wouldn't it be better to track timeout, retry failure and parse errors separately as opposed to converting them to ValidationError?

@ceorourke (Member Author) Nov 12, 2025


I converted them to a ValidationError because this is in the update detector method, and that's how we can surface the error to the user. I can break it out into separate ones to be more informative, though:

except TimeoutError:
    raise ValidationError("Timed out sending data to Seer, unable to update detector")
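A fuller sketch of that idea, breaking the errors out while still surfacing each as a user-facing ValidationError (the wrapped callable and the message strings are illustrative):

```python
from rest_framework.exceptions import ParseError, ValidationError
from urllib3.exceptions import MaxRetryError


def send_or_raise(send_to_seer, *args) -> None:
    try:
        send_to_seer(*args)
    except TimeoutError:
        raise ValidationError("Timed out sending data to Seer, unable to update detector")
    except MaxRetryError:
        raise ValidationError("Exceeded retries sending data to Seer, unable to update detector")
    except ParseError:
        raise ValidationError("Couldn't parse the Seer response, unable to update detector")
```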

@ceorourke ceorourke force-pushed the ceorourke/send-historical-data-to-seer-on-update branch from 89441c2 to 7a87a33 Compare November 12, 2025 23:21
if instance.config.get("detection_type") == AlertRuleDetectionType.DYNAMIC:
if snuba_query.query != data_source.get(
"query"
) or snuba_query.aggregate != data_source.get("aggregate"):

Bug: Prevent Unnecessary Seer API Calls

The condition checking if a dynamic detector's query or aggregate changed compares against data_source.get("query") and data_source.get("aggregate") which return None when these fields aren't being updated. This causes the comparison snuba_query.query != None to be True even when the query hasn't changed, triggering unnecessary Seer API calls. The defaults should be the existing values: data_source.get("query", snuba_query.query) and data_source.get("aggregate", snuba_query.aggregate).
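A sketch of the suggested comparison with existing-value defaults (attribute names follow the diff above):

```python
def snuba_query_changed(snuba_query, data_source: dict) -> bool:
    # Default to the stored values so an omitted field compares equal rather
    # than comparing against None and always looking "changed".
    new_query = data_source.get("query", snuba_query.query)
    new_aggregate = data_source.get("aggregate", snuba_query.aggregate)
    return new_query != snuba_query.query or new_aggregate != snuba_query.aggregate
```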


@ceorourke ceorourke force-pushed the ceorourke/send-historical-data-to-seer-on-update branch from 0236524 to 2d5dec1 Compare November 13, 2025 17:51
@ceorourke ceorourke merged commit 69c0f91 into master Nov 13, 2025
65 checks passed
@ceorourke ceorourke deleted the ceorourke/send-historical-data-to-seer-on-update branch November 13, 2025 18:17
@sentry

sentry bot commented Nov 13, 2025

Issues attributed to commits in this pull request

This pull request was merged and Sentry observed the following issues:

ceorourke added a commit that referenced this pull request Nov 14, 2025

Follow up to #102934 (comment) to update Seer when anything on the snuba query changes. We were missing some instances where the data changed in a way Seer would want to know about, but we weren't sending the updates.
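One way such a follow-up could detect that anything Seer-relevant changed, as a sketch; the field list is an assumption about what Seer cares about, not the merged implementation:

```python
SEER_RELEVANT_FIELDS = ("query", "aggregate", "time_window", "environment")


def needs_seer_update(snuba_query, updates: dict) -> bool:
    # Resend whenever any Seer-relevant field appears in the update and
    # differs from what's currently stored.
    return any(
        field in updates and updates[field] != getattr(snuba_query, field, None)
        for field in SEER_RELEVANT_FIELDS
    )
```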
