fix(releases): Fix environment filtering for per-project new groups count in old releases serializer #101003

srest2021 · 2025-10-06T19:12:39Z

Previously, the per-project new groups count was being populated by ReleaseProject's new_groups. However, this count becomes incorrect when filtering by environment. Here we populate the per-project new groups count with the counts obtained from ReleaseProjectEnvironment when available. This fix only applies to the old serializer.

We also fix a Django aggregation that calculates the new groups counts when environments are present. Previously the query was grouping once per row, and ordering by first seen, like so:
... GROUP BY "sentry_releaseprojectenvironment"."id" ORDER BY "sentry_releaseprojectenvironment"."first_seen" DESC

But if we try to filter by multiple environments, this won't work because we can have multiple rows for a single (release, project) pairing, and whichever environment is ordered last will "win" and have its values set as the new groups counts.

In this PR, we make a new helper function that skips the unnecessary ordering, and we modify the query to group only by project and release id. We also add test coverage for new groups counts.

srest2021 · 2025-10-06T20:29:04Z

@sentry review

codecov · 2025-10-06T20:49:42Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ All tests successful. No failed tests found.

Additional details and impacted files

@@             Coverage Diff             @@
##           master   #101003      +/-   ##
===========================================
+ Coverage   76.52%    81.14%   +4.62%     
===========================================
  Files        8653      8658       +5     
  Lines      384048    384155     +107     
  Branches    24249     24186      -63     
===========================================
+ Hits       293880    311721   +17841     
+ Misses      89824     72090   -17734     
  Partials      344       344

cmanallen · 2025-10-07T18:11:44Z

src/sentry/api/serializers/models/release.py

-            aggregated_new_issues_count=Sum("new_issues_count")
-        ).values_list("project_id", "release_id", "aggregated_new_issues_count"):
+        for project_id, release_id, new_groups in (
+            release_project_envs.order_by()


Ordering is not required and specifying order_by with no arguments does not order the results.

cmanallen · 2025-10-07T18:17:51Z

src/sentry/api/serializers/models/release.py

-        ).values_list("project_id", "release_id", "aggregated_new_issues_count"):
+        for project_id, release_id, new_groups in (
+            release_project_envs.order_by()
+            .values("project_id", "release_id")


This can be removed since you're calling values_list below.

srest2021 · 2025-10-07T19:16:49Z

Hi @cmanallen here is my explanation for why I changed the query (sorry for the wall of text)!

The old query will sometimes fail when we filter by multiple environments.

Assuming we’re working in release 1.0.0 (id “r123”) and we have 4 new groups for project A (id “a123”) and 2 new groups for project B (id "b123”):

For project A: 3 issues in production, 1 issue in staging (total = 4)
For project B: 2 issues in production, 0 issues in staging (total = 2)

This is the same setup as in test_new_groups_environment_filtering. Say we are filtering by the production and staging environments.

The old query will generate SQL like this:
SELECT "sentry_releaseprojectenvironment"."project_id" AS "project_id", "sentry_releaseprojectenvironment"."release_id" AS "release_id", SUM("sentry_releaseprojectenvironment"."new_issues_count") AS "aggregated_new_issues_count" FROM "sentry_releaseprojectenvironment" INNER JOIN "sentry_environment" ON ("sentry_releaseprojectenvironment"."environment_id" = "sentry_environment"."id") WHERE ("sentry_releaseprojectenvironment"."release_id" IN (r123) AND "sentry_environment"."name" IN (production, staging)) GROUP BY "sentry_releaseprojectenvironment"."id" ORDER BY "sentry_releaseprojectenvironment"."first_seen" DESC

Note that there’s an order-by at the very end despite no ordering being included in the code, and that we’re grouping by id. This will essentially group every single row individually.

We get a query set like so: <BaseQuerySet [(b123, r123, 0), (b123, r123, 2), (a123, r123, 1), (a123, r123, 3)]>

And as expected we get one item per row in the ReleaseProjectEnvironment table:

(b123, r123, 0) is # new groups in staging for project B
(b123, r123, 2) is # new groups in prod for project B
(a123, r123, 1) is # new groups in staging for project A
(a123, r123, 3) is # new groups in prod for project A

And when we calculate group_counts_by_release, we get this: {r123: {b123: 2, a123: 3}} instead of the correct counts: {r123: {b123: 2, a123: 4}}. Because we’re grouping each row individually, when we run the for project_id, release_id, new_groups loop on this queryset, all the rows for a project will overwrite each other and the last row for that project will “win”. So for example, because (a123, r123, 3) was the last row for project a, that’s why we get 3 new groups for project a instead of the correct total 3+1=4. That's why the old query will fail only if we want multiple environments / multiple rows per project/release pair.

So instead of grouping by row we want to group by (project_id, release_id). If we add .values("project_id", "release_id”) to the query, we get this SQL:
SELECT "sentry_releaseprojectenvironment"."project_id" AS "project_id", "sentry_releaseprojectenvironment"."release_id" AS "release_id", SUM("sentry_releaseprojectenvironment"."new_issues_count") AS "aggregated_new_issues_count" FROM "sentry_releaseprojectenvironment" INNER JOIN "sentry_environment" ON ("sentry_releaseprojectenvironment"."environment_id" = "sentry_environment"."id") WHERE ("sentry_releaseprojectenvironment"."release_id" IN (r123) AND "sentry_environment"."name" IN (production, staging)) GROUP BY 1, 2, "sentry_releaseprojectenvironment"."first_seen" ORDER BY "sentry_releaseprojectenvironment"."first_seen" DESC

Note that for some reason we’re also grouping by first_seen and we’re still ordering by first_seen as well. This query will give us the same exact query set and group_counts_by_release. I suspect it’s because we’re still not grouping only by project and release id.

Now if we also add .order_by() to the query, in addition to .values("project_id", "release_id”), we get this:
SELECT "sentry_releaseprojectenvironment"."project_id" AS "project_id", "sentry_releaseprojectenvironment"."release_id" AS "release_id", SUM("sentry_releaseprojectenvironment"."new_issues_count") AS "aggregated_new_issues_count" FROM "sentry_releaseprojectenvironment" INNER JOIN "sentry_environment" ON ("sentry_releaseprojectenvironment"."environment_id" = "sentry_environment"."id") WHERE ("sentry_releaseprojectenvironment"."release_id" IN (r123) AND "sentry_environment"."name" IN (production, staging)) GROUP BY 1, 2

Now we’re only grouping by project and release id. I think this is because adding order_by() clears whatever default ordering was being added on top of the original query.

And with this query we finally get the correct query set: <BaseQuerySet [(a123, r123, 4), (b123, r123, 2)]> and the correct group_counts_by_release: {r123: {b123: 2, a123: 4}}

Let me know if this makes sense to you.

cmanallen

@srest2021 The .values() method alters the query's result type. It does not modify the query. Using a bare order_by does reset ordering but ask yourself the question, why should ordering matter at all for this function?

Let's abstract the queries into their own functions. Then relentlessly unit test the functions and ensure they are deterministic and that you totally and completely understand what each component of the function is returning and being transformed to. Retrieving counts should not be order dependent otherwise we're making a mistake in calculation.

Radical simplicity should be your goal. Strip away everything. Start from scratch. They are small queries. Re-write them in their most ideal form.

cursor · 2025-10-08T01:20:33Z

src/sentry/api/serializers/models/release.py

+        if project is not None:
+            release_project_envs = release_project_envs.filter(project=project)
+
+        return release_project_envs


Bug: N+1 Query Issue in Release Data Retrieval

The new _get_release_project_envs_unordered method, used when environments are specified, omits select_related("project"). This means __get_release_data_with_environments will trigger N+1 database queries when accessing the project relation.

Is this not just accessing project id? We don't access any other attributes of release_project_envs.project.

cmanallen · 2025-10-08T13:46:45Z

src/sentry/api/serializers/models/release.py

+    def _get_release_project_envs_unordered(self, item_list, environments, project):
+        release_project_envs = ReleaseProjectEnvironment.objects.filter(
+            release__in=item_list
+        ).select_related("release")


Why are we joining the release model? This call to select_related seems unnecessary.

We access release_project_env.release.version in other parts of __get_release_data_with_environments, for example:

sentry/src/sentry/api/serializers/models/release.py

Line 507 in 6f7b903

release_project_env.release.version not in first_seen

Previously, we were using release.newGroups to populate the per-project new issues counts for releases. However, this count is the total number of new issues for this release. Here, we instead use release.projects[x].newGroups, which now contains the correct number of new issues for this release in the selected project. We switch to using release.projects[x].newGroups in the following places: releases index, releases drawer, session health reverts #99555, which switched from release.projects[x].newGroups to release.newGroups followup to #101003, which fixed the backend bug when calculating release.projects[x].newGroups ### Demo Setup RELEASE 1.0.0 Project A - **3** total new groups - Development - **2** new groups (Error & TypeError) - Production - **1** new group (SyntaxError) Project B - **4** total new groups - Development - **3** new groups (ReferenceError & EvalError & URIError - Production - **1** new group (SyntaxError) RELEASE 2.0.0 Project A - **1** total new group - Development - **1** new group (RangeError) ### Demo Before: the new issues counts for each project are all equal to the total number of new issues in that release (eg, 7 new groups for each project in Release 1.0.0 == 2+1+3+1) https://github.com/user-attachments/assets/043ceed5-f0cb-4c6a-a954-837830d2c091 After: we get the correct per-project new issues counts, even when filtering by project or env or both https://github.com/user-attachments/assets/df5c4057-235d-4b4f-a93f-3c7aea2af656

fix environment filtering for new groups counmt

5180085

srest2021 changed the title ~~fix(releases): Fix environment filtering for new groups count~~ fix(releases): Fix environment filtering for per-project new groups count Oct 6, 2025

github-actions bot added the Scope: Backend Automatically applied to PRs that change backend components label Oct 6, 2025

vercel bot deployed to Preview October 6, 2025 19:13 View deployment

renaming

d7ac732

vercel bot deployed to Preview October 6, 2025 20:30 View deployment

cleaning up

94d3251

vercel bot deployed to Preview October 6, 2025 20:38 View deployment

srest2021 added 2 commits October 6, 2025 13:48

remove change

c2e43c7

minimize diff

d10c805

vercel bot deployed to Preview October 6, 2025 20:52 View deployment

srest2021 marked this pull request as ready for review October 6, 2025 20:59

add query fix

05eef4b

vercel bot deployed to Preview October 6, 2025 21:27 View deployment

oops remove commented out line

a4add81

vercel bot deployed to Preview October 6, 2025 21:32 View deployment

Merge branch 'master' into srest2021/fix-new-groups-count-old-serializer

b43f15b

vercel bot deployed to Preview October 6, 2025 21:34 View deployment

srest2021 requested a review from a team October 6, 2025 22:04

cmanallen approved these changes Oct 7, 2025

View reviewed changes

cmanallen requested changes Oct 7, 2025

View reviewed changes

rewrite get envs & get release data with envs

42ac977

This comment was marked as outdated.

Sign in to view

vercel bot deployed to Preview October 7, 2025 22:18 View deployment

get_release_data_with_environments tests

292de0f

This comment was marked as outdated.

Sign in to view

vercel bot deployed to Preview October 7, 2025 23:37 View deployment

fix tests

47f3496

vercel bot deployed to Preview October 7, 2025 23:41 View deployment

remove added unit tests; clean up get_attrs

2ab71c9

vercel bot deployed to Preview October 8, 2025 01:08 View deployment

srest2021 changed the title ~~fix(releases): Fix environment filtering for per-project new groups count~~ fix(releases): Fix environment filtering for per-project new groups count in old releases serializer Oct 8, 2025

add select_related release

bca93ce

cursor bot reviewed Oct 8, 2025

View reviewed changes

vercel bot deployed to Preview October 8, 2025 01:21 View deployment

cmanallen reviewed Oct 8, 2025

View reviewed changes

cmanallen approved these changes Oct 8, 2025

View reviewed changes

srest2021 merged commit d568d52 into master Oct 8, 2025
65 checks passed

srest2021 deleted the srest2021/fix-new-groups-count-old-serializer branch October 8, 2025 17:35

srest2021 mentioned this pull request Oct 8, 2025

fix(releases): fix release per-project new issues count in UI #101212

Merged

jasonyuezhang mentioned this pull request Oct 8, 2025

fix(releases): fix release per-project new issues count in UI jasonyuezhang/sentry#53

Open

michellewzhang mentioned this pull request Oct 8, 2025

fix(releases): fix new issue count on releases index #100858

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix(releases): Fix environment filtering for per-project new groups count in old releases serializer #101003

fix(releases): Fix environment filtering for per-project new groups count in old releases serializer #101003

Uh oh!

srest2021 commented Oct 6, 2025 •

edited

Loading

Uh oh!

srest2021 commented Oct 6, 2025

Uh oh!

codecov bot commented Oct 6, 2025 •

edited

Loading

Uh oh!

cmanallen Oct 7, 2025

Uh oh!

cmanallen Oct 7, 2025

Uh oh!

srest2021 commented Oct 7, 2025 •

edited

Loading

Uh oh!

cmanallen left a comment

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

cursor bot Oct 8, 2025

Uh oh!

srest2021 Oct 8, 2025

Uh oh!

cmanallen Oct 8, 2025

Uh oh!

srest2021 Oct 8, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fix(releases): Fix environment filtering for per-project new groups count in old releases serializer #101003

fix(releases): Fix environment filtering for per-project new groups count in old releases serializer #101003

Uh oh!

Conversation

srest2021 commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

srest2021 commented Oct 6, 2025

Uh oh!

codecov bot commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

cmanallen Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

cmanallen Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

srest2021 commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cmanallen left a comment

Choose a reason for hiding this comment

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

cursor bot Oct 8, 2025

Choose a reason for hiding this comment

Bug: N+1 Query Issue in Release Data Retrieval

Uh oh!

srest2021 Oct 8, 2025

Choose a reason for hiding this comment

Uh oh!

cmanallen Oct 8, 2025

Choose a reason for hiding this comment

Uh oh!

srest2021 Oct 8, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

srest2021 commented Oct 6, 2025 •

edited

Loading

codecov bot commented Oct 6, 2025 •

edited

Loading

srest2021 commented Oct 7, 2025 •

edited

Loading