ref(db): Refactor EnvironmentProject to reduce rollbacks #89265

vgrozdanic · 2025-04-10T11:09:37Z

Similar to how it was done in: #88885

Refactor of add_project method of Environment model, preparing it for gradual rollout and to decrease the number of rollbacks it does since almost every attempt of insertions for this model results in a rollback.

In this method EnvironmentProject was doing overly optimistic inserts leading to us having almost 100 rollbacks/second coming just from this model

Why are we doing this

Currently we are doing around 300 rollbacks per second, mostly caused by overly optimistic writes - almost all of the writes result in the rollback because the data already exists in the table, and for those occasions get_or_create is more suitable since SELECT statement is more performant than ROLLBACK when they happen most of the times.

Datadog notebok with investagtion, where 3 problematic models where detected:

~~GroupRelease~~
Commit
EnvironmentProject

Models will be refactored one at the time, and the refactor will be rolled out gradually: 10% - 50% - 100%

…t model

vgrozdanic · 2025-04-14T12:17:47Z

src/sentry/models/environment.py

+            if in_random_rollout("environmentproject.new_add_project.rollout"):
+                _, created = EnvironmentProject.objects.get_or_create(
+                    project=project, environment=self, defaults={"is_hidden": is_hidden}
+                )
+                if not created:
+                    # We've already created the object, should still cache the action.
+                    cache.set(cache_key, 1, 3600)


This is just a refactor without changing any logic that was being done before:

is_hidden is only set during creation - same as it was before

if the EnvironmentProject already exists, we write to cache - same as it was before (assumption is that this is a protection on db to not overload it with to many write requests, we keep it as cache lookup is less expensive than DB lookup)

Shouldn't we also write to cache when the EnvironmentProject record is first created? I know that is a change in logic, but it would help reduce the number of queries we're running.

I think it's not a change in logic to cache both paths. Previously, we'd cache immediately after create and inside the exception. So it's more correct to remove if not created and just cache both

Agree, edited the code to always cache the value

codecov · 2025-04-14T12:33:57Z

Codecov Report

All modified and coverable lines are covered by tests ✅

✅ All tests successful. No failed tests found.

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #89265      +/-   ##
==========================================
+ Coverage   87.73%   87.75%   +0.01%     
==========================================
  Files       10172    10138      -34     
  Lines      574129   573043    -1086     
  Branches    22612    22425     -187     
==========================================
- Hits       503728   502863     -865     
+ Misses      69985    69743     -242     
- Partials      416      437      +21

markstory · 2025-04-14T14:49:15Z

src/sentry/models/environment.py

+            if in_random_rollout("environmentproject.new_add_project.rollout"):
+                _, created = EnvironmentProject.objects.get_or_create(
+                    project=project, environment=self, defaults={"is_hidden": is_hidden}
+                )
+                if not created:
+                    # We've already created the object, should still cache the action.
+                    cache.set(cache_key, 1, 3600)


Shouldn't we also write to cache when the EnvironmentProject record is first created? I know that is a change in logic, but it would help reduce the number of queries we're running.

…90042) After rolling out to 100% last week, we can now safely remove old code which is no longer used/ Continuation of getsentry/sentry-options-automator#3597 and #89265

Similar to how it was done in: #88885 Refactor of `add_project` method of `Environment` model, preparing it for gradual rollout and to decrease the number of rollbacks it does since almost every attempt of insertions for this model results in a rollback. In this method `EnvironmentProject` was doing overly optimistic inserts leading to us having almost 100 rollbacks/second coming just from this model ## Why are we doing this Currently we are doing around 300 rollbacks per second, mostly caused by overly optimistic writes - almost all of the writes result in the rollback because the data already exists in the table, and for those occasions `get_or_create` is more suitable since SELECT statement is more performant than ROLLBACK when they happen most of the times. [Datadog notebok with investagtion](https://app.datadoghq.com/notebook/12067672/postgres-rollback-investigation?range=604800000&start=1743184480708&live=true), where 3 problematic models where detected: - ~GroupRelease~ - `Commit` - `EnvironmentProject` Models will be refactored one at the time, and the refactor will be rolled out gradually: 10% - 50% - 100%

…90042) After rolling out to 100% last week, we can now safely remove old code which is no longer used/ Continuation of getsentry/sentry-options-automator#3597 and #89265

ref(db): Clean up too optimistic insertion code for EnvironmentProjec…

6d874ac

…t model

github-actions bot added the Scope: Backend Automatically applied to PRs that change backend components label Apr 10, 2025

vercel bot deployed to Preview April 10, 2025 11:10 View deployment

rename flag

1cd823c

vgrozdanic force-pushed the vgrozdanic/refactor-environment-model branch from fc5bb36 to 1cd823c Compare April 10, 2025 11:11

vercel bot deployed to Preview April 10, 2025 11:20 View deployment

vgrozdanic added 2 commits April 14, 2025 14:11

Merge branch 'master' into vgrozdanic/refactor-environment-model

bc4ef3a

keep same logic

9b2ad45

vgrozdanic commented Apr 14, 2025

View reviewed changes

vgrozdanic marked this pull request as ready for review April 14, 2025 12:17

vgrozdanic requested a review from a team April 14, 2025 12:18

vercel bot deployed to Preview April 14, 2025 12:22 View deployment

armenzg approved these changes Apr 14, 2025

View reviewed changes

markstory approved these changes Apr 14, 2025

View reviewed changes

always cache

28b9737

vercel bot deployed to Preview April 15, 2025 07:40 View deployment

Merge branch 'master' into vgrozdanic/refactor-environment-model

84b260c

vercel bot deployed to Preview April 15, 2025 07:47 View deployment

vgrozdanic merged commit de38201 into master Apr 15, 2025
60 checks passed

vgrozdanic deleted the vgrozdanic/refactor-environment-model branch April 15, 2025 08:13

vgrozdanic mentioned this pull request Apr 22, 2025

ref(db): Clean old code that caused rollbacks for EnvironmentProject #90042

Merged

github-actions bot locked and limited conversation to collaborators Apr 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ref(db): Refactor EnvironmentProject to reduce rollbacks #89265

ref(db): Refactor EnvironmentProject to reduce rollbacks #89265

Uh oh!

vgrozdanic commented Apr 10, 2025 •

edited

Loading

Uh oh!

vgrozdanic Apr 14, 2025

Uh oh!

markstory Apr 14, 2025

Uh oh!

wedamija Apr 14, 2025

Uh oh!

vgrozdanic Apr 15, 2025

Uh oh!

codecov bot commented Apr 14, 2025 •

edited

Loading

Uh oh!

markstory Apr 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

ref(db): Refactor EnvironmentProject to reduce rollbacks #89265

ref(db): Refactor EnvironmentProject to reduce rollbacks #89265

Uh oh!

Conversation

vgrozdanic commented Apr 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why are we doing this

Uh oh!

vgrozdanic Apr 14, 2025

Choose a reason for hiding this comment

Uh oh!

markstory Apr 14, 2025

Choose a reason for hiding this comment

Uh oh!

wedamija Apr 14, 2025

Choose a reason for hiding this comment

Uh oh!

vgrozdanic Apr 15, 2025

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Apr 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

markstory Apr 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

vgrozdanic commented Apr 10, 2025 •

edited

Loading

codecov bot commented Apr 14, 2025 •

edited

Loading