perf(eventstream): Add caching layer, raise sort #103756
Conversation
Codecov Report

❌ Patch coverage is

Additional details and impacted files:

```
@@            Coverage Diff            @@
##            master   #103756     +/-  ##
==========================================
  Coverage    80.59%    80.60%
==========================================
  Files         9274      9279      +5
  Lines       395956    396164    +208
  Branches     25250     25250
==========================================
+ Hits        319138    319320    +182
- Misses       76389     76415     +26
  Partials       429       429
```
Two changes:
1. Add a caching layer so that we can avoid hitting the DB for hot groups (a rough sketch follows below).
2. We don't need the sort in 95% of cases. Let's raise that into Python and only do it when necessary.
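As a rough illustration of the caching-layer idea, here is a minimal read-through cache in front of the group lookup. This is a sketch under assumptions, not the PR's implementation: the cache key format, the timeout, and the `fetch_group_times` helper are all hypothetical.

```python
# Hedged sketch of a read-through cache for hot groups. The key format,
# timeout, and fetch_group_times helper are hypothetical, not the PR's code.
from django.core.cache import cache

CACHE_TIMEOUT = 60  # seconds; a hot group hits postgres at most once per minute


def fetch_group_times(group_ids):
    """Stand-in for the real postgres lookup (hypothetical)."""
    raise NotImplementedError


def get_group_times(group_ids):
    results, missing = {}, []
    for group_id in group_ids:
        cached = cache.get(f"eventstream:group:{group_id}")  # hypothetical key
        if cached is not None:
            results[group_id] = cached
        else:
            missing.append(group_id)
    if missing:
        # Only the cache misses fall through to the database.
        for group_id, value in fetch_group_times(missing).items():
            cache.set(f"eventstream:group:{group_id}", value, CACHE_TIMEOUT)
            results[group_id] = value
    return results
```

With a cache like this in front of the lookup, the sort can also be skipped on the hot path and performed in Python only when a caller actually needs ordered results, per change 2.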
wedamija left a comment:
I have some broader concerns about how we're doing this. It looks like for every snuba query that uses SnubaQueryParams we're potentially making multiple calls to postgres? The caching will help here, but we've still added thousands of queries a second to the database. Let's see if this drops a lot, and then decide whether we need to revisit this solution in general.
I think it's surprising behaviour that just initializing SnubaQueryParams causes postgres queries.
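One common way to avoid that surprise, sketched here with a hypothetical stand-in class rather than Sentry's real SnubaQueryParams, is to defer the lookup with `functools.cached_property` so construction does no DB work:

```python
# Hypothetical sketch: defer DB access out of __init__ so constructing
# the params object is cheap; postgres is hit only on first access.
from functools import cached_property


def fetch_group_times(group_ids):
    """Stand-in for the real postgres lookup (hypothetical)."""
    raise NotImplementedError


class QueryParams:  # illustrative stand-in, not Sentry's SnubaQueryParams
    def __init__(self, group_ids):
        self.group_ids = group_ids  # no queries at construction time

    @cached_property
    def group_times(self):
        # Runs at most once per instance, on first attribute access.
        return fetch_group_times(self.group_ids)
```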
```python
# excerpt from the diff; assumes: from datetime import UTC, datetime, timedelta
running_data = {
    (group_id, datetime.now(UTC) + timedelta(minutes=1)) for group_id in group_ids
}
```
Should this just be a dict of <group_id>: <max_date>? Then when we add rows later, we just do running_data[group_id] = max(running_data[group_id], new_date)?
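A minimal, runnable sketch of that dict shape (the names and seed values are illustrative, not the PR's code):

```python
# Hypothetical sketch of the dict-based alternative: track the maximum
# date per group rather than accumulating (group_id, date) tuples in a set.
from datetime import UTC, datetime, timedelta

group_ids = [1, 2, 3]  # illustrative

running_data = {
    group_id: datetime.now(UTC) + timedelta(minutes=1) for group_id in group_ids
}


def add_row(group_id, new_date):
    # Keep only the latest date seen for each group.
    running_data[group_id] = max(running_data.get(group_id, new_date), new_date)
```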
I find the set easier to reason about, and I don't think there'll be that much duplication (we only expect one Redirect per group).