feat(dynamic-sampling): Add per-org EAP transaction volume query by constantinius · Pull Request #115161 · getsentry/sentry

constantinius · 2026-05-08T07:53:00Z

Add get_eap_transaction_volumes() to retrieve per-project transaction volumes from EAP spans, with optional ordering (order_by_volume) and max_transactions limit. Uses the existing run_eap_spans_table_query_in_chunks() for batched iteration.

Replaces #115047 (which got corrupted during a rebase).

Closes https://linear.app/getsentry/issue/TET-2306/create-transaction-volume-query-for-eap

Add get_eap_transaction_volumes() to retrieve per-project transaction volumes from EAP spans, with optional ordering and max_transactions limit. Uses the existing run_eap_spans_table_query_in_chunks() for batched iteration and adds a new Snuba referrer for the query.

Co-Authored-By: Claude Sonnet 4 <noreply@example.com>

linear-code · 2026-05-08T08:00:16Z

TET-2306

shellmayr · 2026-05-27T12:36:53Z

+        if (transaction := row.get(DynamicSamplingQueryFields.TRANSACTION)) is None:
+            continue


Can we add this filter to the query directly? (not sure if this is exposed)

Good call - added has:sentry.dsc.transaction to the query string so the filter happens server-side, and dropped the corresponding is None check in the result loop.

shellmayr

LGTM 👍

cursor · 2026-05-27T13:43:09Z


-        if not more_results:
+        # either we run out of results or we hit the max results limit, in both cases we should stop
+        if not more_results or (max_results is not None and offset >= max_results):


Unused max_results parameter adds dead code to chunking function

Low Severity

The max_results parameter was added to run_eap_spans_table_query_in_chunks along with new branching logic (current_chunk_size variable, conditional chunk-size recalculation, extra termination condition), but get_eap_transaction_volumes calls Spans.run_table_query directly instead of using this helper. No caller in the codebase passes max_results, making this parameter and its associated logic untested dead code.

Reviewed by Cursor Bugbot for commit 21a0167.
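For readers following the comment above, here is a minimal sketch of the kind of chunked iteration with a `max_results` cap that it describes. This is not the helper from the PR: `fetch_page` is a hypothetical callable standing in for the real EAP query, and the function name and defaults are assumptions. Only the termination condition mirrors the diff shown above.

```python
from typing import Any, Callable, Dict, Iterator, List, Optional

Row = Dict[str, Any]


def iter_rows_in_chunks(
    fetch_page: Callable[[int, int], List[Row]],  # hypothetical: (offset, limit) -> rows
    chunk_size: int = 100,
    max_results: Optional[int] = None,
) -> Iterator[Row]:
    """Yield rows page by page, stopping at max_results if it is set."""
    offset = 0
    while True:
        # Shrink the last chunk so we never request more than max_results rows.
        current_chunk_size = chunk_size
        if max_results is not None:
            current_chunk_size = min(chunk_size, max_results - offset)
        if current_chunk_size <= 0:
            return

        rows = fetch_page(offset, current_chunk_size)
        yield from rows
        offset += len(rows)

        # A short page means the backend has no further rows.
        more_results = len(rows) == current_chunk_size
        # Either we run out of results or we hit the limit; stop in both cases.
        if not more_results or (max_results is not None and offset >= max_results):
            return
```

If no caller ever passes `max_results`, the two branches guarded by `max_results is not None` are never exercised, which is the dead-code concern raised in the comment.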

sentry · 2026-05-28T07:36:40Z

+        project_id = _get_aggregate_int(row, DynamicSamplingQueryFields.DSC_PROJECT_ID)
+        project_volumes = volumes_by_project[project_id]


Bug: The function get_eap_transaction_volumes lacks a guard for missing sentry.dsc.project_id fields, causing transaction data to be incorrectly aggregated under project_id = 0.
Severity: MEDIUM

Suggested Fix

Add a guard in get_eap_transaction_volumes to check if dsc_project_id is present in the row before processing it, similar to the logic in get_eap_project_volumes. If the ID is missing, the row should be skipped to prevent incorrect aggregation.

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid.

Location: src/sentry/dynamic_sampling/per_org/tasks/queries.py#L233-L234

Potential issue: The function `get_eap_transaction_volumes` processes query results to aggregate transaction volumes. If a row from the query is missing the `sentry.dsc.project_id` field, the helper `_get_aggregate_int` will default to returning `0`. This leads to transaction data being incorrectly assigned to `project_id = 0`, which is not a valid project ID, corrupting the final aggregation. A similar function, `get_eap_project_volumes`, contains an explicit guard to skip rows with a missing `dsc_project_id`, but this new function lacks that defensive check.
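A minimal sketch of the guard suggested above, assuming rows arrive as plain dictionaries. The function name and the `count()` column are hypothetical, and the real code reads its keys from `DynamicSamplingQueryFields`; this only illustrates skipping rows rather than letting them fall into a `project_id = 0` bucket.

```python
from collections import defaultdict
from typing import Any, Dict, List

PROJECT_ID_FIELD = "sentry.dsc.project_id"  # field name as quoted in this thread
COUNT_FIELD = "count()"  # hypothetical aggregate column name


def group_counts_by_project(rows: List[Dict[str, Any]]) -> Dict[int, int]:
    """Sum counts per project, skipping rows that carry no project ID."""
    counts_by_project: Dict[int, int] = defaultdict(int)
    for row in rows:
        raw_project_id = row.get(PROJECT_ID_FIELD)
        if raw_project_id is None:
            # Skip instead of defaulting to 0, which is not a valid project ID.
            continue
        counts_by_project[int(raw_project_id)] += int(row.get(COUNT_FIELD) or 0)
    return dict(counts_by_project)
```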

sentry · 2026-05-28T07:36:41Z

+        project_id = _get_aggregate_int(row, DynamicSamplingQueryFields.DSC_PROJECT_ID)
+        project_volumes = volumes_by_project[project_id]
+
+        project_volumes.transaction_counts.append((str(transaction), total))


Bug: A missing or null sentry.dsc.transaction field is converted to the string "None" and stored as a transaction name, leading to incorrect data.
Severity: MEDIUM

Suggested Fix

Before converting the transaction variable to a string, add a check to ensure it is not None. If transaction is None, the row should be skipped to avoid storing "None" as a transaction name. This adds a defensive guard that is missing.

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid.

Location: src/sentry/dynamic_sampling/per_org/tasks/queries.py#L236

Potential issue: In `get_eap_transaction_volumes`, if a row is returned from the query where the `sentry.dsc.transaction` field is missing or null, `row.get(...)` will return `None`. The code then calls `str(transaction)`, which converts `None` into the literal string `"None"`. This string is then stored as a valid transaction name, leading to incorrect data. While a `has:sentry.dsc.transaction` filter in the query is intended to prevent this, the code lacks a defensive check to handle cases where a null value might still be returned, a pattern that is present in similar functions.
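To make the `str(None)` pitfall concrete, here is a small sketch of the defensive check, kept in addition to the server-side `has:` filter. The function name and the `count()` column are hypothetical; whether the extra guard is worth keeping is the judgment call discussed in this thread.

```python
from typing import Any, Dict, List, Tuple

TRANSACTION_FIELD = "sentry.dsc.transaction"  # field name as quoted in this thread
COUNT_FIELD = "count()"  # hypothetical aggregate column name


def collect_transaction_counts(rows: List[Dict[str, Any]]) -> List[Tuple[str, int]]:
    """Return (transaction, count) pairs, dropping rows without a transaction."""
    transaction_counts: List[Tuple[str, int]] = []
    for row in rows:
        transaction = row.get(TRANSACTION_FIELD)
        if transaction is None:
            # Without this guard, str(None) would store the literal name "None".
            continue
        transaction_counts.append((str(transaction), int(row.get(COUNT_FIELD) or 0)))
    return transaction_counts
```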

sentry · 2026-05-28T07:36:41Z

+    if not get_eap_transaction_volumes(config):
+        return DynamicSamplingStatus.NO_TRANSACTION_VOLUMES


Bug: The scheduler prematurely stops for organizations with no transaction volume, incorrectly marking the task as failed and preventing dynamic sampling from running.
Severity: HIGH

Suggested Fix

The early return based on an empty result from get_eap_transaction_volumes should be removed or made conditional. If this data is not yet used, the check is premature. If it is required for specific configurations, it should be guarded by a relevant config flag, similar to how get_eap_project_volumes is handled.

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid.

Location: src/sentry/dynamic_sampling/per_org/tasks/scheduler.py#L120-L121

Potential issue: The scheduler task `run_calculations_per_org_task` unconditionally calls `get_eap_transaction_volumes`. If this function returns an empty list (e.g., for an organization with no recent transactions), the scheduler immediately returns `DynamicSamplingStatus.NO_TRANSACTION_VOLUMES` and stops processing. This incorrectly prevents the dynamic sampling logic from completing successfully for valid organizations that simply have no transaction volume in the query window. Unlike the conditional check for project volumes, this check is always active, blocking the success path (`return None`) for any organization without transactions.
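One way to read the suggested fix is to gate the early return behind a configuration flag. The sketch below is an illustration only: `Config.requires_transaction_volumes`, `Status`, and `run_calculations` are hypothetical stand-ins, not names from the Sentry codebase, and the thread does not show how the project-volume check is actually gated.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Optional, Sequence


class Status(Enum):
    # Stand-in for DynamicSamplingStatus.NO_TRANSACTION_VOLUMES.
    NO_TRANSACTION_VOLUMES = "no_transaction_volumes"


@dataclass
class Config:
    # Hypothetical flag: only set when a later step needs transaction volumes.
    requires_transaction_volumes: bool = False


def run_calculations(
    config: Config,
    get_transaction_volumes: Callable[[Config], Sequence[object]],
) -> Optional[Status]:
    """Return a status on early exit, or None on the success path."""
    # Short-circuit: the query only runs when the flag is enabled.
    if config.requires_transaction_volumes and not get_transaction_volumes(config):
        return Status.NO_TRANSACTION_VOLUMES
    return None
```

With the flag off, an organization with no transactions in the query window still reaches the success path.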

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

There are 2 total unresolved issues (including 1 from previous review).

Reviewed by Cursor Bugbot for commit 7eb1d00.

constantinius requested review from a team as code owners May 8, 2026 07:53

github-actions Bot added the Scope: Backend label May 8, 2026

constantinius requested a review from shellmayr May 8, 2026 08:00

shellmayr reviewed May 8, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py Outdated

constantinius and others added 2 commits May 8, 2026 10:12

Update src/sentry/dynamic_sampling/per_org/tasks/queries.py

e467801

Co-authored-by: Simon Hellmayr <shellmayr@users.noreply.github.com>

🛠️ apply pre-commit fixes

b9d048b

sentry Bot reviewed May 8, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py

shellmayr reviewed May 8, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py Outdated

constantinius added 2 commits May 12, 2026 10:49

fix: simplified row iteration and made it return max_results rows only

900d4b7

fic: using sentry.dsc.project_id instead of actual project id of the span

83b424b

constantinius requested a review from shellmayr May 12, 2026 10:04

cursor Bot reviewed May 12, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py

Merge branch 'master' into constantinius/feat/dynamic-sampling/add-per-org-get-eap-transaction-volumes-v2

e6a53fc

vercel Bot deployed to Preview May 19, 2026 10:47

constantinius added 2 commits May 21, 2026 09:58

fix: switch query from transaction to sentry.dsc.transaction to actually select the root transaction

d0e390c

Merge branch 'master' into constantinius/feat/dynamic-sampling/add-per-org-get-eap-transaction-volumes-v2

d94b08f

vercel Bot deployed to Preview May 21, 2026 08:01

shellmayr reviewed May 21, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py Outdated

ref(dynamic-sampling): update get_eap_transaction_volumes to use Spans.run_table_query and set default max_transactions

c7dcf64

sentry Bot reviewed May 21, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py Outdated

Merge branch 'master' into constantinius/feat/dynamic-sampling/add-per-org-get-eap-transaction-volumes-v2

f98a507

vercel Bot deployed to Preview May 21, 2026 12:52

sentry Bot reviewed May 21, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py Outdated

fix: using project list from config and not direct DB lookup

8be2552

constantinius requested a review from shellmayr May 21, 2026 12:54

constantinius added 2 commits May 27, 2026 12:30

Merge branch 'master' into constantinius/feat/dynamic-sampling/add-per-org-get-eap-transaction-volumes-v2

0a0f344

chore: using enums for filter selections

2a8f9cb

Merge branch 'master' into constantinius/feat/dynamic-sampling/add-per-org-get-eap-transaction-volumes-v2

6d809f0

vercel Bot deployed to Preview May 27, 2026 12:31

shellmayr reviewed May 27, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py Outdated

shellmayr reviewed May 27, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py Outdated

fix(dynamic-sampling): rename dynamic sampling query fields for clarity

e854eb0

shellmayr reviewed May 27, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py Outdated

fix: rename enum usage

ee4048f

shellmayr reviewed May 27, 2026

fix: remove superfluous check

66a55c7

sentry-warden Bot reviewed May 27, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py Outdated

cursor Bot reviewed May 27, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py Outdated

constantinius added 2 commits May 27, 2026 14:49

fix: simplify ordering

f45ed21

fix(dynamic-sampling): update EAP transaction volumes query to include transaction filter and adjust test cases

d9e425b

cursor Bot reviewed May 27, 2026

Comment thread tests/sentry/dynamic_sampling/per_org/tasks/test_queries.py

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py

fix(dynamic-sampling): streamline ordering logic in get_eap_transaction_volumes function

21a0167

shellmayr reviewed May 27, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/queries.py

shellmayr approved these changes May 27, 2026

cursor Bot reviewed May 27, 2026

feat(dynamic-sampling): add transaction volume check and status for dynamic sampling

7eb1d00

constantinius enabled auto-merge (squash) May 28, 2026 07:34

sentry Bot reviewed May 28, 2026

cursor Bot reviewed May 28, 2026

Comment thread src/sentry/dynamic_sampling/per_org/tasks/scheduler.py

tests: fixing tests

85afc9b

constantinius merged commit 2cab4fe into master May 28, 2026
84 checks passed

constantinius deleted the constantinius/feat/dynamic-sampling/add-per-org-get-eap-transaction-volumes-v2 branch May 28, 2026 08:09
