fix: take flaky_fail_count into account for total tests in new TA impl #275
base: main
Conversation
Codecov Report: All modified and coverable lines are covered by tests ✅
✅ All tests successful. No failed tests found.

Additional details and impacted files:

@@            Coverage Diff             @@
##             main     #275      +/-   ##
==========================================
+ Coverage   94.31%   94.33%   +0.01%
==========================================
  Files        1229     1229
  Lines       45293    45296       +3
  Branches     1448     1448
==========================================
+ Hits        42720    42729       +9
+ Misses       2269     2263       -6
  Partials      304      304

Flags with carried forward coverage won't be shown. ☔ View full report in Codecov by Sentry.
In the old TA implementation the possible outcomes were: pass, fail, and skip. Flaky failures were a subset of fails, which meant we shouldn't include the flaky_fail_count in the total tests calculation. In the new implementation that's not the case: flakiness is represented by the outcome of a testrun, which means we have to count flaky fails in the total tests.

This commit also does some light refactoring of the polars aggregation expressions for simplicity, and fixes a bug where we weren't running fill_nan when it was needed.
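As a rough illustration of the change to the totals calculation — a minimal sketch with made-up column names (pass_count, fail_count, flaky_fail_count, skip_count); the real schema may differ:

import polars as pl

# Hypothetical per-test counts; column names are illustrative, not the real schema.
table = pl.DataFrame(
    {
        "pass_count": [10, 5],
        "fail_count": [2, 0],
        "flaky_fail_count": [1, 0],  # in the new model this is its own outcome, not a subset of fails
        "skip_count": [0, 3],
    }
)

# Old-style total: flaky fails were already inside fail_count, so adding them would double-count.
# New-style total: flaky fail is a distinct outcome, so it must be added explicitly.
totals = table.select(
    (
        pl.col("pass_count")
        + pl.col("fail_count")
        + pl.col("flaky_fail_count")
        + pl.col("skip_count")
    ).alias("total_tests")
)
print(totals)  # 13 and 8 total tests for the two rows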
Force-pushed from db169bf to 9626f7d.
.top_k(min(100, max(table.height // 20, 1)))
.sum()
).alias("total_slow_tests"),
pl.lit(num_slow_tests).alias("total_slow_tests"),
@cursor what does .lit do here
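For context on pl.lit: it wraps a plain Python value in a polars literal expression, so a value computed outside the query (here num_slow_tests) can be aliased and selected like a column. A minimal sketch with made-up data:

import polars as pl

df = pl.DataFrame({"duration": [0.1, 2.5, 0.3]})

num_slow_tests = 1  # an ordinary Python int computed outside the query
out = df.select(pl.lit(num_slow_tests).alias("total_slow_tests"))
print(out)  # a single row holding the constant value 1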
@@ -63,11 +68,11 @@ def dedup_table(table: pl.DataFrame) -> pl.DataFrame:
.agg(
pl.col("testsuite").alias("testsuite"),
pl.col("flags").explode().unique().alias("flags"),
failure_rate_expr.fill_nan(0).alias("failure_rate"),
flake_rate_expr.fill_nan(0).alias("flake_rate"),
failure_rate_expr.alias("failure_rate"),
Did the old approach not do the same thing? Or did you just want everything on the same line?
If the former, I kinda thought the other way was more readable
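For what it's worth, the reason fill_nan matters at all (a minimal sketch, not the actual failure_rate_expr): when a group has zero runs, a float division of 0.0 by 0.0 yields NaN rather than null, so fill_nan(0) is what turns the no-data case into a 0% rate.

import polars as pl

df = pl.DataFrame({"fail_count": [0.0, 2.0], "total_count": [0.0, 4.0]})

rates = df.select(
    # 0.0 / 0.0 produces NaN (not null), so fill_null alone would miss it
    (pl.col("fail_count") / pl.col("total_count")).fill_nan(0).alias("failure_rate")
)
print(rates)  # [0.0, 0.5]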
@@ -93,6 +98,17 @@ def _has_commits_before_cutoff(repoid: int) -> bool:
).exists()


@lru_cache(maxsize=1000)
why 1000? Is the default of 128 not good enough for us?
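For reference, maxsize caps how many distinct argument tuples lru_cache keeps before evicting the least recently used entry, so 1000 vs. the default 128 only matters once more than 128 distinct repoids are active between evictions. A minimal sketch with a placeholder body (the real function presumably queries the database):

from functools import lru_cache

@lru_cache(maxsize=1000)
def _has_commits_before_cutoff(repoid: int) -> bool:
    print(f"cache miss for {repoid}")  # printed only the first time a repoid is seen
    return repoid % 2 == 0  # placeholder result, not the real query

_has_commits_before_cutoff(1)
_has_commits_before_cutoff(1)  # served from the cache, no second "cache miss" line
print(_has_commits_before_cutoff.cache_info())  # hits=1, misses=1, maxsize=1000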
how come the test results got swapped around in that test? Couple other small things but nothing major