Consolidate implementation of `calculate_diff` #528

Swatinem · 2025-02-24T12:55:01Z

This cleans up the duplicated implementations of Filtered/Report.calculate_diff and moves it to a new well typed file.

I’m a bit surprised that this does not yield any speedup in the diff calculation benchmarks (compared to #525).

I think the difference between the base and FilteredReport is based on the actual filtering and line-mapping code that happens, instead of the different and less optimized version that was previously there.

                                          Benchmark Results                                          
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━┓
┃                                       Benchmark ┃   Time (best) ┃ Rel. StdDev ┃ Run time ┃  Iters ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━┩
│            test_report_diff_calculation[Report] │         803ns │        0.9% │    2.92s │ 15,975 │
│    test_report_diff_calculation[FilteredReport] │       1,886ns │        1.1% │    2.92s │ 10,437 │
└─────────────────────────────────────────────────┴───────────────┴─────────────┴──────────┴────────┘

codspeed-hq · 2025-02-24T13:11:57Z

CodSpeed Performance Report

Merging #528 will create unknown performance changes

_{Comparing swatinem/unify-calculate-diff (ca08e87) with main (0222015)}

Summary

⚠️ No benchmarks were detected in both the base of the PR and the PR.\

This cleans up the duplicated implementations of `Filtered/Report.calculate_diff` and moves it to a new well typed file.

giovanni-guidini

LGTM. Thanks for adding more types to this part of the code 🥳

As you mentioned no speed improvement, but in the comparison the stddev for the Report class was greatly reduced... that might be a win? Having a more bounded code I guess. The speed report doesn't really report quantiles for us to know, though.

I wonder if using @dataclass instead of TypedDict could improve speed. But my guess is that it would be worse (cause you need to transform from dict into a class instance).

giovanni-guidini · 2025-02-25T15:10:46Z

shared/reports/diff.py

+
+class DiffSegment(TypedDict):
+    lines: list[str]
+    header: tuple[int, int, int, int]


A comment here like # (base_start_line, base_length, head_start_line, head_length) would be useful for people that don't know what the numbers represent

Or breaking up the header further, but might be overkill

giovanni-guidini · 2025-02-25T15:11:09Z

shared/reports/diff.py

+
+
+class DiffFile(TypedDict):
+    type: str


do we know what the possible types are?

joseph-sentry

also lgtm:

to answer one of @giovanni-guidini's questions from above:

shared/shared/torngit/base.py

Line 142 in 3bcf3c9

def diff_to_json(self, diff):

I got confused because i was looking at the response schemas from the Github API and I couldn't find "type" or "segments"

Swatinem · 2025-02-26T09:39:54Z

Thanks for the suggestions. I improved the types and added doc comments.

On that note, I just hate how python doc comments come after the item they are documenting. This is so unintuitive compared to pretty much every other language that does it differently.

overwatch-beta · 2025-03-11T21:03:04Z

✅ Sentry found no issues in your recent changes ✅

Swatinem requested a review from a team February 24, 2025 12:55

Swatinem self-assigned this Feb 24, 2025

Swatinem force-pushed the swatinem/unify-calculate-diff branch from d008aaf to e454565 Compare February 24, 2025 13:08

Swatinem force-pushed the swatinem/unify-calculate-diff branch from e454565 to ec03cd7 Compare February 24, 2025 13:39

Consolidate implementation of calculate_diff

f664924

This cleans up the duplicated implementations of `Filtered/Report.calculate_diff` and moves it to a new well typed file.

Swatinem force-pushed the swatinem/unify-calculate-diff branch from ec03cd7 to f664924 Compare February 25, 2025 12:21

giovanni-guidini approved these changes Feb 25, 2025

View reviewed changes

joseph-sentry approved these changes Feb 25, 2025

View reviewed changes

improve typing and docs

ca08e87

Swatinem added this pull request to the merge queue Feb 26, 2025

Merged via the queue into main with commit e4eb57b Feb 26, 2025
8 checks passed

Swatinem deleted the swatinem/unify-calculate-diff branch February 26, 2025 09:44

JerrySentry restored the swatinem/unify-calculate-diff branch March 11, 2025 21:00

JerrySentry added a commit that referenced this pull request Mar 11, 2025

Add upload-overwatch.yml workflow for PR #528

bda44b2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Consolidate implementation of `calculate_diff` #528

Consolidate implementation of `calculate_diff` #528

Uh oh!

Swatinem commented Feb 24, 2025 •

edited

Loading

Uh oh!

codspeed-hq bot commented Feb 24, 2025 •

edited

Loading

Uh oh!

giovanni-guidini left a comment

Uh oh!

giovanni-guidini Feb 25, 2025

Uh oh!

giovanni-guidini Feb 25, 2025

Uh oh!

joseph-sentry left a comment

Uh oh!

Swatinem commented Feb 26, 2025

Uh oh!

Uh oh!

overwatch-beta bot commented Mar 11, 2025 •

edited

Loading

Uh oh!

Uh oh!

Consolidate implementation of calculate_diff #528

Consolidate implementation of calculate_diff #528

Uh oh!

Conversation

Swatinem commented Feb 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codspeed-hq bot commented Feb 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CodSpeed Performance Report

Merging #528 will create unknown performance changes

Summary

Uh oh!

giovanni-guidini left a comment

Choose a reason for hiding this comment

Uh oh!

giovanni-guidini Feb 25, 2025

Choose a reason for hiding this comment

Uh oh!

giovanni-guidini Feb 25, 2025

Choose a reason for hiding this comment

Uh oh!

joseph-sentry left a comment

Choose a reason for hiding this comment

Uh oh!

Swatinem commented Feb 26, 2025

Uh oh!

Uh oh!

overwatch-beta bot commented Mar 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Sentry found no issues in your recent changes ✅

Uh oh!

Uh oh!

Consolidate implementation of `calculate_diff` #528

Consolidate implementation of `calculate_diff` #528

Swatinem commented Feb 24, 2025 •

edited

Loading

codspeed-hq bot commented Feb 24, 2025 •

edited

Loading

overwatch-beta bot commented Mar 11, 2025 •

edited

Loading