Bundle analysis: create timeseries dataset upon upload #601

JerrySentry · 2024-06-04T15:43:10Z

When the bundle stats file upload endpoint is called, we will create the supported datasets for trend data. Once there are rows in the Dataset model, measurements can start populating during processing of the stats file (this will be implemented next in the worker repo). This means that for BA there will not be an opt-in feature similar to flags/component trends; all users will be automatically opted in. Additionally, there will not be a "backfilling" process for past bundle stats.
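As a rough illustration of the behavior described above, here is a minimal sketch that uses an in-memory stand-in for the Dataset table. The measurement names, `DatasetStore`, and `on_bundle_upload` are hypothetical and are not the code in this PR; the get-or-create semantics are assumed from the review discussion further down.

```python
from dataclasses import dataclass, field

# Hypothetical placeholder names; the real measurement names live in the codebase.
BUNDLE_MEASUREMENT_NAMES = ["bundle_size_example", "asset_size_example"]


@dataclass
class DatasetStore:
    """In-memory stand-in for the Dataset table, keyed by (name, repository_id)."""

    rows: set = field(default_factory=set)

    def get_or_create(self, name: str, repository_id: int) -> bool:
        """Return True if a new row was created, False if it already existed."""
        key = (name, repository_id)
        if key in self.rows:
            return False
        self.rows.add(key)
        return True


def on_bundle_upload(store: DatasetStore, repository_id: int, timeseries_enabled: bool) -> int:
    """Ensure every supported dataset exists for the repo; return how many were created."""
    if not timeseries_enabled:
        # Mirrors the settings.TIMESERIES_ENABLED guard: nothing is created when disabled.
        return 0
    return sum(
        store.get_or_create(name, repository_id) for name in BUNDLE_MEASUREMENT_NAMES
    )
```

Under these assumptions, a repeated upload for the same repository creates nothing new, and no per-repository opt-in step is involved.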

closes codecov/engineering-team#1771

Legal Boilerplate

Look, I get it. The entity doing business as "Sentry" was incorporated in the State of Delaware in 2015 as Functional Software, Inc. In 2022 this entity acquired Codecov and as result Sentry is going to need some rights from me in order to utilize my contributions in this PR. So here's the deal: I retain all rights, title and interest in and to my contributions, and by keeping this boilerplate intact I confirm that Sentry can use, modify, copy, and redistribute my contributions, under Sentry's choice of terms.

codecov-qa · 2024-06-04T15:52:42Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 91.49%. Comparing base (0ee2d5c) to head (8907367).

✅ All tests successful. No failed tests found.

@@           Coverage Diff           @@
##             main     #601   +/-   ##
=======================================
  Coverage   91.49%   91.49%           
=======================================
  Files         615      615           
  Lines       16371    16379    +8     
=======================================
+ Hits        14978    14986    +8     
  Misses       1393     1393

Flag	Coverage Δ
unit	`91.49% <100.00%> (+<0.01%)`	⬆️
unit-latest-uploader	`91.49% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files	Coverage Δ
upload/views/bundle_analysis.py	`98.68% <100.00%> (+0.15%)`	⬆️

codecov-notifications · 2024-06-04T15:52:44Z

Codecov Report

All modified and coverable lines are covered by tests ✅

✅ All tests successful. No failed tests found.

codecov-public-qa · 2024-06-04T15:52:50Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 91.49%. Comparing base (0ee2d5c) to head (8907367).

✅ All tests successful. No failed tests found ☺️

@@           Coverage Diff           @@
##             main     #601   +/-   ##
=======================================
  Coverage   91.49%   91.49%           
=======================================
  Files         615      615           
  Lines       16371    16379    +8     
=======================================
+ Hits        14978    14986    +8     
  Misses       1393     1393

Flag	Coverage Δ
unit	`91.49% <100.00%> (+<0.01%)`	⬆️
unit-latest-uploader	`91.49% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files	Coverage Δ
upload/views/bundle_analysis.py	`98.68% <100.00%> (+0.15%)`	⬆️

codecov · 2024-06-04T15:56:01Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 95.93%. Comparing base (0ee2d5c) to head (8907367).

✅ All tests successful. No failed tests found.

Additional details and impacted files

@@               Coverage Diff                @@
##               main       #601        +/-   ##
================================================
+ Coverage   95.92000   95.93000   +0.01000     
================================================
  Files           793        793                
  Lines         17689      17697         +8     
================================================
+ Hits          16969      16977         +8     
  Misses          720        720

Flag	Coverage Δ
unit	`91.49% <100.00%> (+<0.01%)`	⬆️
unit-latest-uploader	`91.49% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

matt-codecov · 2024-06-04T17:35:32Z

upload/views/bundle_analysis.py

@@ -142,4 +144,29 @@ def post(self, request):
                is_shelter_request=self.is_shelter_request(),
            ),
        )
+
+        if settings.TIMESERIES_ENABLED:


do we want to jump right to creating or do we want to create if it doesn't already exist / isn't backfilled? the latter is what is done for coverage i think

codecov-api/timeseries/helpers.py

Lines 341 to 348 in 66ec9fa

    dataset = Dataset.objects.filter(
        name=MeasurementName.COVERAGE.value,
        repository_id=repository.pk,
    ).first()

    if settings.TIMESERIES_ENABLED and dataset and dataset.is_backfilled():
        # timeseries data is ready
        return coverage_measurements(

I think it's better to create it as soon as possible than to create it when we fetch the measurements. That way we don't miss a datapoint in the general upload->process->query flow: the process step is when we insert the measurement datapoints, and that insert relies on the dataset rows already being present.

With the coverage measurements it's OK, because when a query arrives and the dataset has not been created, it can still compute and return measurements on the fly and then do a backfill. Also, with BA we don't plan on doing backfills because the previous bundle schemas don't support it.
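To make the ordering argument concrete, here is a small sketch of the process step under the assumption stated above, namely that a measurement is only inserted when its dataset row already exists. `process_upload` and the tuple layout are illustrative only, not the worker's actual implementation.

```python
def process_upload(datasets, measurements, repository_id, name, value):
    """Record a measurement only when its dataset row already exists.

    `datasets` is a set of (name, repository_id) keys and `measurements`
    is a list acting as the measurement table; both are stand-ins.
    """
    if (name, repository_id) not in datasets:
        # No dataset row yet, so this datapoint is dropped. With no
        # backfill planned for BA, it would not be recovered later.
        return False
    measurements.append((name, repository_id, value))
    return True
```

If the dataset were only created at query time, the first processed upload would hit the early return; creating it at upload time means the first datapoint is kept.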

oh on second read the code is get_or_create(), i thought it was just create(). my concern was that you would create multiple Datasets with the same type/repoID which sounded wrong. but this is all good

ajay-sentry · 2024-06-07T22:57:43Z

upload/tests/views/test_bundle_analysis.py

+
+
+@pytest.mark.django_db(databases={"default", "timeseries"})
+def test_upload_bundle_analysis_measurement_datasets_created(


jw, is it possible to parametrize this and the other test with the pytest.mark.parametrize() decorator?

codecov-api/services/tests/test_repo_providers.py

Line 41 in 7fc9bc5

@pytest.mark.parametrize("using_integration", [True, False])
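For reference, a parametrized version along the lines suggested here might look like the sketch below. `should_create_datasets` is a hypothetical stand-in for the view's feature-flag check; the real tests in this PR exercise the upload endpoint instead.

```python
import pytest


def should_create_datasets(timeseries_enabled: bool) -> bool:
    """Hypothetical helper standing in for the view's feature-flag check."""
    return timeseries_enabled


@pytest.mark.parametrize(
    "timeseries_enabled, expected",
    [(True, True), (False, False)],
)
def test_datasets_created_only_when_enabled(timeseries_enabled, expected):
    # One test body covers both the enabled and disabled cases.
    assert should_create_datasets(timeseries_enabled) is expected
```

pytest runs the body once per tuple, so the enabled and disabled cases share setup instead of living in two near-identical tests.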

ajay-sentry

lgtm

matt-codecov · 2024-06-10T18:00:14Z

upload/views/bundle_analysis.py

@@ -142,4 +144,29 @@ def post(self, request):
                is_shelter_request=self.is_shelter_request(),
            ),
        )
+
+        if settings.TIMESERIES_ENABLED:


oh on second read the code is get_or_create(), i thought it was just create(). my concern was that you would create multiple Datasets with the same type/repoID which sounded wrong. but this is all good

Bundle analysis: create timeseries dataset upon upload

185a6db

Merge branch 'main' into jun_04_ba_dataset

b207165

JerrySentry marked this pull request as ready for review June 4, 2024 17:26

JerrySentry requested a review from a team as a code owner June 4, 2024 17:26

matt-codecov reviewed Jun 4, 2024

Merge branch 'main' into jun_04_ba_dataset

e21556c

ajay-sentry reviewed Jun 7, 2024

ajay-sentry approved these changes Jun 7, 2024

Merge branch 'main' into jun_04_ba_dataset

bdcafe1

JerrySentry enabled auto-merge June 10, 2024 14:09

Merge branch 'main' into jun_04_ba_dataset

8907367

JerrySentry disabled auto-merge June 10, 2024 17:49

JerrySentry enabled auto-merge June 10, 2024 17:49

matt-codecov approved these changes Jun 10, 2024

JerrySentry added this pull request to the merge queue Jun 10, 2024

Merged via the queue into main with commit bccac24 Jun 10, 2024
21 of 22 checks passed

JerrySentry deleted the jun_04_ba_dataset branch June 10, 2024 18:10
