Skip to content

Conversation

@pvk-developer
Copy link
Member

@pvk-developer pvk-developer commented Feb 28, 2025

Resolves #6
CU-86b40fw92

Here is a completed run from this branch: https://github.com/datacebo/download-analytics/actions/runs/13683203905

I realized that the data collected for the newly added libraries is for the current month. In order to make it consistent with SDV I will run a heavier query for the newly added libraries to make sure they are in sync with sdv's data (basically force the query to run on the newly added libraries since SDV was first released).

Updated results can be found here: https://drive.google.com/drive/u/1/folders/10QHbqyvptmZX4yhu2Y38YJbVHqINRr0n

@pvk-developer pvk-developer changed the base branch from main to issue-7-bump-python-version-for-the-repo February 28, 2025 15:50
Base automatically changed from issue-7-bump-python-version-for-the-repo to main February 28, 2025 17:54
@pvk-developer pvk-developer force-pushed the issue-8-add-new-libraries-to-daily branch from 2e0cb2e to f2282d7 Compare February 28, 2025 18:12
totals = base.sum()
totals.name = 'total'
base = pd.concat([base, totals], ignore_index=True)
base = pd.concat([base, totals.to_frame().T], ignore_index=True)
Copy link
Collaborator

@gsheni gsheni Mar 6, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why add a transpose?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This fixes a bug that I introduced in the previous addition. totals is a series that was 'appended' to the frame and now we are using pd.concat I referenced to this

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is the new output for total:

 base.reset_index().iloc[::-1]['total']
82    2011571.0
81       6536.0
80      65395.0
79      10876.0
78      57403.0
        ...    
4         435.0
3         569.0
2         187.0
1          93.0
0          60.0
Name: total, Length: 83, dtype: float64

@pvk-developer pvk-developer requested a review from gsheni March 7, 2025 10:01
@pvk-developer pvk-developer merged commit 37833b8 into main Mar 8, 2025
1 check passed
@pvk-developer pvk-developer deleted the issue-8-add-new-libraries-to-daily branch March 8, 2025 18:41
gsheni pushed a commit that referenced this pull request Jul 21, 2025
* Add additional libraries to the download analytics collection

* Fix daily build

* Use fixed httplib2 version

* Improve way of inserting columns

* Update metrics.py

* Update metrics.py
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add New Libraries to Daily Download Analytics Collection

4 participants