Update "Analyzing PyPI package downloads" guide to use more efficient file_downloads table #782

@tswast

The the-psf:pypi.file_downloads table is partitioned by day, so time-limited queries are easier to write against it than by concatenating multiple tables. It is also clustered by project, which makes the vast majority of queries people want to run (counting downloads for a specific package) much more efficient.
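
As a rough illustration of the kind of query the updated guide could show, here is a minimal sketch that builds a standard-SQL query filtering on both the partition and the cluster. The column names `timestamp` and `project`, the helper name `build_download_count_query`, and the `@project` query parameter are assumptions made for illustration and should be checked against the actual table schema; the table ID is the standard-SQL spelling of the one named above.

```python
# Standard-SQL form of the table named in this issue (legacy form: the-psf:pypi.file_downloads).
TABLE = "the-psf.pypi.file_downloads"


def build_download_count_query(days: int = 30) -> str:
    """Return SQL counting downloads of one project over the last `days` days.

    Assumes the table has a `timestamp` partitioning column and a `project`
    clustering column; both names are assumptions, not confirmed by the issue.
    """
    if type(days) is not int or days < 1:
        raise ValueError("days must be a positive integer")
    return f"""
SELECT COUNT(*) AS num_downloads
FROM `{TABLE}`
WHERE timestamp >= TIMESTAMP_SUB(CURRENT_TIMESTAMP(), INTERVAL {days} DAY)
  AND project = @project
""".strip()


if __name__ == "__main__":
    # Print the query text; the project name is supplied separately as a parameter.
    print(build_download_count_query(30))
```

The time filter is meant to limit the scan to recent daily partitions, and the equality filter on the project column is meant to take advantage of clustering. The resulting string could then be passed to a BigQuery client with `@project` bound as a STRING query parameter.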
