Skip to content
This repository has been archived by the owner on Dec 20, 2021. It is now read-only.

Removing "bot download" from counts #19

Closed
kootenpv opened this issue Oct 4, 2015 · 3 comments
Closed

Removing "bot download" from counts #19

kootenpv opened this issue Oct 4, 2015 · 3 comments

Comments

@kootenpv
Copy link

kootenpv commented Oct 4, 2015

What does this do about the fact that PyPi is being crawled very often? It would be great if we could get a better estimate of "actual" counts, or did you do something to account for the number of bots out there?

@aclark4life
Copy link
Owner

Nothing and nope; what do you suggest we do to account for "bot download"?

@kootenpv
Copy link
Author

kootenpv commented Oct 4, 2015

Perhaps what is possible is to script uploading a nonsense unguessable package name to pip; every day a new update.

I guess from that we can estimate the number of bots active, say, given a week.

After some weeks/months of data, you can try to model a prediction backwards, and could correct the numbers.

Perhaps someone can think of a better approach.

@aclark4life
Copy link
Owner

Let's assume the fix for this is for vanity to query the dataset mentioned here, instead of relying on PyPI: https://mail.python.org/pipermail/distutils-sig/2016-May/028986.html, now tracking that in #22 .

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants