-
-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
usage_global data for Firefox is implausible #6733
Comments
As for the overall usage, ~2.8% is quite different from the ~5.1% om https://radar.cloudflare.com/adoption-and-usage. But for the analysis I'm doing it's the version breakdown that matters most. I'm sure there are problems with the overall browser breakdown as well, but that's not what this issue is about. |
Looked into this issue and I found the problem, it seems the importer script wasn't resetting values before importing data for a new month. As a result that last time StatCounter would report data for a version that amount was still being used. So it would usually be a small amount but those certainly add up. I've fixed the script to reset all amounts to 0 before importing now which should fix the issue. The issue was specific to the global usage data which is processed a little differently from the regional data, which shouldn't have this issue. For example the standalone worldwide data file (generated like other regions) seen here. Thanks for bringing this up @foolip! Let me know if you run into any other oddities like this. |
Thank you @Fyrd, I see the data was updated in 87ff4c1. Did this issue affect all browsers or only Firefox? https://gist.github.com/foolip/1f6b3538aeeb03223f78c3742f3f3e07 is the new usage. All versions now add up to ~2.36%, was that an intended change? I also see that 0.00477 is common usage number, and that this is the smallest number. 0.00954 also appears and is 2×0.00477. Is there some kind of quantization going on here? These small numbers still add up to make the results an bit implausible. These usage numbers suggest that you need to support Firefox 78 and later to support >95% of Firefox users, because 4% of users are on versions 11+36+43+44+52+56+72. Is the underlying data for this, at least current if not historical, available for inspection? |
This was affecting all browsers. The source of the data is the StatCounter usage data for May, see https://gs.statcounter.com/browser-version-market-share/desktop-console/worldwide/#monthly-202305-202305-bar then "Download data (.csv)". It's multiplied by the Desktop/mobile ratio to result in the numbers you see. There might be some other tweaks but I think that's mostly it. Seems SC rounds usage numbers to two significant digits so 0.01 is the lowest it gets, resulting in values like 0.00477 after multiplying by the desktop/mobile ratio. |
Ah, this is interesting. Do you know if it's rounding, or if it might be flooring/truncating? If it's rounding, then 0.01 means the real usage is in 0.005≤x<0.015. If it's flooring, it means the real usage is 0.01≤x<0.02. Even without knowing the answer to this, perhaps it would be possible to scale the numbers so that they add up to the overall usage of that browser claimed by SC? |
OK, so I've tried it myself. I downloaded the CSV data from these two pages: The numbers in the CSV files don't match exactly what's shown on the pages, but I've compared the percentage from the first CSV file against the sum of versions in the second. The latter is smaller in the cases I've checked:
So it seems like it will always undercount. As long as one is aware of this that's fine. @Fyrd are the overall numbers also in |
Interesting! They are not, I hadn't ever dug quite this deep into the data to notice this discrepancy. |
Is this the data used when computing the global availability show on caniuse.com? If so, then fixing this somehow should move the numbers closer to 100%. |
You can use DNSSEC as an easy test for this. All browsers have partial support, so the number should be 100%, but it is currently 98.32% |
Yep, looks like the sum of all of the I actually wouldn't expect these numbers to add up to 100% since there is no "other" category, but I also don't know if 98.362% is the right number. |
Yes, I'll definitely look into doing so. |
In caniuse-db 1.0.30001574, the US usage of Firefox 11 (released in 2012) has jumped to 0.20214%, which is pretty implausible and surprising for those using a |
@andersk Can confirm this is coming from the StatCounter data (download CSV for the full data). Occasionally there are some strange anomalies. You can use their feedback form to inquire further, I can update the caniuse data if they make a change. |
This is the
usage_global
data from https://raw.githubusercontent.com/Fyrd/caniuse/cc3ccd6ca9c69dd811d2952d873abcd0750e34a2/fulldata-json/data-2.0.json:Firefox usage by version
There is something quite implausible about these numbers. The sum is ~2.8%, but versions 2-71 together make up ~0.48%. Firefox 72 was released on 2020-01-07, so that would mean that ~17% of Firefox users are on a version more than 3.5 years old.
It also seems suspect that usage is relatively evenly spread among all those old versions.
@Fyrd can you say anything about the source of this data and what might be going on here? Might it also affect the numbers for other browsers?
The text was updated successfully, but these errors were encountered: