Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Decreasing number of annotations in google patents research in recent batches #88

Open
complexly opened this issue Nov 28, 2023 · 0 comments

Comments

@complexly
Copy link

Could someone help me understand why the number of annotations in google patents research are dropping in recent batches?

202208 verison: 59,089,580,018 rows
202212 verison: 60,246,963,593 rows
202304 verison: 63,130,241,301 rows
202307 verison: 48,064,657,811 rows
current verison: 41,000,981,833 rows

Data size seems increasing before 202307 but decreasing greatly afterwards. Is it due to model changes/coverage change, or some other reason? And which vesion of data should I rely on more for some analysis? Is the most recent version most trustable? Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant