-
Notifications
You must be signed in to change notification settings - Fork 62
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Consistent Data Loss over Multiple Pulls #22
Comments
33% is high, but not unheard of. If you like I can verify if you can share the tweet ids for one of them? |
Sure thing. I appreciate you taking a look. Thanks! |
@edsu That is pretty similar. I had 411,909 and 411,908 on two different requests. It appears that it may just be that the collection is somewhat volatile. Thanks for running it through twarc. I wanted to try that, but had trouble inputting my keys into the program. |
Ok, I'm glad that things seem to be similar. One thing you can do if you are interested in working with the original data is privately reach out to the person who collected it and see if they are willing to share it with you for research purposes. Let me know if you have trouble figuring out contact information if the dataset is in the DocNow Catalog. |
I ran 14 datasets through the tool and each returned a dataset with roughly 33% data loss. However, I've noticed that screenshots of other pulls have differing values. Is my consistent loss due to my Developer Account/App permissions or is it just chance and simply due to the dataset?
The text was updated successfully, but these errors were encountered: