-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rounding errors found in IDs #24
Comments
df['ID'].astype(str).apply(lambda x: len(x)).value_counts() outputs:
Meaning a majority of tweets have 19 digits. |
The maximum ID value as an integer is |
Looking only at IDs that end in all_ids = [x for x in df['ID'].astype(str).tolist() if x[-2:] == '00'] Using
|
✅ confirmed. This was deleted. |
Cannot scroll back far enough. Feed for Exelon stops in 2019. The original tweet exists though. Not exactly sure what happened here. |
✅ confirmed. This was deleted. |
✅ Are all the Lowe's tweets we know about |
✅ Looks like the original tweet was deleted |
Seems like in the raw data pull ( |
Pandas supports 64 bit integers by default, and Twitter suggests that's what it's using. Still can't figure out how that error creeped in, but it should be all set now. |
Oembed process revealed IDs that had been rounded:
This exists within the dataset in
fortune-100-blm-dataset
. I believe I manually entered values for Lowe's because of API limits, so that might be what's happening.00
to double check that it's limited to these tweets.fortune-100-blm-dataset
repo and double check scripts.The text was updated successfully, but these errors were encountered: