twarc2 hydrate #697

Closed
frogger opened this issue May 5, 2023 · 2 comments
Comments

frogger commented May 5, 2023

Hey, I am having real problems migrating to the Twitter API v2 since they pulled my app and I had to pay for a new one :( My current code doesn't work anymore. I tried twarc2 hydrate and got an "invalid literal for int()" error. I stripped my data back to just the tweet IDs, with no header etc., and that seems to work. However, I am struggling to comprehend the output, as it is completely different. Any advice would be appreciated. This is for my master's dissertation, so I'm getting a bit stressed!!
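For anyone hitting the same error: a minimal sketch of the "strip back to just the IDs" step, assuming the original data is a CSV with a header row and an ID column named `id` (the column name, the sample rows, and the helper name `extract_tweet_ids` are all hypothetical). The int() error is likely caused by a non-numeric line such as the header, so the sketch keeps only purely numeric values.

```python
import csv
import io


def extract_tweet_ids(csv_text, id_column="id"):
    """Return the tweet IDs found in CSV text as strings, skipping non-numeric values."""
    reader = csv.DictReader(io.StringIO(csv_text))
    ids = []
    for row in reader:
        value = (row.get(id_column) or "").strip()
        # Keep IDs as strings so no digits are lost; drop blanks and stray text.
        if value.isdigit():
            ids.append(value)
    return ids


# Hypothetical sample data standing in for the real export.
sample = "id,text\n1654321098765432109,hello\n1654321098765432110,world\n"
ids = extract_tweet_ids(sample)

# One ID per line, no header: the shape that worked for hydration above.
print("\n".join(ids))
```

Writing those lines to a file (say `ids.txt`, a placeholder name) should give an input that can be passed to `twarc2 hydrate ids.txt hydrated.jsonl`.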

edsu (Member) commented May 5, 2023

Yes, the Twitter API v2 data is quite different. You may be interested in "flattening" the data after collecting it, which makes it a little easier to process. Some details are in: https://news.docnow.io/twarc2-779278e66ea0
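To illustrate what "flattening" means here, a simplified sketch (not twarc's actual implementation): a v2 response page keeps tweets under `data` and the referenced users under `includes`, and flattening copies each tweet's author into the tweet itself. The `author` key, the function name `flatten_page`, and the sample page are assumptions made for this example.

```python
def flatten_page(page):
    """Return one self-contained dict per tweet in a v2-style response page."""
    # Index the included users by ID so each tweet can look up its author.
    users = {u["id"]: u for u in page.get("includes", {}).get("users", [])}
    flat = []
    for tweet in page.get("data", []):
        merged = dict(tweet)  # shallow copy, the input page is left untouched
        merged["author"] = users.get(tweet.get("author_id"))
        flat.append(merged)
    return flat


# Hypothetical page, shaped like a v2 response with an author_id expansion.
page = {
    "data": [
        {"id": "1654321098765432109", "author_id": "12", "text": "hello"},
        {"id": "1654321098765432110", "author_id": "13", "text": "world"},
    ],
    "includes": {
        "users": [
            {"id": "12", "username": "alice"},
            {"id": "13", "username": "bob"},
        ]
    },
}

tweets = flatten_page(page)
for t in tweets:
    print(t["id"], t["author"]["username"], t["text"])
```

In practice there is no need to write this by hand: twarc2 has a `flatten` command, e.g. `twarc2 flatten hydrated.jsonl flattened.jsonl` (file names are placeholders), which also resolves other kinds of expansions.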

edsu closed this as completed May 5, 2023

igorbrigadir (Contributor) commented

Also check out twarc-csv (https://github.com/DocNow/twarc-csv) for exploring the data. Note that Excel handles tweet and user IDs poorly, so be careful to import those columns explicitly as TEXT; otherwise the IDs are corrupted. Google Sheets works, though.
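A small sketch of why the IDs get corrupted, using a Python float as a stand-in for a spreadsheet's numeric cell (an approximation for illustration; Excel's exact behaviour differs in detail). Tweet IDs have around 19 digits, more than a double-precision number can hold exactly, so they only survive if they stay text. The sample ID and the helper name `read_rows_as_text` are made up.

```python
import csv
import io

tweet_id = "1654321098765432109"  # hypothetical 19-digit ID

# Round-tripping through a float silently changes the trailing digits.
as_number = int(float(tweet_id))
print(as_number == int(tweet_id))  # False


def read_rows_as_text(csv_text):
    """Read CSV rows with every field kept as a string."""
    # The csv module never converts fields to numbers, so IDs stay intact.
    return list(csv.DictReader(io.StringIO(csv_text)))


rows = read_rows_as_text("id,author_id\n1654321098765432109,12\n")
print(rows[0]["id"] == tweet_id)  # True
```

The same idea applies in other tools: declare the ID columns as text or string on import rather than letting the tool guess a numeric type.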