This repository has been archived by the owner on Apr 27, 2018. It is now read-only.

New Twitter Features: Few Suggestions, Request for Further Suggestions #217

Open
ianmilligan1 opened this issue Apr 6, 2016 · 1 comment

@ianmilligan1
Collaborator

Right now we've got URL extraction, language extraction, hashtag extraction, and image extraction. We should document a few more features. I think we could start with:

  • plain text extraction (one line of text per tweet);
  • user extraction (e.g. the top ten users in a corpus);
  • retweet tracking (e.g. the top ten most-retweeted tweets).

There may be other requests, so please let us know in this issue.
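
For discussion, here is a minimal Python sketch of the three features proposed above. It is not existing project code. It assumes the corpus is line-delimited JSON in the classic Twitter API shape (`text`, `user.screen_name`, and `retweeted_status.id_str` on retweets), and the function names are hypothetical.

```python
import json
from collections import Counter


def parse_tweets(lines):
    """Yield one dict per non-empty line of line-delimited tweet JSON."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def tweet_texts(tweets):
    """Return each tweet's text on a single line (internal whitespace collapsed)."""
    return [" ".join(t.get("text", "").split()) for t in tweets]


def top_users(tweets, n=10):
    """Return the n most frequent screen names as (name, count) pairs."""
    return Counter(t["user"]["screen_name"] for t in tweets).most_common(n)


def top_retweeted(tweets, n=10):
    """Return (original tweet id, retweets seen in this corpus) pairs.

    Assumption: a retweet carries the original tweet under `retweeted_status`.
    """
    counts = Counter(
        t["retweeted_status"]["id_str"] for t in tweets if "retweeted_status" in t
    )
    return counts.most_common(n)
```

Usage would look something like `tweets = list(parse_tweets(open("tweets.json")))` followed by `top_users(tweets)`, where `tweets.json` is a placeholder file name. Counting retweets as they appear in the corpus, rather than trusting a `retweet_count` field, keeps the numbers tied to what was actually collected.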

@ruebot
Contributor

ruebot commented Apr 6, 2016

Might as well list everything we can do with the twarc utils.
The twarc utils that I use heavily:

  • deduplicate
  • embeds (embedded media in a tweet)
  • filter_date
  • geojson
  • ids
  • retweets
  • tags (hashtags)
  • unshorten
  • users
  • validate

jq queries:

  • extract text
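
To give a sense of what documenting these might involve, here is a rough plain-Python illustration of what `deduplicate` and `ids` do conceptually. This is not twarc's own code and does not call twarc. It assumes line-delimited tweet JSON with an `id_str` field, and the function names are made up for the example.

```python
import json


def deduplicate(lines):
    """Yield each tweet once, keeping the first occurrence of every id."""
    seen = set()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        tweet = json.loads(line)
        if tweet["id_str"] not in seen:
            seen.add(tweet["id_str"])
            yield tweet


def tweet_ids(tweets):
    """Return the id of every tweet, in input order."""
    return [t["id_str"] for t in tweets]
```

Because `deduplicate` is a generator, it can be chained directly, e.g. `tweet_ids(deduplicate(open("tweets.json")))`, with `tweets.json` again a placeholder.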
