GitHub - benmarwick/AAA2011-Tweets: R code for analyzing tweets relating to #AAA2011 (text mining, topic modelling, network analysis, clustering and sentiment analysis)

benmarwick / AAA2011-Tweets Public

Notifications You must be signed in to change notification settings
Fork 83
Star 71

R code for analyzing tweets relating to #AAA2011 (text mining, topic modelling, network analysis, clustering and sentiment analysis)

71 stars 83 forks Branches Tags Activity

Star

Notifications

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
AAA2011.R		AAA2011.R
README		README

Repository files navigation

R code for obtaining and analysing tweets from the 2011 meeting of the American Anthropological Association

The code details ten steps in the analysis and visualisation of the tweets:

1.  acquiring the raw Twitter data
2.  calculating some basic statistics with the raw Twitter data
3.  calculating  some basic retweet statistics
4.  calculating the ratio of retweets to tweets
5.  calculating some basic statistics about URLs in tweets
6.  basic text mining for token frequency and token association analysis
7.  calculating senitment scores of tweets, including on subsets containing tokens of interest
8.  hierarchical clustering of tokens based on multiscale bootstrap resampling
9.  topic modelling the tweet corpus using latent Dirichlet allocation
10. network analysis of tweeters based on retweets




Author: Ben Marwick
Contact: http://faculty.washington.edu/bmarwick/how-contact-me
Licence: http://creativecommons.org/licenses/by-nc-sa/2.0/
Date: Dec 2011