GitHub

VISUALIZE A TWITTER ACCOUNT

OUTPUT

As an example, here is the visualization I got for my twitter account:

SUMMARY

This script will output a visualization of you twitter account. The script will gather the last 200 tweets of all the accounts you follow and it will create a visualization that shows the following:

Username, account creation date, number of followers, and of accounts you follow
Number of tweets, retweets, and replies
Time series of last month's number of tweets
Wordcloud of tweets, retweets, replies, and all tweets together (This blog post explains how the wordclouds are done: http://peekaboo-vision.blogspot.de/2012/11/a-wordcloud-in-python.html)
Top 20 hashtags
Top 4 languages all tweets
Sentiment analysis of tweets (neutral tweets count as positive)
A map that shows where the people/accounts you follow are located

FILES:

viz_twitter.py: Script that outputs the visualization

USE

Clone project (https://help.github.com/articles/cloning-a-repository/)
Make sure you are running Python 3.X and install libraries in requirements.txt (pip install -r /path/to/requirements.txt). Use pip install libraryname to install any other library. You'll also need to install stopwords, both for English and Spanish: http://blog.nlpapi.co/how-to-install-nltk-corporastopwords/
Get your twitter keys:

Create a Twitter user account if you do not already have one.
Go to https://apps.twitter.com/ and log in with your Twitter user account. This step gives you a Twitter dev account under the same name as your user account.
Click “Create New App”
Fill out the form, agree to the terms, and click “Create your Twitter application”
In the next page, click on “Keys and Access Tokens” tab, and copy your “API key” and “API secret”. Scroll down and click “Create my access token”, and copy your “Access token” and “Access token secret”.

Add your keys to seetings.py file. Also add extra stopwords so they don't show up in wordclouds.
Inside the folder data place the file Sentiment Analysis Dataset.csv. You can get this file from http://thinknook.com/twitter-sentiment-analysis-training-corpus-dataset-2012-09-22/ .
Run script and enjoy your viz!!

OTHER INFO

Sentiment analysis uses a logistic regression model and it's only performed on tweets in English. This is the training set: http://thinknook.com/twitter-sentiment-analysis-training-corpus-dataset-2012-09-22/. Other models (Random Forest, SVM) were tested using CV, the best performing model was logistic regression.
Next step will be to apply topic modeling (LDA) based on this paper: http://users.cecs.anu.edu.au/~ssanner/Papers/sigir13.pdf
Script took XXX seconds to run with my twitter account. Getting the language of the tweets takes most of the time.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
data		data
images		images
models		models
.gitignore		.gitignore
README.md		README.md
Untitled.ipynb		Untitled.ipynb
requirements.txt		requirements.txt
settings.py		settings.py
viz_twitter.py		viz_twitter.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VISUALIZE A TWITTER ACCOUNT

OUTPUT

SUMMARY

FILES:

USE

OTHER INFO

About

Releases

Packages

Languages

nachocarracedo/my_twitter

Folders and files

Latest commit

History

Repository files navigation

VISUALIZE A TWITTER ACCOUNT

OUTPUT

SUMMARY

FILES:

USE

OTHER INFO

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages