Not intended for production use
- Install requirements using `pip install -r requirements.txt`
- Ensure a Postgres DB called `tweetstream` exists, with user `streamer` and password `streamerp`
- Run DB migrations using `alembic upgrade head` to create the database table and indices
- Get Twitter API credentials: https://dev.twitter.com/apps/new
- Ensure you have a `keys.py` file containing the following string variables:
    - `con_key`: the API consumer key
    - `con_secret`: the API consumer secret
    - `acc_key`: the API access key
    - `acc_secret`: the API access secret
- In `main()`, change the `to_follow` variable to the Twitter user whose followers' Tweets you wish to retrieve
- Run `python getstream.py` from the command line
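As an illustration, a minimal `keys.py` might look like the following. The variable names come from the list above; the values shown are placeholders, not real credentials:

```python
# keys.py -- Twitter API credentials
# Placeholder values shown; substitute the keys from your own Twitter app.
con_key = "your-consumer-key"
con_secret = "your-consumer-secret"
acc_key = "your-access-key"
acc_secret = "your-access-secret"
```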
- The Tweepy library is used to connect to Twitter using OAuth
- The Twitter firehose is then filtered to show only the tweets from accounts following a given account – in this case @brockleycentral.
- The tweets are streamed into a PostgreSQL database using the SQLAlchemy library and a coroutine. This allows offline retrieval and analysis using e.g. the Pandas data analysis library (see below).
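The coroutine-based sink described above can be sketched as follows. This is a minimal illustration of the pattern only: it uses the stdlib `sqlite3` module in place of SQLAlchemy/Postgres, and the table and column names are invented for the example, not taken from the project's schema:

```python
import sqlite3

def coroutine(func):
    # Decorator that primes a generator so it is ready to receive
    # values via send() immediately.
    def start(*args, **kwargs):
        gen = func(*args, **kwargs)
        next(gen)
        return gen
    return start

@coroutine
def tweet_sink(conn):
    # Receives (id, text) tuples one at a time and inserts each
    # into the database as it arrives from the stream.
    while True:
        tweet = yield
        conn.execute("INSERT INTO tweets (id, text) VALUES (?, ?)", tweet)
        conn.commit()

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tweets (id INTEGER PRIMARY KEY, text TEXT)")
sink = tweet_sink(conn)

# In the real script, the stream listener would call send() for each
# incoming status; here we simulate two arriving tweets.
sink.send((1, "hello"))
sink.send((2, "world"))
```

The advantage of the coroutine form is that the database connection is set up once, and each incoming tweet is then pushed into the same suspended function, rather than re-establishing state per tweet.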
If you wish to visualise the data, an IPython notebook is provided.
For offline analysis (using dumped CSV data), run this IPython notebook.
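As a hint of what the offline Pandas analysis looks like, here is a minimal sketch over a CSV dump. The column names (`id`, `created_at`, `text`) are illustrative assumptions, not the project's actual schema:

```python
import io
import pandas as pd

# A tiny stand-in for a dumped CSV of tweets (columns are illustrative).
csv_data = io.StringIO(
    "id,created_at,text\n"
    "1,2014-01-01 10:00:00,hello\n"
    "2,2014-01-01 11:30:00,world\n"
)
tweets = pd.read_csv(csv_data, parse_dates=["created_at"])

# e.g. how many tweets were posted in each hour of the day
per_hour = tweets["created_at"].dt.hour.value_counts()
```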
A subset of tweets is available as a zipped database dump: `tweets.db.zip`. If you wish to use this for analysis, ensure your DB exists with the correct credentials, but do not run the `alembic upgrade` command, as the structure will be created by the import: unzip the file, and import it into Postgres.
Copyright Stephan Hügel, 2014
License: MIT