This project hosts scripts for loading features from several Ferguson-related Twitter datasets housed at MITH. Once loaded into Redis, the features are used to generate a report of summary statistics. The work was done as part of a study conducted by Rashawn Ray, Neil Fraistat, Melissa Brown and Ed Summers at the University of Maryland.
Unfortunately Twitter's terms of service do not currently allow the
five datasets used in this study to be made publicly available. They do,
however, let third parties distribute datasets of tweet IDs. This repository
includes the tweet identifiers for 31,689,607 tweets from 5 time periods.
Each dataset was obtained either by searching the Twitter API for the word
"ferguson" or by using the filter streaming API for the word "ferguson".
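
For readers curious about what that kind of collection looks like in practice, here is a minimal sketch using the twarc library. The library choice, credentials and output filename are all assumptions for illustration, not a record of how these datasets were actually built:

    # Illustrative sketch only: collecting tweets that mention "ferguson"
    # from the filter streaming API with twarc (pip install twarc).
    # The credentials and output filename below are placeholders.
    import json
    from twarc import Twarc

    t = Twarc(consumer_key="...", consumer_secret="...",
              access_token="...", access_token_secret="...")

    with open("ferguson-stream.json", "w") as out:
        # filter() follows the streaming API; a search API collection
        # would use t.search("ferguson") instead
        for tweet in t.filter(track="ferguson"):
            out.write(json.dumps(tweet) + "\n")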
More details about these datasets can be obtained from the Maryland Institute for Technology in the Humanities.
To get started you'll want to install Python and then run:
pip install -r requirements.txt
Then you'll need to reconstitute the Twitter data. Unfortunately this will take a few weeks to fetch all 31 million tweets again, or at least the ones that have not been deleted:
./hydrate.sh
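
hydrate.sh does the work of turning the tweet IDs back into full tweet JSON. As a rough sketch of what hydration involves (the twarc library, credentials and filenames here are assumptions; the script itself is what this repository actually uses):

    # Rough sketch of hydration: look up tweet IDs and write the tweets
    # back out as line-oriented JSON. Deleted tweets are simply skipped.
    # The filenames and credentials are placeholders.
    import json
    from twarc import Twarc

    t = Twarc(consumer_key="...", consumer_secret="...",
              access_token="...", access_token_secret="...")

    with open("ids.txt") as ids, open("tweets.json", "w") as out:
        # hydrate() batches the IDs into lookup requests against the API
        for tweet in t.hydrate(ids):
            out.write(json.dumps(tweet) + "\n")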
Once that's done you can run your own analysis on the resulting line-oriented JSON data, or load the data into Redis to generate our report:
./load.py
./report.py
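
If you'd rather poke at the hydrated data yourself, it is line-oriented JSON with one tweet per line. A minimal sketch of that kind of analysis, counting the most common hashtags (the filename is an assumption):

    # Illustrative only: count the ten most common hashtags in the
    # hydrated line-oriented JSON. The filename is a placeholder.
    import json
    from collections import Counter

    hashtags = Counter()
    with open("tweets.json") as f:
        for line in f:
            tweet = json.loads(line)
            for tag in tweet.get("entities", {}).get("hashtags", []):
                hashtags[tag["text"].lower()] += 1

    for tag, count in hashtags.most_common(10):
        print(tag, count)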