Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
The Digital Methods Initiative Twitter Capture and Analysis Toolset (DMI-TCAT) is a set of tools to retrieve and collect tweets from Twitter and to analyze them in various ways. It is written mostly in PHP and runs in a webserver (LAMP) environment. On a Ubuntu or Debian machine, installation can be done with one command.
DMI-TCAT provides robust and reproducible data capture and analysis, and interlinks with existing analytical software. Analytical features are based on two main ideas:
- captured datasets can be refined in different ways (search queries, exclusions, date range, etc.) and
- the resulting selections of tweets can be analyzed in various ways, mainly by outputting files in standard formats (CSV for tabular files and GEXF for network files)
Check out the screenshots below to get an idea of the selection and overview as well as some of the possible file exports.
The following paper explains the rationale behind the tool and introduces some of the analytical possibilities. Please reference it when using DMI-TCAT for academic publications.
- Erik Borra, Bernhard Rieder, (2014) "Programmed method: developing a toolset for capturing and analyzing tweets", Aslib Journal of Information Management, Vol. 66 Iss: 3, pp.262 - 278.
- Installation Guide
- Upgrading TCAT
- Follow http://twitter.com/dmitools for updates on DMI-TCAT