Code related to a working paper that was first presented at the AFSP Annual Meeting in Paris, 2013. See Section 1 of this paper and its appendix, or read the HOWTO below for a technical summary.
- June 2014 – Major update
  * Updated working paper
  * Added new appendix
  * Added five media scrapers
  * Updated Google Trends data
- June 2013 – First release
The scraper currently collects slightly over 6,300 articles from
- ecrans.fr (including articles from liberation.fr)
- lemonde.fr (first lines only for paid content)
- lesechos.fr (left-censored: no articles before December 2011)
- lefigaro.fr (first lines only for paid content)
- numerama.com (including old articles from ratiatum.com)
- zdnet.fr
The entry point is `make.r` (a usage sketch follows the list):

- `get_articles` will scrape the news sources (adjust the page counters to the current website search results to update the data)
- `get_corpus` will extract all entities and list the most common ones (set the minimum frequency with `threshold`; defaults to 10)
- `get_ranking` will export the top 15 central nodes of the co-occurrence network to the `tables` folder, in Markdown format
- `get_network` returns the co-occurrence network, optionally trimmed to its top quantile of weighted edges (set with `threshold`; defaults to 0)
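A minimal usage sketch, assuming that sourcing `make.r` loads the four routines into the workspace and that each accepts the arguments described above; the exact call signatures and return values in the scripts may differ.

```r
# Minimal usage sketch: function names come from the list above, but the
# exact arguments and return values are assumptions about the scripts.
source("make.r")

get_articles()                     # scrape the news sources (long-running)
get_corpus(threshold = 10)         # keep entities appearing at least 10 times
net <- get_network(threshold = 0)  # full weighted co-occurrence network
get_ranking()                      # export top 15 central nodes to tables/
```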
The corpus-building routines save three datasets (see the loading sketch below):

- `corpus.terms.csv` – a list of all entities, ordered by their raw counts
- `corpus.freqs.csv` – a list of the entities found in each article
- `corpus.edges.csv` – a list of undirected weighted network ties
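A hedged sketch of reading these exports back into R; the column layouts noted in the comments are assumptions about the CSV files, not documented formats.

```r
# Read the exported datasets; the column descriptions are assumptions.
terms <- read.csv("corpus.terms.csv")  # entity and raw count
freqs <- read.csv("corpus.freqs.csv")  # article identifier and entity
edges <- read.csv("corpus.edges.csv")  # entity pair and tie weight

# Rebuild the undirected weighted network with igraph; if the edge list has
# a 'weight' column, igraph stores it as an edge attribute automatically.
library(igraph)
net <- graph_from_data_frame(edges, directed = FALSE)
```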
- The weighting scheme is inversely proportional to the number of entity pairs in each article (e.g. an article mentioning three entities forms three pairs, so each pair gets a weight of 1/3).
- The weighted degree formula is by Tore Opsahl and uses an alpha parameter of 1, which makes the measure equal to node strength (the sum of tie weights); a short sketch of both computations follows.
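A short sketch of both computations on toy data, assuming the inverse weighting assigns each pair 1 divided by the number of pairs in its article; this is an illustration, not the actual code in the repository.

```r
library(igraph)

# Toy data: article 1 mentions three entities, article 2 mentions two.
freqs <- data.frame(article = c(1, 1, 1, 2, 2),
                    entity  = c("A", "B", "C", "A", "D"))

# (1) Inverse weighting: each co-occurring pair gets 1 / (number of pairs in
#     the article), e.g. 3 entities -> 3 pairs -> weight 1/3 each.
#     (Articles mentioning a single entity would have to be skipped.)
pairs <- by(freqs, freqs$article, function(a) {
  p <- t(combn(sort(unique(a$entity)), 2))
  data.frame(i = p[, 1], j = p[, 2], weight = 1 / nrow(p))
})
edges <- aggregate(weight ~ i + j, data = do.call(rbind, pairs), FUN = sum)

# (2) Opsahl's generalised degree: k_i * (s_i / k_i)^alpha, where k_i is the
#     number of ties of node i and s_i the sum of its tie weights; with
#     alpha = 1 the measure reduces to node strength s_i.
net   <- graph_from_data_frame(edges, directed = FALSE)
alpha <- 1
k <- degree(net)
s <- strength(net)
weighted_degree <- k * (s / k)^alpha
```

With alpha = 1 the last line is equivalent to `strength(net)`; values of alpha between 0 and 1 would interpolate between the number of ties and the sum of tie weights.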