Contents of this directory
CrisisLexRec.txt is our recommended lexicon for Twitter querying. It contains an automatically-generated and human-curated list of 380 terms found to be frequently related to disasters. This was done by studying terms frequency distribution in the CrisisLexT6 collection; looking for terms that were discriminative and frequent in crisis tweets, and common across various crises; and performing a crowdsourced curation step to curate the list.
For details please consult the following reference:
[Olteanu et al. 2014] Alexandra Olteanu, Carlos Castillo, Fernando Diaz, Sarah Vieweg: "CrisisLex: A Lexicon for Collecting and Filtering Microblogged Communications in Crises". ICWSM 2014.
For inquiries please contact Alexandra Olteanu, or Carlos Castillo, or Fernando Diaz, or Sarah Vieweg.
- 2014-10-26: v1.0, initial release.