MPEDS Annotation Interface
The MPEDS Annotation Interface helps facilitate the creation of protest event datasets. While applied to the specific task of coding for protest events, this could also plausibly be used for the development of other types of event datasets or other types of text annotations tasks.
This system is built in Python using the Flask microframework. It can source articles parsed from Lexis-Nexis (using the
split-ln.py script), Apache Solr, or XML files formatted in News Industry Text Format, such as the LDC's New York Times Annotated Corpus.
To populate the database with example information, first run the setup script.
This will add five users: an admin (admin), two first-pass coders (coder1p_1, coder1p_2), and two second-pass coders (coder2p_1, coder2p_2). They will all have the password
default). It will add a variable hierarchy for second-pass coding. It will also enter metadata for all the articles in the
example-articles directory, and queue them up for the first-pass coders.
Then run the Flask test server with the following.
Development of this interface has been supported by a National Science Foundation Graduate Research Fellowship and National Science Foundation grants SES-1423784 and SES-1918342. Thanks to Emanuel Ubert, Katie Fallon, and David Skalinder for working with this system since its inception, and to countless annotators who have put a significant time working with and refining this system.