Skip to content
Given a Data For Research citations.xml file from JSTOR, this suite of software will cache content locally, index it, do some analysis against it, and create a few visualizations. It is meant to support "distant reading" against sets of scholarly journal articles.
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.


Given a citations.xml file, this suite of files will cache and index content identified through JSTOR's Data For Research service. The resulting (and fledgling) reports created by this suite enables the reader to "read distantly" against a collection of journal articles.

The suite requires a hodgepodge of software: Perl, Python, and the Bash Shell. Your milage may vary.

Sample usage: cat etc/citations-thoreau.xml | bin/ thoreau

This software is distributed under the GNU Public License.

"Release early. Release often".

Eric Lease Morgan

June 30, 2015

You can’t perform that action at this time.