Code used by Goldstone and Underwood in analyzing topic models of literary-studies journals
R Python Perl
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
coha
corpus_report
explore
minor_R_scripts
perl
python
.gitignore
AHR_metadata.tsv
MLR_mislabeled_fla.CSV
Makefile
README.md
StoplistCreator.py
UK2UStransrules.csv
Uk2USdiff.tsv
analyze_corpus.R
analyze_model.R
document_lengths.py
dubiousStopwords.txt
figures.Rmd
fix_outputs.R
fix_wkfs.R
getslopes.R
hls_wordcounts.tsv
hls_yearlycounts.tsv
instance_hls_k100_v100000.R
instance_hls_k48_v100K.R
journals_in_corpus.R
logging.R
make_figures.R
make_instance.R
make_instance_nobrit.R
make_model.R
make_stack_graph.R
merged_metadata.tsv
model_hls_k100_v100000.R
model_hls_k150_v100K.R
model_hls_k48_v100K.R
modeling_log.md
readme.txt
run_HLS_instance.R
stoplist.tsv
stoplist_final.py
stoplist_final.txt
topic_browser.R

README.md

This repository holds source code created by Andrew Goldstone and Ted Underwood for a project on topic-modeling the history of literary scholarship. We make it public so that others may, if they choose, rerun our analyses.

The R code depends on general-use functions found in a companion repository, dfr-analysis. Other researchers who wish to develop their own topic-modeling analyses may find the scripts in that repository more useful.

We are not able to release our source data, but we will supply the necessary JSTOR Data for Research queries on request. The topic models we generated are also available on request.

This repository's history has been edited to remove drafts of our forthcoming essay. A preprint version that essay, forthcoming in New Literary History, may be found at https://www.ideals.illinois.edu/handle/2142/49323.