Files for the final project of HIST3814o. Analysis of the Shawville Equity
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
mallet
README.md
commonwords.txt
equity-topics-docs-1970-1980.csv
equity-topics-docs-mun-el.csv
equity-topics-docs.csv
equity-topics-labels-1970-1980.csv
equity-topics-labels-mun-el.csv
equity-topics-labels.csv
equity_finding_aid.R
equity_finding_aid_mun.R
equity_finding_aid_with_word_frequency.R
equity_mallet_topic_modeller.R
equity_mallet_topic_modeller_1970-1980.R
equity_mallet_topic_modeller_mun.R
equityeditions.csv
equityeditions.html
equityeditions_mun.csv
equityeditions_mun.html
equityurls.txt
equityurls_1883-1999.txt
equityurls_test.txt
glasgowstoplist_mod.txt
hist3814o-final.Rproj
prov_el_txt_files.zip
stopwords.txt

README.md

hist3814o-final

Files for the final project of HIST3814o. Analysis of the Shawville Equity

Problem with equity_mallet_topic_modeller.R

In respository see program:

equity_mallet_topic_modeller.R

Text file data

https://github.com/jeffblackadar/hist3814o-final/blob/master/prov_el_txt_files.zip

Stop words

https://github.com/jeffblackadar/hist3814o-final/blob/master/glasgowstoplist_mod.txt

Problem

When it executes: km <- kmeans(topic_df_dist, n.topics) I get Error in sample.int(m, k) : cannot take a sample larger than the population when 'replace = FALSE I have not been able to figure this out. I see a few other people have this error on the internet and suspect it's how the "sweep" does the sample. This is a productive fail for me, but I wanted to see if it's fixable.