SimpleScience_28_Supplemental_Materials.pdf: Additional Analysis
SimpleSciGoldRawDataset(SimpleSciGold.csv): A set of 293 sentences from PLOS journal abstracts, each containing one (complex) term from the MeSH ontologyor consumer health vocabulary set (Vydiswaran et al. 2014), with an average of 21 simplifications per complex term.
Simplifications_cosim04_wiki3000: A file containing rules generated by our pipeline, with a=0.4(cosine similarity threshold) and k=3000 (simple word in general corpus threshold.