This folder contains data generated through the study of usage of cell line/type nomenclature in biomedical literature
Cells_ManualAnalysis.xlsx contains the data generated during the manual analysis of the text mining results
CellLineOnto.mwt is the dictionary generated from the Cell Line Ontology
CellOnto.mwt is the dictionary generated from the Cell Ontology
Cells_Representation_Analysis.V2.xlsx contains the data generated based on the representation of each individual class in literature
Source code of the tagger that can be used with any dictionary in the mwt format to annotate text, is available at https://github.com/jeekim/EuropePMC-Identifier-Extractor