This repository contains the source code for the following publication:
Zhang, R., Hristovski, D., Schutte, D., Kastrin, A., Fiszman, M., & Kilicoglu, H. (2021). Drug repurposing for COVID-19 via knowledge graph completion. Journal of Biomedical Informatics, 115, 103696. https://doi.org/10.1016/j.jbi.2021.103696
- Python 3.6 with packages lxml, numpy, and pandas
- Perl 5 with module Text::NSP
- AWK
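The prerequisites above can be checked before running anything. The following is a minimal sketch, not part of the repository: the function name `check_requirements` is hypothetical, and it assumes the Perl and AWK executables are available on the PATH as `perl` and `awk`, and that the Text::NSP distribution can be loaded as `Text::NSP`.

```python
import importlib.util
import shutil
import subprocess

# Requirements as listed in this README.
PYTHON_PACKAGES = ("lxml", "numpy", "pandas")
EXECUTABLES = ("perl", "awk")  # assumed executable names


def check_requirements():
    """Return a dict mapping each requirement name to True (found) or False."""
    status = {}

    # A Python package counts as present if its import spec can be located.
    for name in PYTHON_PACKAGES:
        status[name] = importlib.util.find_spec(name) is not None

    # An executable counts as present if it is found on the PATH.
    for exe in EXECUTABLES:
        status[exe] = shutil.which(exe) is not None

    # Assumption: "perl -MText::NSP -e 1" exits with 0 only if the module loads.
    if status["perl"]:
        result = subprocess.run(
            ["perl", "-MText::NSP", "-e", "1"],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
        status["Text::NSP"] = result.returncode == 0
    else:
        status["Text::NSP"] = False

    return status


if __name__ == "__main__":
    for requirement, found in check_requirements().items():
        print(f"{requirement}: {'ok' if found else 'MISSING'}")
```

The script only reports what is missing; it does not install anything or check version numbers.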
- ./data directory contains input files
- ./preprocessing directory contains scripts for preparing data
- ./filtering directory contains scripts for filtering predications with BERT
- ./models directory contains scripts for knowledge graph completion
- ./predictions directory contains output files from graph completion models
- Download and set up SemMedDB
- Create ./data directory in project's root folder
- Prepare sub_rel_obj_pyear_edat_pmid_sent_id_sent.tsv.gz file and place it into the ./data/SemMedDB directory
- Download SemRepped CORD-19 dataset and extract files into ./data/cord-19 directory
- Prepare SemMedDB and CORD-19 data using the ./preprocessing/run.sh file
- Run Python notebooks in the ./filtering directory
- Run Python notebooks in the ./models directory
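The directory setup in the steps above can be sketched as a small helper that creates the expected folders and reports which inputs are still missing. This is an illustrative sketch only: the function name `prepare_data_dirs` is hypothetical, and the check for CORD-19 simply assumes that at least one extracted file should be present in ./data/cord-19.

```python
from pathlib import Path

# File name taken from the setup steps in this README.
SEMMEDDB_FILE = "sub_rel_obj_pyear_edat_pmid_sent_id_sent.tsv.gz"


def prepare_data_dirs(project_root="."):
    """Create the data directories and return a list of missing inputs."""
    data_dir = Path(project_root) / "data"
    semmeddb_dir = data_dir / "SemMedDB"
    cord19_dir = data_dir / "cord-19"

    # Create ./data, ./data/SemMedDB and ./data/cord-19 if they do not exist.
    for directory in (semmeddb_dir, cord19_dir):
        directory.mkdir(parents=True, exist_ok=True)

    problems = []

    # The prepared SemMedDB export must be placed here by hand.
    if not (semmeddb_dir / SEMMEDDB_FILE).is_file():
        problems.append(f"missing {semmeddb_dir / SEMMEDDB_FILE}")

    # Assumption: an empty directory means the CORD-19 files were not extracted.
    if not any(cord19_dir.iterdir()):
        problems.append(f"no files found in {cord19_dir}")

    return problems


if __name__ == "__main__":
    for problem in prepare_data_dirs("."):
        print(problem)
```

Once the helper reports no problems, the inputs are in the locations named in the steps, and preprocessing with ./preprocessing/run.sh can follow.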
Halil Kilicoglu (halil (at) illinois.edu)