This is the codebase for the computational experiments for the paper "Imputation of Clinical Covariates in Time Series" by Dimitris Bertsimas, Agni Orfanoudaki, and Colin Pawlowski. The purpose of this method, MedImpute, is to fill in missing values in healthcare datasets with a longitudinal time series structure. This code is compatible with Julia version 1.0.5, available for download here.
In this paper, three real-world clinical datasets were used for the computational experiments to compare imputation methods: Framingham Heart Study, Parkinson's Progressive Markers Index, and electronic health record data from the Dana Farber Cancer Institute. Access to the first two datasets can be requested from the following sources:
You must obtain an academic license and precompiled Julia system image in order to install and run the MedImpute software package. Please email {agniorf,cpawlows}@mit.edu with the subject line "Request for MedImpute License" in order to request access.
Documentation for the MedImpute package is available here. In addition, an example script using MedImpute to impute missing values in a synthetic EHR dataset is available in the folder medimpute_example/
.