Skip to content

COVID-19 outcome prediction models based on machine learning algorithms. The unique feature is a custom cross-validation strategy based on the three clinical datasets of age- and gender-matched patients.

Notifications You must be signed in to change notification settings

Eugene-Ovcharenko/COVID_reg

Repository files navigation

COVID-19 outcome prediction models
based on machine learning algorithms

Project description

Using the given code, we performed a multicenter, age- and sex-matched study to compare the efficiency of various machine learning algorithms in prediction of COVID-19 fatal outcomes and to develop sensitive, specific, and robust artificial intelligence tools for the prompt triage of patients with severe COVID-19 in the intensive care unit setting. In a challenge against other established machine learning algorithms (decision trees, random forests, extra trees, neural networks, k-nearest neighbours, and gradient boosting: XGBoost, LightGBM, and CatBoost) and multivariate logistic regression as a reference, neural networks demonstrated the highest sensitivity, sufficient specificity and excellent robustness. Further, neural networks based on coronary artery disease/chronic heart failure, stage 3-5 chronic kidney disease, blood urea nitrogen, and C-reactive protein as the predictors exceeded 90% sensitivity and 80% specificity, reaching AUROC of 0.866 at primary cross-validation and 0.849 at secondary cross-validation on virtual samples generated by the bootstrapping procedure. These results underscore the impact of cardiovascular and renal comorbidities in context of thrombotic complications characteristic of severe COVID-19. As aforementioned predictors can be obtained from the case histories or are inexpensive to be measured at the admission to intensive care unit, we suggest this predictor composition as useful for the triage of critically ill COVID-19 patients.

Link to published study: Ovcharenko E, Kutikhin A, Gruzdeva O, Kuzmina A, Slesareva T, Brusina E, Kudasheva S, Bondarenko T, Kuzmenko S, Osyaev N, Ivannikova N, Vavin G, Moses V, Danilov V, Komossky E, Klyshnikov K. Cardiovascular and Renal Comorbidities Included into Neural Networks Predict the Outcome in COVID-19 Patients Admitted to an Intensive Care Unit: Three-Center, Cross-Validation, Age- and Sex-Matched Study. Journal of Cardiovascular Development and Disease. 2023; 10(2):39. https://doi.org/10.3390/jcdd10020039

Documentation

Project contains the following files:

File Description
requirements.txt file listing all the dependencies for the given project
/dataset/ the dataset of age- and sex-matched patients from three clinics
data_processing.py script for the data preprocessing
ml.py code for the machine learning using MLJAR AutoML (https://github.com/mljar/mljar-supervised)
analysis.py code for the study of the ML algorithms performance
visualization_tools.py utilities for the data visualization

Results

ML classification metrics

Feature Importance Heatmap of 3-custom-fold cross-validation

About

COVID-19 outcome prediction models based on machine learning algorithms. The unique feature is a custom cross-validation strategy based on the three clinical datasets of age- and gender-matched patients.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages