COVID-19 outcome prediction models
based on machine learning algorithms

Project description

Using the code in this repository, we performed a multicenter, age- and sex-matched study to compare how well various machine learning algorithms predict fatal COVID-19 outcomes and to develop sensitive, specific, and robust artificial intelligence tools for the prompt triage of patients with severe COVID-19 in the intensive care unit setting. Compared with other established machine learning algorithms (decision trees, random forests, extra trees, k-nearest neighbours, and gradient boosting: XGBoost, LightGBM, and CatBoost) and with multivariate logistic regression as a reference, neural networks demonstrated the highest sensitivity, sufficient specificity, and excellent robustness.

Further, neural networks using coronary artery disease/chronic heart failure, stage 3-5 chronic kidney disease, blood urea nitrogen, and C-reactive protein as predictors exceeded 90% sensitivity and 80% specificity, reaching an AUROC of 0.866 at primary cross-validation and 0.849 at secondary cross-validation on virtual samples generated by bootstrapping. These results underscore the impact of cardiovascular and renal comorbidities in the context of the thrombotic complications characteristic of severe COVID-19. Because these predictors can be obtained from case histories or are inexpensive to measure at admission to the intensive care unit, we suggest this predictor set as useful for the triage of critically ill COVID-19 patients.
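For readers unfamiliar with the metrics quoted above, the following is a minimal sketch of how sensitivity, specificity, and AUROC can be computed, and how AUROC can be re-estimated on bootstrap resamples. It uses only the Python standard library and toy data; the function names, the 0.5 decision threshold, and the number of resamples are illustrative assumptions and are not taken from `analysis.py` or from the published study.

```python
import random


def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels, where 1 = fatal outcome."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


def auroc(y_true, scores):
    """AUROC as the share of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_auroc(y_true, scores, n_resamples=200, seed=0):
    """Mean AUROC over bootstrap resamples (patients drawn with replacement)."""
    rng = random.Random(seed)
    n = len(y_true)
    values = []
    while len(values) < n_resamples:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [y_true[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # AUROC is undefined when a resample contains one class only
        values.append(auroc(ys, [scores[i] for i in idx]))
    return sum(values) / len(values)


# Toy data (hypothetical): true outcomes and predicted probabilities of death
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.3, 0.2, 0.6, 0.7, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

sens, spec = sensitivity_specificity(y_true, y_pred)
auc = auroc(y_true, scores)
auc_boot = bootstrap_auroc(y_true, scores)
```

On this toy data one positive and one negative case are misclassified at the 0.5 threshold, so both sensitivity and specificity are 0.75, while 15 of the 16 positive/negative pairs are ranked correctly. In practice a library implementation such as scikit-learn's `roc_auc_score` would normally be preferred over the pairwise loop.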

Link to published study: Ovcharenko E, Kutikhin A, Gruzdeva O, Kuzmina A, Slesareva T, Brusina E, Kudasheva S, Bondarenko T, Kuzmenko S, Osyaev N, Ivannikova N, Vavin G, Moses V, Danilov V, Komossky E, Klyshnikov K. Cardiovascular and Renal Comorbidities Included into Neural Networks Predict the Outcome in COVID-19 Patients Admitted to an Intensive Care Unit: Three-Center, Cross-Validation, Age- and Sex-Matched Study. Journal of Cardiovascular Development and Disease. 2023; 10(2):39. https://doi.org/10.3390/jcdd10020039

Documentation

The project contains the following files:

File	Description
requirements.txt	list of the project's dependencies
/dataset/	dataset of age- and sex-matched patients from three clinics
data_processing.py	script for data preprocessing
ml.py	machine learning code based on MLJAR AutoML (https://github.com/mljar/mljar-supervised)
analysis.py	code for analyzing the performance of the ML algorithms
visualization_tools.py	utilities for data visualization

Results

ML classification metrics

Feature Importance Heatmap of 3-custom-fold cross-validation
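This README does not define the "3-custom-fold" scheme. One plausible reading, given that the dataset comes from three clinics, is that each fold holds out the patients of one clinic. The sketch below illustrates that leave-one-center-out idea under this assumption only; the function name and the center labels are hypothetical and do not come from `ml.py`.

```python
def leave_one_center_out(centers):
    """Yield (held_out_center, train_idx, test_idx) for each distinct center label.

    Assumption: one fold per clinic, with that clinic's patients used for testing.
    """
    for held_out in sorted(set(centers)):
        train_idx = [i for i, c in enumerate(centers) if c != held_out]
        test_idx = [i for i, c in enumerate(centers) if c == held_out]
        yield held_out, train_idx, test_idx


# Hypothetical center label for each of seven patients
centers = ["A", "A", "B", "B", "B", "C", "C"]
folds = list(leave_one_center_out(centers))

for held_out, train_idx, test_idx in folds:
    print(held_out, "train:", train_idx, "test:", test_idx)
```

Splitting by clinic rather than at random keeps every patient of the held-out center out of training, so each fold estimates how a model transfers to an unseen center.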

Repository: Eugene-Ovcharenko/COVID_reg