GitHub - KwokHing/WiDS-Datathon-Patient-Survival: A challenge to create a model that uses data from the first 24 hours of intensive care to predict patient survival

WiDS Datathon On Predicting Patient Survival

This repo provides the submission entry for a kaggle dataton to create a model that uses data from the first 24 hours of intensive care to predict patient's survival.

There are numerous missing data in this challenge, the key to getting higher accuracy lies in (i) imputation of data, (ii) feature engineering based on domain knowledge (e.g calculating BMI, or other medical metrics), and (iii) features selection (as there are far too many features in the dataset and not all are useful). LightGBM is used to achieve an approximately 90% accuracy, I believe that any gradient boosted models with some decent work on data imputation, feature engineering and selection should provide a fairly accurate prediction model.

Getting started

Open WiDS_Patient_Survival.ipynb on a jupyter notebook environment. Alternatively, you can view the codes in . The notebook consists of further technical details.

Improvements

Could potentially explore the use of Deep Learning Techniques.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
README.md		README.md
WiDS_Patient_Survival.ipynb		WiDS_Patient_Survival.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WiDS Datathon On Predicting Patient Survival

Getting started

Improvements

About

Languages

KwokHing/WiDS-Datathon-Patient-Survival

Folders and files

Latest commit

History

Repository files navigation

WiDS Datathon On Predicting Patient Survival

Getting started

Improvements

About

Topics

Resources

Stars

Watchers

Forks

Languages