Skip to content

MaryamBisadi/Insight_Data_Engineering_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Project-Title

A ML model is used to predict ICU patient mortality using features extracted from the heart signals of ICU patients. Presentation Link

Introduction

Early hospital mortality prediction is critical to make efficient medical decisions about the severely ill patients staying in intensive care units (ICUs). Most of mortality prediction methods need clinical records. However, some of the laboratory test results are time-consuming and and need to be processed.

In this project heart rate signals are used to predict mortality.

Architecture

Untitled drawing

Dataset

MIMIC III waveform Database

Engineering challenges

Huge amount of binary data for waveform signals and missing signals in some records were part of the challenge I faced in this project. In a addition, current modules for extracting heart rate from signals are not designed for distributed systems. The module I used needs signal file and the related header file in current directory. So, I had to read signal and related header file from S3 and save it on EC2 to extract heart rate and delete it after processing.

To overcome these challenge I divided data in 10 batches and create a thread for each one. Feature extraction and prediction was done in spark to take advantage of distribution and paralelizem.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages