# Improving Medical Predictions by Irregular Multimodal Electronic Health Records Modeling
 
The general problem addressed in this paper is how to better handle the irregular,
multimodal data recorded in EHRs in order to improve real-time predictions in
ICUs.
“Health conditions among patients in intensive care units (ICUs) are monitored via
electronic health records (EHRs), composed of numerical time series and lengthy clinical
note sequences, both taken at irregular time intervals. Dealing with such irregularity in
every modality, and integrating irregularity into multimodal representations to improve
medical predictions, is a challenging problem.” (Zhang et al., 2023)
<br>

> Zhang, X., Li, S., Chen, Z., Yan, X., & Petzold, L. R. (2023, July). Improving medical predictions
> by irregular multimodal electronic health records modeling. In International Conference on
> Machine Learning (pp. 41300-41313). PMLR. [link](https://arxiv.org/abs/2210.12156)

## Specific Approach
The specific approach of the paper is to model EHR records by integrating the
numerical time series and clinical notes while accounting for their irregularity. To achieve this,
the paper addresses three main challenges: modeling irregularity in time series,
processing irregular clinical notes, and multimodal fusion.


# Data Preprocessing
Use the following projects to help with data extraction:
### MIMIC benchmarks
Helps process the time series data and split it into train and test sets.
 https://github.com/YerevaNN/mimic3-benchmarks.git

### ClinicalNotesICU
Helps process the clinical notes and split them into train and test sets.
 https://github.com/kaggarwal/ClinicalNotesICU.git


### Modeling Irregularity in Time Series
1. Temporal Discretization-Based Embeddings (TDE): Two complementary ways of
regularizing an irregular time series:
    - Imputation: Regularizes the time series by filling in missing values based
on prior observations or statistical methods.
    - Discretized Multi-Time Attention (mTAND): Applies a learned
interpolation method using a multi-time attention mechanism to
better represent the irregular time series data.
2. Unified Approach (UTDE): Integrates imputation and mTAND
through a gating mechanism that dynamically combines the two representations of
the time series.
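
The two steps above can be sketched in plain Python. This is a minimal illustration, not the paper's implementation: the function names `impute_forward_fill` and `gated_fusion`, the one-hour bin width, and the scalar per-step representations are all assumptions made for the example. In a trained model the gate would come from a learned network rather than being passed in as logits.

```python
import math

def impute_forward_fill(times, values, num_bins, bin_width=1.0, default=0.0):
    """Discretize an irregular series onto a regular grid (hypothetical helper).

    Each bin keeps the last observation that falls inside it; empty bins
    carry forward the most recent earlier value (or `default` if none).
    """
    grid = [None] * num_bins
    for t, v in sorted(zip(times, values)):
        idx = int(t // bin_width)
        if 0 <= idx < num_bins:
            grid[idx] = v  # later observations in a bin overwrite earlier ones
    last = default
    filled = []
    for v in grid:
        if v is not None:
            last = v
        filled.append(last)
    return filled

def gated_fusion(e_imp, e_attn, gate_logits):
    """Mix two per-step representations as g * e_imp + (1 - g) * e_attn.

    `gate_logits` stands in for the output of a learned gate (assumption).
    """
    fused = []
    for a, b, z in zip(e_imp, e_attn, gate_logits):
        g = 1.0 / (1.0 + math.exp(-z))  # sigmoid keeps the gate in (0, 1)
        fused.append(g * a + (1.0 - g) * b)
    return fused

# Three irregular observations mapped onto five hourly bins.
hourly = impute_forward_fill([0.5, 2.2, 2.8], [1.0, 3.0, 4.0], num_bins=5)
```

A gate logit of 0 weights both representations equally, while a large positive logit makes the fused value follow the imputation branch.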


### Processing Irregular Clinical Notes
1. Text Encoding: Uses a pretrained model (TextEncoder) to encode clinical
notes into a series of representations.
2. Irregularity Modeling: Sorts these representations by time, treats them as
Multivariate Irregularly Sampled Time Series (MISTS), and employs mTAND
to generate a set of text interpolation representations that handle the irregularity.
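
The interpolation idea can be illustrated with a heavily simplified, single-head attention over timestamps. This is a sketch under stated assumptions, not mTAND itself: mTAND learns its time embeddings and uses multiple heads, whereas the fixed sinusoidal `time_embedding` below, the `interpolate` helper, and the toy note vectors are all hypothetical stand-ins.

```python
import math

def time_embedding(t, dim=8, max_period=48.0):
    """Fixed sinusoidal embedding of a timestamp (mTAND learns these instead)."""
    emb = []
    for i in range(dim // 2):
        freq = 1.0 / (max_period ** (2 * i / dim))
        emb.append(math.sin(t * freq))
        emb.append(math.cos(t * freq))
    return emb

def interpolate(query_times, obs_times, obs_vectors, dim=8):
    """Softmax attention from regular query times to irregular observations.

    Returns one interpolated vector per query time, each a convex
    combination of the observed vectors.
    """
    keys = [time_embedding(t, dim) for t in obs_times]
    width = len(obs_vectors[0])
    out = []
    for q_t in query_times:
        q = time_embedding(q_t, dim)
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in keys]
        m = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        weights = [w / total for w in weights]
        out.append([sum(w * vec[j] for w, vec in zip(weights, obs_vectors))
                    for j in range(width)])
    return out

# Two notes written at hours 1 and 10, each encoded as a toy 1-d vector.
note_reps = interpolate([0.0, 5.0, 10.0], [1.0, 10.0], [[0.0], [1.0]])
```

Because the attention weights depend only on timestamps, a query time close to a note's timestamp draws most of its representation from that note.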

### Multimodal Fusion
1. Interleaved Attention Mechanism: Fuses time series and clinical note
representations across temporal steps, integrating irregularity into multimodal
representations.
2. Self and Cross-Attention:
    - Multi-Head Self-Attention (MH): Acquires contextual embeddings for
each modality by focusing within the same modality across time.
    - Multi-Head Cross-Attention (CMH): Each modality learns from the
other, integrating information across modalities.
3. Feed-Forward and Prediction Layers: A feed-forward sublayer follows the
CMH outputs, with layer normalization and residual connections applied. The
final step involves passing the integrated representations through fully
connected layers to predict the outcome.
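
One fusion layer can be sketched as self-attention within each modality followed by cross-attention between them. This is a minimal sketch, assuming single-head attention with no learned projections; layer normalization, the feed-forward sublayer, and the prediction head are left out, and the names `attend`, `add`, and `fusion_layer` are hypothetical.

```python
import math

def attend(queries, keys, values):
    """Single-head scaled dot-product attention over lists of vectors."""
    d = len(keys[0])
    width = len(values[0])
    out = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in keys]
        m = max(scores)  # subtract the max for numerical stability
        exp = [math.exp(s - m) for s in scores]
        z = sum(exp)
        out.append([sum(e / z * v[j] for e, v in zip(exp, values))
                    for j in range(width)])
    return out

def add(x, y):
    """Element-wise sum of two sequences of vectors (residual connection)."""
    return [[a + b for a, b in zip(r, s)] for r, s in zip(x, y)]

def fusion_layer(ts, txt):
    """Self-attention per modality, then cross-attention between modalities."""
    ts_ctx = add(ts, attend(ts, ts, ts))        # time series attends to itself
    txt_ctx = add(txt, attend(txt, txt, txt))   # notes attend to themselves
    ts_out = add(ts_ctx, attend(ts_ctx, txt_ctx, txt_ctx))   # time series queries notes
    txt_out = add(txt_ctx, attend(txt_ctx, ts_ctx, ts_ctx))  # notes query time series
    return ts_out, txt_out

# Two time steps, each modality represented by a toy 2-d vector per step.
ts_out, txt_out = fusion_layer([[1.0, 0.0], [0.0, 1.0]], [[1.0, 1.0], [1.0, 1.0]])
```

Both modalities need the same vector width here because the dot product compares them directly; a real model would project each modality into a shared dimension first.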

# Train Model

# Evaluate

# Ablation 1 - Drop UTDE

# Ablation 2 - Remove mTAND