# DeepSurv

## Cox Proportional Hazards model (CoxPH) with neural network

### CoxPH model

The **Cox Proportional Hazards model (CoxPH)** is one of the most widely used methods for **survival analysis**.  
It models the relationship between the **time until an event occurs** (e.g., death) and a set of **covariates** (e.g., clinical or genetic variables).

CoxPH is **semi-parametric**:  
- It does **not assume any specific form** for the underlying survival time distribution.  
- It **parametrically models** the influence of covariates on the event risk.


#### Model Overview

The model expresses the **hazard function** — the instantaneous risk of experiencing the event at time *t* given survival up to *t* — as:

$$
h(t | x) = h_0(t) \, \exp(\beta_1 x_1 + \beta_2 x_2 + \dots + \beta_p x_p)
$$

where:  
- $h_0(t)$: baseline hazard function (shared across all individuals)  
- $\beta_i$: coefficient of covariate $x_i$, i.e. the change in log hazard per one-unit increase in $x_i$  
- $\exp(\beta_i)$: hazard ratio (HR), the factor by which the hazard is multiplied for a one-unit increase in $x_i$ with all other covariates held fixed

    - If $\exp(\beta_i) > 1$: the covariate **increases the hazard** (reduces survival).  
    - If $\exp(\beta_i) < 1$: the covariate **reduces the hazard** (improves survival).  
    - If $\exp(\beta_i) = 1$: the covariate **has no effect** on the hazard.
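
The three cases above can be checked numerically. The following is a minimal sketch; the covariate names and coefficient values are hypothetical and chosen only to illustrate the interpretation, not taken from any fitted model.

```python
import math

# Hypothetical coefficients (illustrative values, not from a real fit).
betas = {"age_per_decade": 0.30, "treatment": -0.50, "uninformative": 0.0}


def hazard_ratio(beta: float) -> float:
    """Hazard ratio for a one-unit increase in the covariate: exp(beta)."""
    return math.exp(beta)


for name, beta in betas.items():
    hr = hazard_ratio(beta)
    if hr > 1:
        effect = "increases the hazard"
    elif hr < 1:
        effect = "reduces the hazard"
    else:
        effect = "has no effect on the hazard"
    print(f"{name}: beta = {beta:+.2f}, HR = {hr:.3f} ({effect})")
```

Because the effect is multiplicative, a two-unit increase in a covariate multiplies the hazard by $\exp(2\beta_i)$, not by $2\exp(\beta_i)$.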


#### Key Assumption: Proportional Hazards

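The defining assumption of the Cox model is that the **hazard ratio between any two individuals remains constant over time**:

$$
\frac{h(t | x_a)}{h(t | x_b)} = \exp(\beta^\top (x_a - x_b))
$$

where $x_a$ and $x_b$ are the covariate vectors of two individuals. The baseline hazard $h_0(t)$ cancels, so the ratio does not depend on $t$.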

This means that covariates shift the hazard proportionally but do not change its shape over time.  
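If this assumption is violated (for example, by a treatment effect that fades over time), a single hazard ratio no longer summarizes the covariate's effect, and the estimated coefficients can be misleading.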
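
A quick numerical check makes the assumption concrete. This is a sketch under stated assumptions: the Weibull-type baseline hazard, the coefficients, and the two covariate vectors (named `x_a` and `x_b` here) are all made up for illustration. Whatever baseline is chosen, the ratio of the two hazards should come out the same at every time point.

```python
import math


def baseline(t: float) -> float:
    """Assumed Weibull-type baseline hazard, used only for illustration."""
    return 0.1 * 1.5 * t ** 0.5


def hazard(t: float, x: list, beta: list) -> float:
    """h(t | x) = h_0(t) * exp(beta . x)."""
    linear = sum(b * xi for b, xi in zip(beta, x))
    return baseline(t) * math.exp(linear)


beta = [0.4, -0.7]   # hypothetical coefficients
x_a = [1.0, 0.0]     # covariates of individual a
x_b = [0.0, 1.0]     # covariates of individual b

time_points = (0.5, 1.0, 5.0, 20.0)
ratios = [hazard(t, x_a, beta) / hazard(t, x_b, beta) for t in time_points]

# Closed form: exp(beta . (x_a - x_b)), with no dependence on t.
expected = math.exp(sum(b * (a - c) for b, a, c in zip(beta, x_a, x_b)))

for t, r in zip(time_points, ratios):
    print(f"t = {t:5.1f}: hazard ratio = {r:.6f}")
print(f"closed form:  {expected:.6f}")
```

The individual hazards change with $t$, but their ratio stays fixed, which is exactly what "proportional" means here.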


#### Parameter Estimation

Cox proposed the **partial likelihood** approach, which allows estimation of coefficients $\beta$ without specifying the baseline hazard $h_0(t)$.  
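Because $h_0(t)$ never has to be estimated, fitting reduces to an optimization over $\beta$ alone, which is computationally cheap. For high-dimensional biological or clinical data, where covariates may outnumber observed events, a penalized partial likelihood (e.g., lasso or ridge) is typically used to keep the estimates stable.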
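
For reference, the partial likelihood can be written out as follows, assuming no tied event times. Here $x_i$ denotes the full covariate vector of individual $i$, $T_i$ the observed time, $E_i$ the event indicator ($1$ if the event is observed, $0$ if censored), and $R(T_i)$ the set of individuals still at risk at time $T_i$:

$$
L(\beta) = \prod_{i: E_i = 1} \frac{\exp(\beta^\top x_i)}{\sum_{j \in R(T_i)} \exp(\beta^\top x_j)}
$$

Each factor compares the individual who experienced the event with everyone still at risk at that time. The baseline hazard $h_0(T_i)$ would multiply both the numerator and the denominator, so it drops out of every factor.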


#### Typical Outputs

A fitted CoxPH model typically provides:
- Estimated coefficients $\beta_i$ and hazard ratios $\exp(\beta_i)$  
- Significance tests (z-scores, p-values)  
- Model performance metrics such as the **Concordance Index (C-index)**  
- Diagnostic checks for proportional hazards assumptions


---

## DeepSurv: A Deep Neural Network for Survival Analysis

### DeepSurv (2018) = CoxPH + Deep Neural Network
**DeepSurv** is a deep learning extension of the **Cox Proportional Hazards model (CoxPH)**, designed to capture **non-linear relationships** between covariates and survival risk.


### Model Overview

DeepSurv replaces the linear predictor $\beta^\top x$ of CoxPH with a non-linear function $f_\theta(x)$ learned by a **feedforward neural network**:

$$
h(t|x) = h_0(t) \, \exp(f_\theta(x))
$$

where:
- $f_\theta(x)$ is the output of the neural network parameterized by $\theta$  
- $h_0(t)$ is the baseline hazard
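
To make $f_\theta(x)$ concrete, here is a minimal sketch of a feedforward network that maps a covariate vector to a single scalar. It uses only the standard library and untrained random weights; the layer sizes, the ReLU activation, and the helper names (`init_layer`, `dense`, `f_theta`) are assumptions for illustration, not the configuration of the original DeepSurv implementation.

```python
import math
import random


def init_layer(n_in: int, n_out: int, rng: random.Random):
    """Return (weights, biases) with small random weights and zero biases."""
    weights = [[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(x, layer):
    """Affine map: one output per weight row."""
    weights, biases = layer
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, biases)]


def relu(v):
    return [max(0.0, z) for z in v]


def f_theta(x, layers) -> float:
    """Hidden layers with ReLU, then a single linear output node."""
    h = x
    for layer in layers[:-1]:
        h = relu(dense(h, layer))
    return dense(h, layers[-1])[0]


rng = random.Random(0)
# 3 covariates -> 8 hidden units -> 8 hidden units -> 1 output (arbitrary sizes).
layers = [init_layer(3, 8, rng), init_layer(8, 8, rng), init_layer(8, 1, rng)]

x = [0.5, -1.2, 2.0]                  # hypothetical covariate vector
log_risk = f_theta(x, layers)
relative_hazard = math.exp(log_risk)  # factor that multiplies h_0(t)
print(f"f_theta(x) = {log_risk:.4f}, exp(f_theta(x)) = {relative_hazard:.4f}")
```

The output layer has one linear node and no activation, so $f_\theta(x)$ can take any real value, just like $\beta^\top x$ in CoxPH. With no hidden layers and no bias, the sketch reduces to the linear model.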

### Loss Function

The model is trained by minimizing the **negative partial log-likelihood** of the Cox model:

$$
\mathcal{L}(\theta) = - \sum_{i: E_i=1} \left[ f_\theta(x_i) - \log \sum_{j \in R(T_i)} \exp(f_\theta(x_j)) \right]
$$

where:
- $T_i$ is the observed time (event or censoring) for individual $i$  
- $E_i = 1$ if the event is observed, $0$ if censored  
- $R(T_i) = \{ j : T_j \ge T_i \}$ is the set of individuals still at risk at time $T_i$
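
The loss above can be sketched directly in plain Python. This version is deliberately simple: it takes the risk set as all individuals whose observed time is at least $T_i$, does not treat tied event times specially, and runs in quadratic time. The function name and the toy data are illustrative assumptions.

```python
import math


def neg_partial_log_likelihood(log_risks, times, events) -> float:
    """Negative Cox partial log-likelihood.

    log_risks[i] = f_theta(x_i), times[i] = T_i, events[i] = E_i.
    Risk set: R(T_i) = {j : T_j >= T_i}.
    """
    loss = 0.0
    for f_i, t_i, e_i in zip(log_risks, times, events):
        if e_i != 1:
            continue  # censored individuals enter only through risk sets
        at_risk = [f_j for f_j, t_j in zip(log_risks, times) if t_j >= t_i]
        # Numerically stable log-sum-exp over the risk set.
        m = max(at_risk)
        log_sum_exp = m + math.log(sum(math.exp(f - m) for f in at_risk))
        loss -= f_i - log_sum_exp
    return loss


# Toy data: three individuals, the second one is censored.
times = [2.0, 4.0, 6.0]
events = [1, 0, 1]

uninformative = neg_partial_log_likelihood([0.0, 0.0, 0.0], times, events)
well_ordered = neg_partial_log_likelihood([1.0, 0.0, -1.0], times, events)
badly_ordered = neg_partial_log_likelihood([-1.0, 0.0, 1.0], times, events)
print(uninformative, well_ordered, badly_ordered)
```

The loss is lowest when individuals with earlier events receive higher outputs, which is the behaviour the network is trained toward. Practical implementations usually sort by time and use cumulative sums instead of rebuilding each risk set, and compute the loss on mini-batches with automatic differentiation.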

### Interpretation

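- The network output $f_\theta(x)$ is the individual's **log-risk score**, i.e. the log hazard ratio relative to the baseline hazard $h_0(t)$.  
- The **hazard ratio** relative to baseline is $\exp(f_\theta(x))$; between two individuals with covariates $x_a$ and $x_b$ it is $\exp(f_\theta(x_a) - f_\theta(x_b))$.  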
- Model performance is commonly evaluated using the **Concordance Index (C-index)**.
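
As an illustration of how the C-index is computed, the sketch below counts, among comparable pairs, how often the individual with the earlier observed event also has the higher predicted risk. It is a simplified version of Harrell's C: a pair is comparable only when the earlier time is an observed event, tied risk scores count as half, and tied times are ignored. The function name and toy data are assumptions for illustration.

```python
def concordance_index(risk_scores, times, events) -> float:
    """Simplified Harrell's C-index (higher risk should mean earlier event)."""
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if events[i] != 1:
            continue  # a censored time cannot be the earlier member of a pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data: four individuals, the third one is censored.
times = [2.0, 4.0, 6.0, 8.0]
events = [1, 1, 0, 1]

print(concordance_index([3.0, 2.0, 1.0, 0.0], times, events))  # perfectly ordered
print(concordance_index([0.0, 1.0, 2.0, 3.0], times, events))  # perfectly reversed
print(concordance_index([1.0, 1.0, 1.0, 1.0], times, events))  # uninformative
```

A value of 1.0 means every comparable pair is ranked correctly, 0.5 corresponds to random ranking, and values below 0.5 indicate a systematically inverted ranking. For DeepSurv, the risk score passed in would be $f_\theta(x)$ or any monotone transform of it.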