# X-Risk Estimation

In the notebook [xrisk-database-processing](./xrisk-database-processing.ipynb), we processed the X-Risk estimation "database" from the EA Forum.

Here we take that data and build a risk model to estimate future risk and tackle the question: "how long do we expect to still be around?"


## The Model

We're going to model risk and survival probability using a probability cascade.

We can formulate this as a "game", or the "extinction game" to be more dramatic. In each round humanity either *wins* and gets to play again, or *loses* and doesn't. What exactly it means to lose is debatable: it could mean civilizational collapse, or it could mean extinction. The most general scenario, however, is one where humanity's potential to flourish is destroyed. That could be full-fledged extinction, the establishment of an ultra-stable dystopia, or a collapse of civilization without the ability to recover.

### Correlated Existential Risks

### Definitions

Let:
- $x_i(t)$ = risk from source $i$ in period $t$
- $X(t)$ = total existential risk in period $t$
- $\rho_{ij}$ = correlation coefficient between risks $i$ and $j$
- $n$ = number of risk sources (6 in our case)
- $\Sigma$ = correlation matrix with entries $\rho_{ij}$

Our risk sources are:
1. AI ($x_1$)
2. Nuclear ($x_2$)
3. Bio ($x_3$)
4. Natural ($x_4$)
5. Climate ($x_5$)
6. Dystopia ($x_6$)

## Base Risk Values (by 2100)
- $x_1 = 0.10$ (AI)
- $x_2 = 0.01$ (Nuclear)
- $x_3 = 0.025$ (Bio)
- $x_4 = 0.0001$ (Natural)
- $x_5 = 0.001$ (Climate)
- $x_6 = 0.05$ (Dystopia)

## Correlation Matrix
$\Sigma = \begin{pmatrix} 
1.0 & 0.3 & 0.4 & -0.3 & 0.2 & 0.7 \\
0.3 & 1.0 & 0.6 & 0.2 & 0.6 & 0.8 \\
0.4 & 0.6 & 1.0 & 0.4 & 0.5 & 0.7 \\
-0.3 & 0.2 & 0.4 & 1.0 & 0.8 & 0.5 \\
0.2 & 0.6 & 0.5 & 0.8 & 1.0 & 0.7 \\
0.7 & 0.8 & 0.7 & 0.5 & 0.7 & 1.0
\end{pmatrix}$
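
A Gaussian copula needs $\Sigma$ to be a valid correlation matrix: symmetric, unit diagonal, and positive semidefinite. The sketch below is one way to check that with NumPy; the names `SIGMA` and `is_valid_correlation` are illustrative and not taken from the original notebook. For the matrix as written the check fails: the AI/Natural/Dystopia block alone has determinant $-0.04$, so at least one eigenvalue is negative and the entries would need a small adjustment before the matrix can be used in a copula.

```python
import numpy as np

# Correlation matrix from the text.
# Row/column order: AI, Nuclear, Bio, Natural, Climate, Dystopia.
SIGMA = np.array([
    [ 1.0, 0.3, 0.4, -0.3, 0.2, 0.7],
    [ 0.3, 1.0, 0.6,  0.2, 0.6, 0.8],
    [ 0.4, 0.6, 1.0,  0.4, 0.5, 0.7],
    [-0.3, 0.2, 0.4,  1.0, 0.8, 0.5],
    [ 0.2, 0.6, 0.5,  0.8, 1.0, 0.7],
    [ 0.7, 0.8, 0.7,  0.5, 0.7, 1.0],
])


def is_valid_correlation(sigma, tol=1e-10):
    """Return True if sigma is symmetric, has a unit diagonal, and is PSD."""
    symmetric = np.allclose(sigma, sigma.T)
    unit_diag = np.allclose(np.diag(sigma), 1.0)
    min_eig = np.linalg.eigvalsh(sigma).min()
    return bool(symmetric and unit_diag and min_eig >= -tol)


# 3x3 principal block for AI, Natural, Dystopia (indices 0, 3, 5).
block = SIGMA[np.ix_([0, 3, 5], [0, 3, 5])]

print("min eigenvalue:", np.linalg.eigvalsh(SIGMA).min())
print("det of AI/Natural/Dystopia block:", np.linalg.det(block))
print("valid correlation matrix:", is_valid_correlation(SIGMA))
```

Intuitively, AI and Natural cannot both be strongly tied to Dystopia ($0.7$ and $0.5$) while being negatively correlated with each other ($-0.3$).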

## Single Period Risk

For independent risks:
$X_{ind}(t) = 1 - \prod_{i=1}^n (1 - x_i(t))$
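
As a quick worked example, the independent-risk formula can be evaluated on the base risk values listed earlier. This is a minimal sketch; the names `BASE_RISKS` and `independent_total_risk` are illustrative only.

```python
import math

# Base risk values (by 2100) from the text.
BASE_RISKS = {
    "AI": 0.10,
    "Nuclear": 0.01,
    "Bio": 0.025,
    "Natural": 0.0001,
    "Climate": 0.001,
    "Dystopia": 0.05,
}


def independent_total_risk(risks):
    """X_ind = 1 - prod_i (1 - x_i), assuming the sources are independent."""
    survival = math.prod(1.0 - x for x in risks)
    return 1.0 - survival


x_ind = independent_total_risk(BASE_RISKS.values())
print(f"independent total risk: {x_ind:.4f}")
print(f"naive sum of risks:     {sum(BASE_RISKS.values()):.4f}")
```

Under independence the total comes out at roughly $0.176$, slightly below the naive sum of $0.1861$, because the product form does not double count worlds in which more than one catastrophe occurs.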

For correlated risks using a Gaussian copula:
$X(t) = 1 - C(1-x_1(t), 1-x_2(t), ..., 1-x_n(t); \Sigma)$

where $C$ is the Gaussian copula:
$C(u_1,...,u_n;\Sigma) = \Phi_\Sigma(\Phi^{-1}(u_1),...,\Phi^{-1}(u_n))$

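Here:
- $\Phi_\Sigma$ is the multivariate normal CDF with correlation matrix $\Sigma$
- $\Phi^{-1}$ is the inverse of the standard normal CDF

The copula is evaluated at the per-source survival probabilities $1 - x_i(t)$, so $C(\cdot)$ gives the joint probability that no catastrophe occurs in period $t$, and $X(t)$ is its complement.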
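
The copula formula can be approximated by Monte Carlo: draw correlated standard normals and count how often every component stays below its threshold $\Phi^{-1}(1 - x_i)$. The sketch below makes two assumptions that are not in the original text. First, it estimates $\Phi_\Sigma$ by sampling rather than evaluating it exactly. Second, because the matrix as written has a negative eigenvalue, it applies a simple repair (clip the eigenvalues, then rescale to a unit diagonal) in a hypothetical helper `repair_correlation`; other repairs are possible and would give somewhat different numbers.

```python
from statistics import NormalDist

import numpy as np

# Base risks and correlation matrix from the text.
# Order: AI, Nuclear, Bio, Natural, Climate, Dystopia.
X = np.array([0.10, 0.01, 0.025, 0.0001, 0.001, 0.05])
SIGMA = np.array([
    [ 1.0, 0.3, 0.4, -0.3, 0.2, 0.7],
    [ 0.3, 1.0, 0.6,  0.2, 0.6, 0.8],
    [ 0.4, 0.6, 1.0,  0.4, 0.5, 0.7],
    [-0.3, 0.2, 0.4,  1.0, 0.8, 0.5],
    [ 0.2, 0.6, 0.5,  0.8, 1.0, 0.7],
    [ 0.7, 0.8, 0.7,  0.5, 0.7, 1.0],
])


def repair_correlation(sigma, eps=1e-8):
    """Assumed repair: clip negative eigenvalues, then restore the unit diagonal."""
    w, v = np.linalg.eigh(sigma)
    fixed = (v * np.clip(w, eps, None)) @ v.T
    d = np.sqrt(np.diag(fixed))
    return fixed / np.outer(d, d)


def copula_risk(x, sigma, n_samples=200_000, seed=0):
    """Monte Carlo estimate of X = 1 - C(1 - x_1, ..., 1 - x_n; sigma)."""
    rng = np.random.default_rng(seed)
    # Source i "fires" when its latent normal exceeds Phi^{-1}(1 - x_i).
    thresholds = np.array([NormalDist().inv_cdf(1.0 - xi) for xi in x])
    # Factor sigma = A @ A.T via its eigendecomposition (works for PSD input).
    w, v = np.linalg.eigh(sigma)
    a = v * np.sqrt(np.clip(w, 0.0, None))
    z = rng.standard_normal((n_samples, len(x))) @ a.T
    survived = np.all(z <= thresholds, axis=1)
    return 1.0 - survived.mean()


sigma_fixed = repair_correlation(SIGMA)
x_independent = copula_risk(X, np.eye(len(X)))  # identity matrix = independence
x_correlated = copula_risk(X, sigma_fixed)
print(f"independent (MC): {x_independent:.4f}")
print(f"correlated  (MC): {x_correlated:.4f}")
```

Passing the identity matrix recovers the independent case up to sampling noise, which is a useful sanity check on the sampler.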

## Multi-Period Survival

Probability of survival through $T$ periods:
$P(\text{survival through }T\text{ periods}) = \prod_{t=1}^T (1 - X(t))$
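
The product over periods is straightforward to compute once a per-period risk $X(t)$ is available. A minimal sketch follows; the function names and the three example risks are hypothetical values chosen only to illustrate the arithmetic.

```python
import numpy as np


def survival_probability(period_risks):
    """P(survive every period) = prod_t (1 - X(t))."""
    period_risks = np.asarray(period_risks, dtype=float)
    return float(np.prod(1.0 - period_risks))


def survival_curve(period_risks):
    """Cumulative survival probability after each period."""
    return np.cumprod(1.0 - np.asarray(period_risks, dtype=float))


# Hypothetical per-period total risks, for illustration only.
example_risks = [0.10, 0.15, 0.20]
p_survive = survival_probability(example_risks)   # 0.9 * 0.85 * 0.8
curve = survival_curve(example_risks)
print(f"P(survive all 3 periods) = {p_survive:.3f}")
print("survival curve:", curve)
```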

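Expected survival time, measured as the period in which the catastrophe occurs:
$E[T] = \sum_{t=1}^{\infty} t \cdot P(\text{survival exactly }t\text{ periods})$

where "survival exactly $t$ periods" means surviving periods $1$ through $t-1$ and losing in period $t$:
$P(\text{survival exactly }t\text{ periods}) = X(t) \cdot \prod_{s=1}^{t-1} (1 - X(s))$

If the risk is constant across periods, $X(t) = X$, this reduces to the geometric form $(1-X)^{t-1} \cdot X$, and $E[T] = 1/X$.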
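
The expected survival time can be approximated numerically by truncating the infinite sum at a long horizon. The sketch below treats "survival exactly $t$ periods" as a catastrophe in period $t$ after surviving all earlier periods, and checks the result against the closed form $1/X$ that holds when the per-period risk is constant. The function names, the constant `X_CONST`, and the horizon are illustrative assumptions.

```python
import numpy as np


def catastrophe_time_pmf(period_risks):
    """P(catastrophe in period t) = X(t) * prod_{s<t} (1 - X(s))."""
    x = np.asarray(period_risks, dtype=float)
    # Probability of having survived all periods before t (1 for t = 1).
    survived_before = np.concatenate(([1.0], np.cumprod(1.0 - x)[:-1]))
    return x * survived_before


def expected_survival_time(period_risks):
    """Truncated E[T] = sum_t t * P(catastrophe in period t), in periods."""
    pmf = catastrophe_time_pmf(period_risks)
    t = np.arange(1, len(pmf) + 1)
    return float(np.sum(t * pmf))


X_CONST = 0.1756   # illustrative constant per-period risk
HORIZON = 2000     # truncation point; the remaining tail is negligible here
approx = expected_survival_time(np.full(HORIZON, X_CONST))
print(f"truncated sum: {approx:.4f} periods")
print(f"closed form:   {1.0 / X_CONST:.4f} periods")
```

The result is in units of periods, so its meaning in years depends on how long one period is taken to be.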