# **Course Project ADA511 COVID epidemic from ODE -> utility/decision theory**



The SIR model splits the population into three compartments that evolve over time $t$:

- $S(t)$ is the number of susceptible individuals.
- $I(t)$ is the number of infected individuals.
- $R(t)$ is the number of removed (recovered or deceased) individuals.

The ODEs are given by:
$$
\begin{aligned}
\frac{dS}{dt} &= -\frac{\beta S I}{N}, \\[1ex]
\frac{dI}{dt} &= \frac{\beta S I}{N} - \gamma I, \\[1ex]
\frac{dR}{dt} &= \gamma I,
\end{aligned}
$$

where $\beta$ is the transmission rate or infectious rate, 
$\gamma$ is the recovery rate, and $N$ is the total population. 
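The equations above can be integrated numerically. The following is a minimal sketch using a fixed-step fourth-order Runge-Kutta scheme in plain Python; the function names (`sir_rhs`, `rk4_step`, `simulate_sir`), the step size, and all numeric values are assumptions made for illustration, not values fitted to COVID data.

```python
def sir_rhs(s, i, r, beta, gamma, n_pop):
    """Right-hand side of the SIR equations: (dS/dt, dI/dt, dR/dt)."""
    new_infections = beta * s * i / n_pop
    recoveries = gamma * i
    return -new_infections, new_infections - recoveries, recoveries


def rk4_step(state, h, beta, gamma, n_pop):
    """Advance the (S, I, R) state by one Runge-Kutta step of size h."""
    def shifted(k, scale):
        return tuple(x + scale * dk for x, dk in zip(state, k))

    k1 = sir_rhs(*state, beta, gamma, n_pop)
    k2 = sir_rhs(*shifted(k1, h / 2), beta, gamma, n_pop)
    k3 = sir_rhs(*shifted(k2, h / 2), beta, gamma, n_pop)
    k4 = sir_rhs(*shifted(k3, h), beta, gamma, n_pop)
    return tuple(
        x + h / 6 * (a + 2 * b + 2 * c + d)
        for x, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def simulate_sir(s0, i0, r0, beta, gamma, days, steps_per_day=10):
    """Return a list with one (S, I, R) tuple per day, starting at day 0."""
    n_pop = s0 + i0 + r0  # total population N, constant over time
    h = 1.0 / steps_per_day
    state = (float(s0), float(i0), float(r0))
    trajectory = [state]
    for _ in range(days):
        for _ in range(steps_per_day):
            state = rk4_step(state, h, beta, gamma, n_pop)
        trajectory.append(state)
    return trajectory


# Hypothetical example: 1000 people, 10 initially infected.
trajectory = simulate_sir(990, 10, 0, beta=0.3, gamma=0.1, days=60)
peak_day = max(range(len(trajectory)), key=lambda d: trajectory[d][1])
print("day of peak infection:", peak_day)
```

A library integrator would normally replace the hand-written step; the explicit version is shown only to keep the sketch self-contained.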




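Calculate the utility from the least-squares error between the observed and fitted numbers of infected individuals:

$$\text{Utility} = \sum_{i=1}^{n} (I_{\text{observed}}[i] - I_{\text{fitted}}[i])^2$$

Here $n$ is the number of data points (not to be confused with the total population $N$). Because this quantity measures error, smaller values indicate a better fit.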
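As a minimal sketch, the sum of squared errors can be computed as follows. The function name `sum_squared_error` and the two short data series are hypothetical and serve only to show the arithmetic.

```python
def sum_squared_error(observed, fitted):
    """Sum of squared differences between observed and fitted values."""
    if len(observed) != len(fitted):
        raise ValueError("observed and fitted must have the same length")
    return sum((o - f) ** 2 for o, f in zip(observed, fitted))


# Hypothetical infected counts on four consecutive days.
observed_infected = [10, 14, 19, 27]
fitted_infected = [10, 13, 20, 25]

# Squared differences: 0 + 1 + 1 + 4
print(sum_squared_error(observed_infected, fitted_infected))  # 6
```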



## **Utility matrix and application of decision theory to SIR model parameters ($\beta$ and $\gamma$)**

### Step 1: Define utilities

- $ U_{AA} $: The utility of choosing a parameter value when it is indeed the best fit.
- $ U_{AB} $: The utility of choosing a parameter value when it is not the best fit.

For simplicity, use a binary utility matrix:

- $ U_{AA} = 1 $ (High utility for a good fit)
- $ U_{AB} = 0 $ (No utility for a bad fit)

### Step 2: Set a Threshold Probability

Determine a threshold probability $p$ above which a parameter value is considered a good fit.

### Step 3: Create the Utility Matrix

Construct a utility matrix using the formula from the image:

$$ 
\text{Utility Matrix} = 
\begin{bmatrix}
U_{AA} & U_{AB} \\
U_{BA} & U_{BB}
\end{bmatrix}
$$

$U_{BA}$ (the utility of not choosing a parameter value when it was actually the best fit) can be considered the cost of a false negative, so we may want to set it to a negative value to represent a penalty. Let's say $U_{BA} = -1$. The remaining entry, $U_{BB}$ (the utility of not choosing a parameter value when it is not the best fit), is set to $0$.

### Step 4: Calculate the Expected Utilities

For each parameter value, calculate the expected utility based on the estimated probabilities, which can be deduced from the histograms. The observed distribution of utility scores is used to estimate these probabilities.
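One possible way to turn a distribution of utility scores into a probability is to take, for each parameter range, the fraction of fits whose least-squares error stays at or below a cutoff. This is only a sketch under that assumption: the cutoff `error_cutoff`, the function `good_fit_probability`, the range labels, and the error values are all hypothetical.

```python
def good_fit_probability(errors, error_cutoff):
    """Fraction of fits whose least-squares error is at or below the cutoff."""
    if not errors:
        raise ValueError("need at least one error value")
    return sum(e <= error_cutoff for e in errors) / len(errors)


# Hypothetical least-squares errors from repeated fits, grouped by beta range.
errors_by_beta_range = {
    "0.20-0.25": [120.0, 95.0, 210.0, 80.0, 150.0],
    "0.25-0.30": [40.0, 55.0, 110.0, 35.0, 60.0],
}
error_cutoff = 100.0  # assumed definition of a "good" fit

probabilities = {
    beta_range: good_fit_probability(errors, error_cutoff)
    for beta_range, errors in errors_by_beta_range.items()
}
print(probabilities)
```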

### Step 5: Apply Decision Rules

Use the utility matrix to make decisions. Choose the parameter value that has the highest expected utility, which is calculated as follows:

- For a given parameter value, if the probability of it being a good fit is $p$, and the probability of it not being a good fit is $1-p$, the expected utility $EU$ of choosing that parameter value is:

$$ 
EU = p \cdot U_{AA} + (1-p) \cdot U_{AB}
$$

Both terms come from the first row of the utility matrix, because both describe an outcome of choosing the value. The second row gives the expected utility of not choosing it, $p \cdot U_{BA} + (1-p) \cdot U_{BB}$.

Choose the parameter value that maximizes this expected utility.

### Step 6: Implement the Decision

Based on the expected utilities, decide on the values of $\beta$ and $\gamma$ to use for the SIR model.

### Example:

Suppose that, from the histograms, we estimate the probability of a certain range of $\beta$ values providing a good fit to be 0.8 (80%). The utility matrix might look like this:

$$ 
\text{Utility Matrix} = 
\begin{bmatrix}
1 & 0 \\
-1 & 0
\end{bmatrix}
$$

The expected utility for choosing a $\beta$ value in this range would be:

$$ 
EU = 0.8 \cdot 1 + 0.2 \cdot 0 = 0.8
$$

whereas not choosing it would give $0.8 \cdot (-1) + 0.2 \cdot 0 = -0.8$.

If this is the highest expected utility across all ranges of $\beta$ values, then this range would be the decision.
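The decision rule can be sketched in a few lines of Python. The sketch assumes, following the definitions in Step 1, that the first index of each utility is the decision (choose or reject the value) and the second is the true state (good fit or not). The labels, the `expected_utility` helper, and the probabilities for the three $\beta$ ranges are hypothetical.

```python
# Utility matrix entries, keyed by (decision, true state).
UTILITY = {
    ("choose", "good"): 1.0,   # U_AA
    ("choose", "bad"): 0.0,    # U_AB
    ("reject", "good"): -1.0,  # U_BA
    ("reject", "bad"): 0.0,    # U_BB
}


def expected_utility(decision, p_good):
    """Expected utility of a decision given the probability of a good fit."""
    return (
        p_good * UTILITY[(decision, "good")]
        + (1.0 - p_good) * UTILITY[(decision, "bad")]
    )


# Hypothetical probabilities of a good fit for three ranges of beta.
p_good_by_range = {"0.20-0.25": 0.4, "0.25-0.30": 0.8, "0.30-0.35": 0.55}

# Pick the range whose "choose" decision has the highest expected utility.
best_range = max(
    p_good_by_range,
    key=lambda r: expected_utility("choose", p_good_by_range[r]),
)
print(best_range)  # 0.25-0.30
```

With this particular matrix the ranking of ranges follows the ranking of their probabilities, so the range with the highest probability of a good fit is selected.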

Let $O_c$ represent the observed number of individuals and $F_c$ represent the fitted number from the model for a given compartment $c$ (which could be Susceptible $S$, Infected $I$, or Recovered $R$). The utility function $U$ for each compartment can be defined as:

$$
U(O_c, F_c, c) = 
\begin{cases} 
- \alpha \cdot (O_c - F_c)^2 & \text{if } O_c > F_c \text{ and } c = I \\
- \beta \cdot (O_c - F_c)^2 & \text{if } O_c \leq F_c \text{ and } c = I \\
- \gamma \cdot (O_c - F_c)^2 & \text{if } c = S \\
- \delta \cdot (O_c - F_c)^2 & \text{if } O_c > F_c \text{ and } c = R \\
- \epsilon \cdot (O_c - F_c)^2 & \text{if } O_c \leq F_c \text{ and } c = R \\
\end{cases}
$$

where:
- $\alpha$ is the penalty for underestimating the number of Infected,
- $\beta$ is the penalty for overestimating the number of Infected,
- $\gamma$ is the penalty for errors in estimating the number of Susceptible (assuming it's symmetric),
- $\delta$ is the penalty for underestimating the number of Recovered,
- $\epsilon$ is the penalty for overestimating the number of Recovered.

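The penalties $\alpha$, $\beta$, $\gamma$, $\delta$, and $\epsilon$ are non-negative weights chosen based on the relative importance and consequences of errors in each compartment. In this utility function, $\beta$ and $\gamma$ denote penalty weights, not the transmission and recovery rates of the SIR model.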

The total utility for a set of parameter values is then the sum of the utilities for each compartment:

$$
U_{\text{total}} = U(O_S, F_S, S) + U(O_I, F_I, I) + U(O_R, F_R, R)
$$

This total utility $U_{\text{total}}$ is used to evaluate the fit of the model. Because every term is a non-positive penalty, the best parameter values are those that maximize $U_{\text{total}}$ (bring it closest to zero), indicating the smallest weighted errors across all compartments.
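A minimal sketch of the compartment-wise utility is given below. The weights are keyed by the case condition from the piecewise definition ($O_c > F_c$ or $O_c \leq F_c$); the weight values, the names `WEIGHTS`, `compartment_utility`, and `total_utility`, and the observed and fitted counts are all hypothetical.

```python
# Hypothetical penalty weights, keyed by compartment and case condition.
WEIGHTS = {
    "I": {"O>F": 2.0, "O<=F": 1.0},  # alpha, beta (penalty weights)
    "S": {"O>F": 0.5, "O<=F": 0.5},  # gamma, symmetric
    "R": {"O>F": 1.5, "O<=F": 1.0},  # delta, epsilon
}


def compartment_utility(observed, fitted, compartment):
    """Negative weighted squared error for one compartment."""
    case = "O>F" if observed > fitted else "O<=F"
    return -WEIGHTS[compartment][case] * (observed - fitted) ** 2


def total_utility(observed, fitted):
    """Sum of the compartment utilities over S, I, and R."""
    return sum(
        compartment_utility(observed[c], fitted[c], c) for c in ("S", "I", "R")
    )


# Hypothetical observed and fitted counts at a single time point.
observed = {"S": 900, "I": 80, "R": 20}
fitted = {"S": 905, "I": 70, "R": 25}

# S: -0.5 * 25, I: -2.0 * 100, R: -1.0 * 25
print(total_utility(observed, fitted))  # -237.5
```

Every term is a negative penalty, so a total closer to zero corresponds to smaller weighted errors.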