# Derivation: Single Index Model Covariance Matrix

In this derivation, we show how to compute the covariance between two assets using the single index model. This is a key result for portfolio optimization, as it allows us to build covariance matrices without estimating all pairwise correlations directly from data.

___

## The Single Index Model

Recall the single index model for firm $i$ at time $t$:
$$
g^{(t)}_{i} = \alpha_{i} + \beta_{i}\;g^{(t)}_{M} + \epsilon^{(t)}_{i}
$$
where $\alpha_{i}$ is the idiosyncratic (firm-specific) growth, $\beta_{i}$ is the sensitivity of firm $i$ to market movements, $g^{(t)}_{M}$ is the growth rate of the market index at time $t$, and $\epsilon^{(t)}_{i}$ is the error term for firm $i$ (describes growth rate not captured by the firm or market factors).

> __Error Model Assumptions__
>
> We assume the error model has the following properties:
> $$
> \begin{align*}
> \mathbb{E}[\epsilon^{(t)}_{i}] &= 0\\
> \text{Var}(\epsilon^{(t)}_{i}) &= \Delta{t}\;\sigma^2_{\epsilon_i}\\
> \text{Cov}(\epsilon^{(t)}_{i}, g^{(t)}_{M}) &= 0\\
> \text{Cov}(\epsilon^{(t)}_{i}, \epsilon^{(t)}_{j}) &= 0\quad\text{for }i \neq j
> \end{align*}
> $$

The first assumption says errors have zero mean. The second specifies the error variance, where $\Delta{t}$ is the time step (e.g., $\Delta{t} = 1/252$ for daily data, annualized). The third assumption says errors are uncorrelated with the market. The fourth and most crucial assumption says that after accounting for market movements, the remaining variation in firm growth rates is uncorrelated across firms. This is what allows us to dramatically simplify the covariance structure.
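To make the assumptions concrete, here is a minimal simulation sketch. It assumes Gaussian market growth and Gaussian errors, which is stronger than the model requires (the derivation only uses the moment conditions above). The function name `simulate_sim` and all parameter values are hypothetical and chosen for illustration.

```python
import random

def simulate_sim(alpha, beta, sigma2_eps, mu_m, sigma2_m, dt, n_steps, seed=0):
    """Simulate (g_M, g_i) pairs from the single index model.

    Assumption (not part of the model): market growth and errors are drawn
    from independent normal distributions.
    """
    rng = random.Random(seed)
    sd_m = sigma2_m ** 0.5
    sd_eps = (dt * sigma2_eps) ** 0.5   # Var(eps) = dt * sigma2_eps
    market, firm = [], []
    for _ in range(n_steps):
        g_m = rng.gauss(mu_m, sd_m)     # market growth at this time step
        eps = rng.gauss(0.0, sd_eps)    # zero-mean error, drawn independently of g_m
        market.append(g_m)
        firm.append(alpha + beta * g_m + eps)
    return market, firm

market, firm = simulate_sim(alpha=0.01, beta=1.2, sigma2_eps=0.03,
                            mu_m=0.05, sigma2_m=0.04, dt=1 / 252, n_steps=5)
for g_m, g_i in zip(market, firm):
    print(f"g_M = {g_m:+.4f}   g_i = {g_i:+.4f}")
```

Subtracting $\alpha_i + \beta_i\,g_M$ from each simulated $g_i$ recovers the error draws, whose sample mean and variance should be close to $0$ and $\Delta{t}\;\sigma^2_{\epsilon_i}$ for long simulations.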

___

## Derivation: Covariance Between Two Different Assets

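Let's compute the covariance between the growth rates of two different firms $i$ and $j$ where $i \neq j$. For readability, we drop the time superscript $(t)$ in what follows; all quantities refer to the same time step. Starting with the definition of covariance: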
$$
\text{Cov}(g_i, g_j) = \mathbb{E}\left[(g_i - \mathbb{E}[g_i])(g_j - \mathbb{E}[g_j])\right]
$$

First, we need the expected growth rates. Taking expectations of the SIM:
$$
\begin{align*}
\mathbb{E}[g_i] &= \mathbb{E}[\alpha_{i} + \beta_{i}\;g_{M} + \epsilon_{i}]\\
&= \alpha_{i} + \beta_{i}\;\mathbb{E}[g_{M}] + \mathbb{E}[\epsilon_{i}]\\
&= \alpha_{i} + \beta_{i}\;\mathbb{E}[g_{M}]
\end{align*}
$$
Similarly, $\mathbb{E}[g_j] = \alpha_{j} + \beta_{j}\;\mathbb{E}[g_{M}]$.

Now we compute the deviations from the mean:
$$
\begin{align*}
g_i - \mathbb{E}[g_i] &= (\alpha_{i} + \beta_{i}\;g_{M} + \epsilon_{i}) - (\alpha_{i} + \beta_{i}\;\mathbb{E}[g_{M}])\\
&= \beta_{i}(g_{M} - \mathbb{E}[g_{M}]) + \epsilon_{i}
\end{align*}
$$
Similarly, $g_j - \mathbb{E}[g_j] = \beta_{j}(g_{M} - \mathbb{E}[g_{M}]) + \epsilon_{j}$.

Substituting into the covariance formula:
$$
\begin{align*}
\text{Cov}(g_i, g_j) &= \mathbb{E}\left[\left(\beta_{i}(g_{M} - \mathbb{E}[g_{M}]) + \epsilon_{i}\right)\left(\beta_{j}(g_{M} - \mathbb{E}[g_{M}]) + \epsilon_{j}\right)\right]
\end{align*}
$$

Expanding the product:
$$
\begin{align*}
\text{Cov}(g_i, g_j) &= \mathbb{E}\Big[\beta_{i}\beta_{j}(g_{M} - \mathbb{E}[g_{M}])^2 + \beta_{i}(g_{M} - \mathbb{E}[g_{M}])\epsilon_{j}\\
&\quad + \beta_{j}(g_{M} - \mathbb{E}[g_{M}])\epsilon_{i} + \epsilon_{i}\epsilon_{j}\Big]
\end{align*}
$$

Taking expectations term by term using linearity:
$$
\begin{align*}
\text{Cov}(g_i, g_j) &= \beta_{i}\beta_{j}\;\mathbb{E}[(g_{M} - \mathbb{E}[g_{M}])^2] + \beta_{i}\;\mathbb{E}[(g_{M} - \mathbb{E}[g_{M}])\epsilon_{j}]\\
&\quad + \beta_{j}\;\mathbb{E}[(g_{M} - \mathbb{E}[g_{M}])\epsilon_{i}] + \mathbb{E}[\epsilon_{i}\epsilon_{j}]
\end{align*}
$$

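Now we apply our assumptions. Because $\mathbb{E}[\epsilon_{i}] = \mathbb{E}[\epsilon_{j}] = 0$, each expectation involving an error term is itself a covariance: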
* $\mathbb{E}[(g_{M} - \mathbb{E}[g_{M}])^2] = \text{Var}(g_M) = \sigma^2_m$ (market variance)
* $\mathbb{E}[(g_{M} - \mathbb{E}[g_{M}])\epsilon_{j}] = \text{Cov}(g_M, \epsilon_j) = 0$ (market uncorrelated with errors)
* $\mathbb{E}[(g_{M} - \mathbb{E}[g_{M}])\epsilon_{i}] = \text{Cov}(g_M, \epsilon_i) = 0$ (market uncorrelated with errors)
* $\mathbb{E}[\epsilon_{i}\epsilon_{j}] = \text{Cov}(\epsilon_i, \epsilon_j) = 0$ for $i \neq j$ (errors uncorrelated across firms)

Therefore:
$$
\boxed{\text{Cov}(g_i, g_j) = \beta_{i}\beta_{j}\;\sigma^2_m \quad \text{for } i \neq j\quad\blacksquare}
$$

> __Key Insight:__ The covariance between any two different assets depends only on their market exposures ($\beta_i$ and $\beta_j$) and the market variance ($\sigma^2_m$). All correlation between assets arises through their common exposure to market movements.
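The boxed result can be checked numerically. The sketch below is a Monte Carlo illustration, not a proof: it assumes Gaussian draws, sets $\alpha_i$, $\alpha_j$ and $\mathbb{E}[g_M]$ to zero (they shift means only), and uses hypothetical parameter values. The helper names `sample_cov` and `simulate_pair` are introduced here for illustration.

```python
import random

def sample_cov(x, y):
    """Unbiased sample covariance of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / (n - 1)

def simulate_pair(beta_i, beta_j, sigma2_m, idio_var_i, idio_var_j, n, seed=0):
    """Draw n joint observations of (g_i, g_j) that share one market factor.

    idio_var_i and idio_var_j are the full error variances (dt * sigma2_eps).
    """
    rng = random.Random(seed)
    g_i, g_j = [], []
    for _ in range(n):
        g_m = rng.gauss(0.0, sigma2_m ** 0.5)       # common market shock
        eps_i = rng.gauss(0.0, idio_var_i ** 0.5)   # firm-specific shocks,
        eps_j = rng.gauss(0.0, idio_var_j ** 0.5)   # drawn independently
        g_i.append(beta_i * g_m + eps_i)
        g_j.append(beta_j * g_m + eps_j)
    return g_i, g_j

beta_i, beta_j, sigma2_m = 0.8, 1.2, 0.04
g_i, g_j = simulate_pair(beta_i, beta_j, sigma2_m, 0.02 / 252, 0.03 / 252, n=200_000)
print("sample covariance      :", sample_cov(g_i, g_j))
print("beta_i*beta_j*sigma2_m :", beta_i * beta_j * sigma2_m)
```

The two printed numbers should agree up to sampling noise, and the idiosyncratic variances do not enter the theoretical value at all.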

___

## Derivation: Variance of a Single Asset

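Now let's compute the variance of a single asset $i$, which is the special case $i = j$ of the covariance definition. The boxed result above does not carry over directly, because $\mathbb{E}[\epsilon_{i}\epsilon_{i}] = \text{Var}(\epsilon_i)$ is not zero. The variance is: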
$$
\text{Var}(g_i) = \text{Cov}(g_i, g_i) = \mathbb{E}\left[(g_i - \mathbb{E}[g_i])^2\right]
$$

From our earlier work, we have:
$$
g_i - \mathbb{E}[g_i] = \beta_{i}(g_{M} - \mathbb{E}[g_{M}]) + \epsilon_{i}
$$

Squaring both sides:
$$
\begin{align*}
(g_i - \mathbb{E}[g_i])^2 &= \left(\beta_{i}(g_{M} - \mathbb{E}[g_{M}]) + \epsilon_{i}\right)^2\\
&= \beta^2_{i}(g_{M} - \mathbb{E}[g_{M}])^2 + 2\beta_{i}(g_{M} - \mathbb{E}[g_{M}])\epsilon_{i} + \epsilon^2_{i}
\end{align*}
$$

Taking expectations:
$$
\begin{align*}
\text{Var}(g_i) &= \beta^2_{i}\;\mathbb{E}[(g_{M} - \mathbb{E}[g_{M}])^2] + 2\beta_{i}\;\mathbb{E}[(g_{M} - \mathbb{E}[g_{M}])\epsilon_{i}] + \mathbb{E}[\epsilon^2_{i}]
\end{align*}
$$

Applying our assumptions where $\mathbb{E}[(g_{M} - \mathbb{E}[g_{M}])^2] = \sigma^2_m$ is the market variance, $\mathbb{E}[(g_{M} - \mathbb{E}[g_{M}])\epsilon_{i}] = 0$ because the market is uncorrelated with errors, and $\mathbb{E}[\epsilon^2_{i}] = \text{Var}(\epsilon_i) = \Delta{t}\;\sigma^2_{\epsilon_i}$ is the error variance, we obtain:
$$
\boxed{\text{Var}(g_i) = \beta^2_{i}\;\sigma^2_m + \Delta{t}\;\sigma^2_{\epsilon_i}\quad\blacksquare}
$$

> __Risk Decomposition__ 
>
> The total variance of firm $i$ consists of two components: __systematic risk__ $\beta^2_{i}\;\sigma^2_m$, which is the risk from market exposure that cannot be diversified away, and __idiosyncratic risk__ $\Delta{t}\;\sigma^2_{\epsilon_i}$, which is firm-specific risk that can be diversified away in a portfolio.
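The decomposition is easy to compute once the parameters are known. The following sketch uses a hypothetical helper, `risk_decomposition`, with illustrative parameter values; the "systematic share" it reports is an extra convenience quantity not defined in the derivation.

```python
def risk_decomposition(beta, sigma2_m, sigma2_eps, dt):
    """Split Var(g_i) into its systematic and idiosyncratic parts."""
    systematic = beta ** 2 * sigma2_m    # beta_i^2 * sigma_m^2
    idiosyncratic = dt * sigma2_eps      # dt * sigma_eps_i^2
    total = systematic + idiosyncratic
    return {
        "systematic": systematic,
        "idiosyncratic": idiosyncratic,
        "total": total,
        "systematic_share": systematic / total,
    }

parts = risk_decomposition(beta=1.2, sigma2_m=0.04, sigma2_eps=0.03, dt=1 / 252)
for name, value in parts.items():
    print(f"{name:>17}: {value:.6f}")
```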

___

## The Complete SIM Covariance Formula

Combining our results, we have the general covariance formula:
$$
\boxed{
\text{Cov}(g_i, g_j) = \begin{cases}
\beta^2_{i}\;\sigma^2_m + \Delta{t}\;\sigma^2_{\epsilon_i} & \text{if } i = j\\
\beta_{i}\beta_{j}\;\sigma^2_m & \text{if } i \neq j
\end{cases}\quad\blacksquare
}
$$

This formula is the foundation for constructing covariance matrices in SIM-based portfolio optimization.

### Constructing the Covariance Matrix

For a portfolio with $N$ assets, the covariance matrix $\boldsymbol{\Sigma}$ is an $N \times N$ symmetric matrix where:
$$
\boldsymbol{\Sigma}_{i,j} = \text{Cov}(g_i, g_j)
$$

Using the SIM, we can construct this matrix element by element:

__Diagonal elements__ (variances):
$$
\boldsymbol{\Sigma}_{i,i} = \beta^2_{i}\;\sigma^2_m + \Delta{t}\;\sigma^2_{\epsilon_i}
$$

__Off-diagonal elements__ (covariances):
$$
\boldsymbol{\Sigma}_{i,j} = \beta_{i}\beta_{j}\;\sigma^2_m \quad \text{for } i \neq j
$$
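
The two element-wise rules can also be collected into a single matrix expression. Writing $\boldsymbol{\beta} = (\beta_1, \ldots, \beta_N)^{\top}$ (a symbol introduced here for convenience), one compact way to state the same result is:
$$
\boldsymbol{\Sigma} = \sigma^2_m\;\boldsymbol{\beta}\boldsymbol{\beta}^{\top} + \Delta{t}\;\text{diag}\left(\sigma^2_{\epsilon_1}, \ldots, \sigma^2_{\epsilon_N}\right)
$$
The rank-one term contributes $\beta_{i}\beta_{j}\;\sigma^2_m$ to every entry, including the diagonal, while the diagonal matrix adds the idiosyncratic variance only where $i = j$.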

> __Computational Advantage__ 
>
> Using the SIM, we only need to estimate $N$ beta values ($\beta_1, \beta_2, \ldots, \beta_N$), $N$ residual variances ($\sigma^2_{\epsilon_1}, \sigma^2_{\epsilon_2}, \ldots, \sigma^2_{\epsilon_N}$), and one market variance ($\sigma^2_m$). This is a total of $2N + 1$ parameters, compared to $N(N+1)/2$ unique elements in a full covariance matrix. For a 100-asset portfolio, this reduces from 5,050 parameters to just 201!
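
The parameter counts above can be tabulated for a few portfolio sizes. This is a small illustrative sketch; the function names are hypothetical.

```python
def sim_parameter_count(n_assets):
    """N betas + N residual variances + 1 market variance."""
    return 2 * n_assets + 1

def full_covariance_count(n_assets):
    """Unique entries of a symmetric N x N covariance matrix."""
    return n_assets * (n_assets + 1) // 2

for n in (10, 100, 500):
    print(n, sim_parameter_count(n), full_covariance_count(n))
# 10 21 55
# 100 201 5050
# 500 1001 125250
```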

___

## Example: Three-Asset Portfolio

Let's construct the covariance matrix for a simple three-asset portfolio to see how this works in practice. Suppose we have estimated the following SIM parameters:
* Asset 1: $\beta_1 = 0.8$ and $\sigma^2_{\epsilon_1} = 0.02$
* Asset 2: $\beta_2 = 1.2$ and $\sigma^2_{\epsilon_2} = 0.03$
* Asset 3: $\beta_3 = 1.5$ and $\sigma^2_{\epsilon_3} = 0.04$
* Market variance: $\sigma^2_m = 0.04$
* Time step: $\Delta{t} = 1/252$ (daily data, annualized)

### Computing the Diagonal Elements

$$
\begin{align*}
\boldsymbol{\Sigma}_{1,1} &= \beta^2_1\;\sigma^2_m + \Delta{t}\;\sigma^2_{\epsilon_1} = (0.8)^2(0.04) + \frac{1}{252}(0.02) \approx 0.0256 + 0.0000794 \approx 0.0257\\
\boldsymbol{\Sigma}_{2,2} &= \beta^2_2\;\sigma^2_m + \Delta{t}\;\sigma^2_{\epsilon_2} = (1.2)^2(0.04) + \frac{1}{252}(0.03) \approx 0.0576 + 0.0001190 \approx 0.0577\\
\boldsymbol{\Sigma}_{3,3} &= \beta^2_3\;\sigma^2_m + \Delta{t}\;\sigma^2_{\epsilon_3} = (1.5)^2(0.04) + \frac{1}{252}(0.04) \approx 0.0900 + 0.0001587 \approx 0.0902
\end{align*}
$$

### Computing the Off-Diagonal Elements

$$
\begin{align*}
\boldsymbol{\Sigma}_{1,2} = \boldsymbol{\Sigma}_{2,1} &= \beta_1\beta_2\;\sigma^2_m = (0.8)(1.2)(0.04) = 0.0384\\
\boldsymbol{\Sigma}_{1,3} = \boldsymbol{\Sigma}_{3,1} &= \beta_1\beta_3\;\sigma^2_m = (0.8)(1.5)(0.04) = 0.0480\\
\boldsymbol{\Sigma}_{2,3} = \boldsymbol{\Sigma}_{3,2} &= \beta_2\beta_3\;\sigma^2_m = (1.2)(1.5)(0.04) = 0.0720
\end{align*}
$$

### The Complete Covariance Matrix

$$
\boldsymbol{\Sigma} = \begin{bmatrix}
0.0257 & 0.0384 & 0.0480\\
0.0384 & 0.0577 & 0.0720\\
0.0480 & 0.0720 & 0.0902
\end{bmatrix}
$$

> __Observation:__ The largest covariance (0.0720) is between assets 2 and 3, which have the two highest betas (1.2 and 1.5). This makes sense: high-beta stocks move together more strongly because they both amplify market movements. The idiosyncratic risk terms ($\Delta{t}\;\sigma^2_{\epsilon_i}$) appear only on the diagonal and, in this example, are small compared to the systematic risk.
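
The three-asset matrix can be reproduced with a few lines of code. The sketch below uses plain Python lists and a hypothetical helper, `sim_covariance_matrix`; it applies the diagonal and off-diagonal formulas exactly as stated above, using the parameter values from this example.

```python
def sim_covariance_matrix(betas, sigma2_eps, sigma2_m, dt):
    """Build the SIM covariance matrix as a list of rows."""
    n = len(betas)
    # Systematic part: beta_i * beta_j * sigma_m^2 in every entry
    sigma = [[betas[i] * betas[j] * sigma2_m for j in range(n)] for i in range(n)]
    for i in range(n):
        sigma[i][i] += dt * sigma2_eps[i]   # idiosyncratic variance, diagonal only
    return sigma

betas = [0.8, 1.2, 1.5]
sigma2_eps = [0.02, 0.03, 0.04]
Sigma = sim_covariance_matrix(betas, sigma2_eps, sigma2_m=0.04, dt=1 / 252)
for row in Sigma:
    print("  ".join(f"{value:.4f}" for value in row))
# 0.0257  0.0384  0.0480
# 0.0384  0.0577  0.0720
# 0.0480  0.0720  0.0902
```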

___

## Summary

In this derivation, we showed how the single index model dramatically simplifies the computation of asset covariances. Starting with the SIM for each firm, we computed the covariance between two firms using the definition and properties of expectation. By applying the key assumption that error terms are uncorrelated across firms and with the market, we found that the covariance between two different assets depends only on their betas and the market variance, while each asset's variance adds a single idiosyncratic term.

> __Key Results__
>
> The covariance between two different assets is $\text{Cov}(g_i, g_j) = \beta_i\beta_j\;\sigma^2_m$ for $i \neq j$, meaning all correlations arise from common market exposure. The variance of a single asset is $\text{Var}(g_i) = \beta^2_i\;\sigma^2_m + \Delta{t}\;\sigma^2_{\epsilon_i}$, decomposing total risk into systematic and idiosyncratic components. This dramatically simplifies portfolio optimization by reducing the number of parameters from $O(N^2)$ to $O(N)$.

___