# Chapter 4 - Linear Models of Returns

## 4.1 Factor model

$$\mathbf{r}_t = \mathbf{\alpha} + \mathbf{B}\mathbf{f}_t + \mathbf{\epsilon}_t$$

- $\mathbf{r}_t$: asset returns at time t
- $\mathbf{B}$: factor loading, mapping from factor returns to asset returns
- $\mathbf{f}_t$: factor returns at time t
- $\mathbf{\epsilon}_t$: idiosyncratic returns at time t

Given the factor model, and assuming factor returns and idiosyncratic returns are uncorrelated, the covariance matrix of asset returns is:

$$\mathbf{\Omega_r} = \mathbf{B{\Omega_f}B}^T + \mathbf{\Omega_\epsilon}$$
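
A minimal numerical sketch of the covariance formula above. All inputs are made up for illustration (3 assets, 2 factors), and the idiosyncratic covariance is assumed diagonal; `asset_covariance` is a hypothetical helper name.

```python
import numpy as np

def asset_covariance(B, omega_f, omega_eps):
    """Asset covariance implied by the factor model: B Omega_f B^T + Omega_eps."""
    return B @ omega_f @ B.T + omega_eps

# Hypothetical example: n = 3 assets, m = 2 factors.
B = np.array([[1.0, 0.5],
              [1.0, -0.2],
              [1.0, 0.8]])
omega_f = np.array([[0.04, 0.01],
                    [0.01, 0.02]])
omega_eps = np.diag([0.03, 0.05, 0.02])  # assumed diagonal (uncorrelated idio returns)

omega_r = asset_covariance(B, omega_f, omega_eps)
print(omega_r)
```

The result is symmetric and positive definite as long as `omega_f` is positive semidefinite and the idiosyncratic variances are positive.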

## 4.2 Interpretations

### 4.2.1 Graphical Model

$$E\left(r_i - \alpha_i | f\right) = \sum_j \mathbf{[B]}_{i,j}f_j$$

Each asset return, in excess of alpha, is a linear combination of factor returns. 

$$E\left(r_i - \alpha_i | f\right) = \mathbf{B}_i \cdot \mathbf{f}$$

### 4.2.2 Superposition of Effects

$$E\left(\mathbf{r} - \mathbf{\alpha} | \mathbf{f}\right) = \sum_j \mathbf{[B]}_{\cdot,j}f_j$$

The vector of asset returns, in excess of alpha, is a superposition of the columns of $\mathbf{B}$, each weighted by its factor return.
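
The row view (4.2.1) and the column view (4.2.2) are two ways of reading the same product $\mathbf{B}\mathbf{f}$. A small sketch with made-up loadings and factor returns:

```python
import numpy as np

B = np.array([[1.0, 0.5],
              [1.0, -0.2],
              [1.0, 0.8]])    # hypothetical loadings (3 assets, 2 factors)
f = np.array([0.01, -0.02])   # hypothetical factor returns

# Row view (4.2.1): asset i's excess return is the dot product of row B_i with f.
row_view = np.array([B[i] @ f for i in range(B.shape[0])])

# Column view (4.2.2): sum of the columns of B, each scaled by its factor return.
col_view = sum(B[:, j] * f[j] for j in range(B.shape[1]))

print(row_view, col_view)
```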

### 4.2.3 Single-Asset Product

This is similar to the second equation from 4.2.1 and is used in performance attribution; more on this later.

## 4.3 Alpha spanned and orthogonal

The idea is to decompose the alpha term in the factor model into a spanned part, which lies in the column space of the loading matrix $B$ (i.e. it is some linear combination of $B$'s columns), and an orthogonal part $\alpha_\perp$. A linear combination of $B$'s columns is exactly what the factor term $Bf$ produces, so spanned alpha is really just part of the expected returns of the factors. The orthogonal alpha, on the other hand, is not in $B$'s column space and thus has the following relationship with the Sharpe ratio:

$$SR = \frac{E(w^Tr_t)}{\sqrt{var(w^Tr_t)}} \geq \frac{||\alpha_\perp||}{\sqrt{||\Omega_\epsilon||}} \geq \sqrt{n}\frac{\mu}{\sqrt{||\Omega_\epsilon||}}$$
$$w = \frac{\alpha_\perp}{||\alpha_\perp||}$$

This suggests that additional orthogonal alpha increases the overall Sharpe ratio, which makes it very valuable. Most of the time, however, alpha is spanned, so it comes with systematic risk that can't be diversified away.
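
A sketch that checks the bound numerically on random, made-up inputs. Two assumptions that are not in the notes: $||\Omega_\epsilon||$ is read as the spectral norm, and $\mu$ is read as a lower bound on the absolute entries of $\alpha_\perp$.

```python
import numpy as np

rng = np.random.default_rng(0)
n, m = 50, 3
B = rng.normal(size=(n, m))                            # hypothetical loadings
alpha = rng.normal(scale=0.01, size=n)                 # hypothetical alpha
omega_eps = np.diag(rng.uniform(0.01, 0.04, size=n))   # hypothetical idio variances

# Split alpha into the part spanned by B's columns and the orthogonal remainder.
P = B @ np.linalg.solve(B.T @ B, B.T)   # projector onto the column space of B
alpha_spanned = P @ alpha
alpha_perp = alpha - alpha_spanned

# Portfolio proportional to orthogonal alpha: it has zero factor exposure.
w = alpha_perp / np.linalg.norm(alpha_perp)
exposure = B.T @ w

expected_return = w @ alpha        # equals ||alpha_perp||
variance = w @ omega_eps @ w       # idio risk only, because B^T w = 0
sharpe = expected_return / np.sqrt(variance)

norm_eps = np.linalg.norm(omega_eps, 2)                # spectral norm (assumed reading)
bound = np.linalg.norm(alpha_perp) / np.sqrt(norm_eps)
mu = np.abs(alpha_perp).min()                          # assumed meaning of mu
weaker_bound = np.sqrt(n) * mu / np.sqrt(norm_eps)

print(sharpe, bound, weaker_bound)
```

Because $w$ has unit length, its variance cannot exceed the largest eigenvalue of $\Omega_\epsilon$, which is where the first inequality comes from.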

## 4.4 Transformations

This part is a bit confusing to me since I don't have experience in factor modeling; I will need to revisit it.

### Rotations

General form:

$$\tilde{B} = BC^{-1}$$
$$\tilde{f} = Cf$$

This doesn't change the model, since $\tilde{B}\tilde{f} = BC^{-1}Cf = Bf$ for any invertible $C$; it only moves information between the factor returns and the factor loading matrix.

#### Identity factor covariance matrix

Turn the factor return covariance into the identity, so the rotated factors are uncorrelated and have unit variance. Using the eigendecomposition of $\Omega_f$:

$$\Omega_f = USU^T$$
$$C = S^{-1/2}U^T$$
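
A sketch of this rotation on made-up inputs, checking that the rotated factor covariance $C\Omega_fC^T$ is the identity and that $\tilde{B}\tilde{f} = Bf$. It assumes $\Omega_f$ is positive definite so that $S^{-1/2}$ exists.

```python
import numpy as np

B = np.array([[1.0, 0.5],
              [1.0, -0.2],
              [1.0, 0.8]])             # hypothetical loadings
omega_f = np.array([[0.04, 0.01],
                    [0.01, 0.02]])     # hypothetical factor covariance
f = np.array([0.01, -0.02])            # hypothetical factor returns

# Eigendecomposition omega_f = U S U^T (eigh is meant for symmetric matrices).
s, U = np.linalg.eigh(omega_f)
C = np.diag(s ** -0.5) @ U.T           # C = S^{-1/2} U^T
C_inv = U @ np.diag(s ** 0.5)          # inverse of C

B_tilde = B @ C_inv
f_tilde = C @ f
omega_f_tilde = C @ omega_f @ C.T      # covariance of the rotated factors

print(omega_f_tilde)
```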

#### Orthonormal loadings

Make the columns of the loading matrix orthonormal, using the SVD of $B$:

$$\tilde{B}^T\tilde{B}=I$$
$$B=USV^T$$
$$C=SV^T$$
$$\tilde{B}=BC^{-1}=BVS^{-1}=USV^TVS^{-1}=U$$
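
The same steps in NumPy, on made-up inputs. This assumes $B$ has full column rank (so $S$ is invertible) and uses the thin SVD.

```python
import numpy as np

B = np.array([[1.0, 0.5],
              [1.0, -0.2],
              [1.0, 0.8]])     # hypothetical loadings, full column rank
f = np.array([0.01, -0.02])    # hypothetical factor returns

# Thin SVD: B = U diag(s) Vt, with U of shape (n, m).
U, s, Vt = np.linalg.svd(B, full_matrices=False)

C = np.diag(s) @ Vt                # C = S V^T
B_tilde = B @ np.linalg.inv(C)     # should reproduce U
f_tilde = C @ f

print(B_tilde.T @ B_tilde)
```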

#### z-scored loadings

Z-score each column of the loading matrix (each factor's loadings across assets), which helps in interpreting the influence of each factor on each asset's return.

$$\tilde{B} = B + ev^T = B(I + av^T)$$
$$e = Ba = (1,1,...,1)^T$$
$$C^{-1} = I + av^T$$

A vector $a$ satisfying the second equation need not exist for an arbitrary $B$; it requires the vector of ones to be in the column space of $B$. This step centers each column with the offsets $v=(v_1,...,v_m)^T$, one per factor, and it is a valid rotation only when $1 + v^Ta \neq 0$, so that $C^{-1}$ is invertible.

To scale to unit variance, we can multiply on the right by $diag(k_1, ..., k_m)$, so
$$C^{-1} = \left(I + av^T\right)diag(k)$$
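
A sketch of the centering and scaling steps, under a specific reading that is my assumption: the centering term is taken as $ev^T$ with one offset per factor (column), so that $C^{-1} = (I + av^T)diag(k)$. To keep $C^{-1}$ invertible, the hypothetical $B$ below has an intercept column of ones, which is left untouched ($v_0 = 0$, $k_0 = 1$).

```python
import numpy as np

rng = np.random.default_rng(1)
n = 6
# Hypothetical loadings: an intercept column of ones plus two style columns.
B = np.column_stack([np.ones(n), rng.normal(2.0, 3.0, size=(n, 2))])
m = B.shape[1]

e = np.ones(n)
a = np.array([1.0, 0.0, 0.0])   # e = B @ a, because column 0 of B is all ones

# Offsets and scales per factor; the intercept column is not z-scored.
v = np.concatenate([[0.0], -B[:, 1:].mean(axis=0)])
k = np.concatenate([[1.0], 1.0 / B[:, 1:].std(axis=0)])

C_inv = (np.eye(m) + np.outer(a, v)) @ np.diag(k)
B_tilde = B @ C_inv             # centered and scaled loadings
C = np.linalg.inv(C_inv)        # exists here since 1 + v @ a = 1

print(B_tilde.mean(axis=0), B_tilde.std(axis=0))
```

If every column were centered, including the intercept, then $1 + v^Ta = 0$ and $C^{-1}$ would be singular, which is why this sketch keeps one column fixed.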

### Projections

Reduce the number of factors by projecting the factors onto a lower-dimensional space.

Original model: $r = \alpha + Bf + \epsilon$

New model: $r = \alpha + Ag + \eta$

Minimize $\left\|Bf - Ag\right\|^2$ over $g$, then

$$g = Hf$$
$$H=(A^TA)^{-1}A^TB$$
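
A sketch of the projection on random, made-up loadings, assuming $A$ has full column rank so that $A^TA$ is invertible. It cross-checks $g = Hf$ against NumPy's generic least-squares solver.

```python
import numpy as np

rng = np.random.default_rng(2)
n, m, p = 8, 4, 2                 # assets, original factors, reduced factors
B = rng.normal(size=(n, m))       # hypothetical original loadings
A = rng.normal(size=(n, p))       # hypothetical loadings of the smaller model
f = rng.normal(size=m)            # hypothetical factor returns

# H = (A^T A)^{-1} A^T B, computed with a linear solve instead of an explicit inverse.
H = np.linalg.solve(A.T @ A, A.T @ B)
g = H @ f

# Same answer from a generic least-squares solver applied to A g ~ B f.
g_lstsq, *_ = np.linalg.lstsq(A, B @ f, rcond=None)

# The part of B f that the smaller model cannot capture is orthogonal to A's columns.
residual = B @ f - A @ g

print(g, g_lstsq)
```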

### Push-Outs

Add new factors $g$, with loadings $A$, to an existing model:

$$r = \alpha_{\perp} + Bf + Ag + \eta$$

$A$ must be orthogonal to $B$. If it is not, the parallel component of $A$ will modify the original model.
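
One way to obtain loadings orthogonal to $B$ is to residualize candidate loadings against $B$'s columns. This is a sketch of that idea, not necessarily the construction the chapter uses; `A_raw` and the random inputs are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(3)
n, m, p = 10, 3, 2
B = rng.normal(size=(n, m))        # hypothetical existing loadings
A_raw = rng.normal(size=(n, p))    # hypothetical candidate loadings for the new factors

# Remove from A_raw its component in the column space of B.
A = A_raw - B @ np.linalg.solve(B.T @ B, B.T @ A_raw)

print(B.T @ A)   # numerically zero
```

After this step the new factors only explain what the original factors leave unexplained, so the original loadings $B$ are unchanged.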

## 4.5 Applications

### 4.5.1 Performance Attribution
- PnL = (Factor PnL) + (Idio PnL)
- Factor PnL = $w^TBf = b^Tf$, where $w$ is the vector of portfolio holdings and $b^T = w^TB$ is the portfolio's factor exposure
- Idio PnL = $w^T(\alpha_{\perp} + \epsilon)$
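
A sketch of the attribution above for a single period, with made-up holdings and returns. It also splits the factor PnL by factor, using the elementwise product of exposures and factor returns.

```python
import numpy as np

rng = np.random.default_rng(4)
n, m = 5, 2
B = rng.normal(size=(n, m))                    # hypothetical loadings
w = np.array([100.0, -50.0, 30.0, 0.0, 20.0])  # hypothetical dollar holdings
f = np.array([0.01, -0.02])                    # hypothetical factor returns
alpha_perp = rng.normal(scale=0.001, size=n)   # hypothetical orthogonal alpha
eps = rng.normal(scale=0.01, size=n)           # hypothetical idiosyncratic returns

r = alpha_perp + B @ f + eps                   # asset returns from the model

b = B.T @ w                                    # portfolio factor exposures, b^T = w^T B
factor_pnl_by_factor = b * f                   # contribution of each factor
factor_pnl = b @ f
idio_pnl = w @ (alpha_perp + eps)
total_pnl = w @ r

print(factor_pnl_by_factor, idio_pnl, total_pnl)
```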

### 4.5.2 Risk Management
- $var(w^Tr) = b^T\Omega_{f}b + w^T\Omega_{\epsilon}w$
- Fraction of total variance attributable to group i:
  - grouping = a partition of the constituents into mutually exclusive and exhaustive subsets
  - fraction = (variance of group i + covariance of group i with the other groups) / (total variance)
  - equivalently, fraction = (covariance of group i with the entire portfolio) / (total variance)
- marginal contribution to risk of group i:
  - portfolio vol change if we buy 1 vol of group i
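  - $m_i = \rho_i = p_i \upsilon_{TOT} / \upsilon_i$, where $p_i$ is group i's fraction of total variance, $\upsilon_i$ its vol, and $\rho_i$ its correlation with the whole portfolio. Scaling group i's position by $x_i$ gives $\partial\upsilon_{TOT}/\partial x_i = cov(\text{group } i, \text{portfolio})/\upsilon_{TOT}$ at $x_i = 1$; dividing by $\upsilon_i$ (one unit of group i vol) gives $\rho_i$, and substituting $p_i = cov(\text{group } i, \text{portfolio})/\upsilon_{TOT}^2$ gives the last expression.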
- Sharpe ratio sensitivity: CLARIFY DERIVATION!!
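
A numerical check of the variance fractions and the marginal contribution to risk, on made-up numbers. The groups here are assumed to be factor 1, factor 2 and idio, and "buying 1 vol of group i" is read as scaling group i's position and measuring the change in portfolio vol per unit of group i vol.

```python
import numpy as np

# Hypothetical inputs.
b = np.array([1.2, -0.5])              # factor exposures, b = B^T w
omega_f = np.array([[0.04, 0.01],
                    [0.01, 0.02]])     # factor covariance
idio_var = 0.01                        # w^T Omega_eps w

# Covariance matrix of the group PnLs: factor 1, factor 2, idio.
G = np.zeros((3, 3))
G[:2, :2] = np.outer(b, b) * omega_f   # cov(b_i f_i, b_j f_j)
G[2, 2] = idio_var                     # idio is uncorrelated with the factors

def total_vol(x):
    """Portfolio vol when group i's position is scaled by x[i]."""
    return np.sqrt(x @ G @ x)

ones = np.ones(3)
vol_tot = total_vol(ones)
cov_with_total = G @ ones              # cov(group i, whole portfolio)
p = cov_with_total / vol_tot**2        # fraction of total variance per group
vol_i = np.sqrt(np.diag(G))            # stand-alone vol of each group
rho = cov_with_total / (vol_i * vol_tot)

# Finite-difference marginal contribution: d(vol_tot)/d(x_i), per unit of group vol.
h = 1e-6
m_numeric = np.array([
    (total_vol(ones + h * np.eye(3)[i]) - total_vol(ones - h * np.eye(3)[i]))
    / (2 * h) / vol_i[i]
    for i in range(3)
])

print(p, rho, m_numeric)
```

The fractions `p` sum to one, and the finite-difference values agree with `rho` and with `p * vol_tot / vol_i`.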