# Univariate scan permutation demo:

## Statistical Settings:

Original inputs:

- $y$: an $N \times 1$ vector of measurements of one trait for the $N$ individuals;

- $X = [1, g]$: an $N \times 2$ matrix holding the intercept and the genotypes at one genetic marker for the $N$ individuals;

- $K$: an $N \times N$ matrix of genetic relatedness between the $N$ individuals (pre-calculated from the genome $G$ matrix).

Then, the univariate trait $y$ by assumption has the distribution

$$y|X \sim N(\beta_0 1+\beta_1 g, \space \sigma^2_g K + \sigma^2_e I)$$
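To make the model concrete, here is a minimal sketch that draws one trait vector from this distribution with NumPy. The function name `simulate_trait`, the toy genotype matrix, and the way the toy relatedness matrix is built are all assumptions for illustration; they are not taken from any particular package.

```python
import numpy as np

def simulate_trait(g, K, beta0, beta1, sigma2_g, sigma2_e, rng):
    """Draw y ~ N(beta0 * 1 + beta1 * g, sigma2_g * K + sigma2_e * I)."""
    n = len(g)
    mean = beta0 + beta1 * g
    cov = sigma2_g * K + sigma2_e * np.eye(n)
    # cov is positive definite when K is positive semidefinite and sigma2_e > 0
    chol = np.linalg.cholesky(cov)
    return mean + chol @ rng.standard_normal(n)

# Toy inputs (hypothetical): 20 individuals, 50 markers coded 0/1/2
rng = np.random.default_rng(0)
n, m = 20, 50
G = rng.integers(0, 3, size=(n, m)).astype(float)
Z = G - G.mean(axis=0)
K = Z @ Z.T / m          # one simple way to build a relatedness matrix
g = G[:, 0]              # the marker being tested

y = simulate_trait(g, K, beta0=1.0, beta1=0.5,
                   sigma2_g=0.6, sigma2_e=0.4, rng=rng)
```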

### First transformation - to decorrelate the observations:

Let $K = U D U^T$ be the eigendecomposition of $K$, with $U$ orthogonal and $D$ diagonal. If we let

$$y^* = U^T y$$
$$X^* = [U^T 1, U^T g]$$

then 

$$y^*|X^* \sim N(\beta_0 U^T 1 + \beta_1 U^T g, \space \sigma_e^2 V)$$ 

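where $V = V(h^2) = \frac{h^2}{1-h^2} D + I$ is a diagonal matrix, so the components of $y^*$ are uncorrelated (though still of unequal variance). This follows from $U^T(\sigma^2_g K + \sigma^2_e I)U = \sigma^2_g D + \sigma^2_e I = \sigma^2_e V$, where $V$ is parametrized by the heritability $h^2$, i.e.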

$$h^2 = \frac{\sigma_g^2}{\sigma_g^2+\sigma_e^2}$$
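The rotation can be sketched in a few lines of NumPy. The helper names `rotate` and `v_diag` and the toy data are hypothetical. The form $V = \frac{h^2}{1-h^2} D + I$ used in `v_diag` is the one implied by requiring $\sigma_e^2 V = \sigma_g^2 D + \sigma_e^2 I$.

```python
import numpy as np

def rotate(y, g, K):
    """Rotate the trait, intercept and marker by the eigenvectors of K."""
    # K is symmetric, so eigh returns real eigenvalues d and an
    # orthogonal U with K = U @ diag(d) @ U.T
    d, U = np.linalg.eigh(K)
    y_star = U.T @ y
    X_star = np.column_stack([U.T @ np.ones(len(y)), U.T @ g])
    return y_star, X_star, d

def v_diag(d, h2):
    """Diagonal of V(h2), assuming V = h2 / (1 - h2) * D + I."""
    return h2 / (1.0 - h2) * d + 1.0

# Toy data (hypothetical)
rng = np.random.default_rng(0)
N = 50
G = rng.integers(0, 3, size=(N, 200)).astype(float)
Z = G - G.mean(axis=0)
K = Z @ Z.T / G.shape[1]
g = G[:, 0]
y = rng.normal(size=N)

y_star, X_star, d = rotate(y, g, K)
```

Because $U$ is orthogonal, the rotation changes neither the length of $y$ nor the information it carries; it only diagonalizes the covariance.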

### Second transformation - to obtain homoskedasticity:

If we denote $W = (\sigma^2_e V)^{-1}$, which is diagonal, and let $y^{**} = \sqrt{W}y^*$ and $X^{**} = \sqrt{W}X^*$, then

$$y^{**}|X^{**} \sim N(\beta_0 \sqrt{W}U^T1 + \beta_1 \sqrt{W} U^T g, \space I)$$
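As a numerical check of the two transformations combined, the sketch below builds the linear map $T = \sqrt{W} U^T$ and verifies that it turns the covariance $\sigma^2_g K + \sigma^2_e I$ into the identity. The variance components are treated as known, the toy $K$ is hypothetical, and $V$ is assumed to be $\frac{h^2}{1-h^2} D + I$.

```python
import numpy as np

def whiten(y_star, X_star, d, h2, sigma2_e):
    """Rescale rotated data by sqrt(W), with W = (sigma2_e * V)^{-1}."""
    v = h2 / (1.0 - h2) * d + 1.0            # diagonal of V (assumed form)
    sqrt_w = 1.0 / np.sqrt(sigma2_e * v)     # diagonal of sqrt(W)
    return sqrt_w * y_star, sqrt_w[:, None] * X_star, sqrt_w

# Toy relatedness matrix (hypothetical) and known variance components
rng = np.random.default_rng(1)
N = 30
A = rng.normal(size=(N, 100))
K = A @ A.T / 100
sigma2_g, sigma2_e = 0.6, 0.4
h2 = sigma2_g / (sigma2_g + sigma2_e)

d, U = np.linalg.eigh(K)
y = rng.normal(size=N)
X = np.column_stack([np.ones(N), rng.integers(0, 3, size=N)])
y_ss, X_ss, sqrt_w = whiten(U.T @ y, U.T @ X, d, h2, sigma2_e)

# Covariance of y** implied by the model: T Sigma T^T with T = sqrt(W) U^T
Sigma = sigma2_g * K + sigma2_e * np.eye(N)
T = sqrt_w[:, None] * U.T
cov_whitened = T @ Sigma @ T.T
```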

### Permutation Testing Framework:

$$H_0: \beta_1 = 0;$$

Under the null hypothesis,

$$y^{**}|X^{**} \sim N(\beta_0 \sqrt{W}U^T 1, I)$$

If we project $y^{**}$ onto the space orthogonal to the rotated and reweighted intercept $\sqrt{W} U^T 1$, which is equivalent to taking the residuals from regressing $y^{**}$ on $\sqrt{W} U^T 1$, we get

$$y^{***} = y^{**} - \hat \beta_0 \sqrt{W} U^T 1$$

where $\hat \beta_0$ is the OLS estimate from regressing $y^{**}$ on $X_0^{**} = \sqrt{W} U^T 1$, the first column of $X^{**}$:

$$\hat \beta_0 = (X^{**T}_0 X^{**}_0)^{-1}X^{**T}_0 y^{**}$$



**Note:** this is mathematically equivalent to taking the residuals after performing weighted least squares, with weights $W$, on the unweighted data $(y^{*}, X_0^{*})$, where $X_0^{*} = U^T 1$. Since

$$X_0^{**} = \sqrt{W} U^T1 = \sqrt{W} X_0^{*}$$

$$y^{**} = \sqrt{W} U^T y = \sqrt{W} y^{*}$$

we have

$$\hat \beta_0 = (X^{**T}_0 X^{**}_0)^{-1}X^{**T}_0 y^{**} = (X_0^{*T}W X_0^{*})^{-1}X_0^{*T}W y^{*}$$

Recall that, under $H_0$,

$$y^{*}|X^{*} \sim N(\beta_0 X_0^{*}, \space \sigma_e^2 V), \quad \sigma_e^2 V = W^{-1}$$
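The equivalence between OLS on the reweighted data and weighted least squares on the rotated data is easy to confirm numerically. In this sketch the arrays `y_star`, `x0_star` and `w` are random stand-ins (assumptions) for $U^T y$, $U^T 1$ and the diagonal of $W$.

```python
import numpy as np

rng = np.random.default_rng(2)
N = 40
y_star = rng.normal(size=N)            # stand-in for U^T y
x0_star = rng.normal(size=N)           # stand-in for U^T 1
w = rng.uniform(0.5, 2.0, size=N)      # stand-in for the diagonal of W

# OLS on the reweighted data (y**, X0**)
x0_ss = np.sqrt(w) * x0_star
y_ss = np.sqrt(w) * y_star
beta0_ols = (x0_ss @ y_ss) / (x0_ss @ x0_ss)

# Weighted least squares on the rotated data (y*, X0*) with weights W
beta0_wls = (x0_star @ (w * y_star)) / (x0_star @ (w * x0_star))

# Residuals y*** after removing the fitted intercept term
y_resid = y_ss - beta0_ols * x0_ss
```

Both estimates agree up to floating-point error, and `y_resid` is orthogonal to `x0_ss`, as expected of least-squares residuals.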

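Under $H_0$, the residuals $y^{***}$ have mean zero. Their exact covariance is $I - P_0$, where $P_0$ is the rank-one projection onto $\sqrt{W} U^T 1$, so for large $N$ we have, approximately,

$$y^{***}|X^{**} \sim N(0, I)$$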

Finally, because the entries of $y^{***}$ are (approximately) i.i.d. standard normal under the null hypothesis, they are exchangeable, so permuting $y^{***}$ gives us samples from its null distribution.
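A permutation test built on this idea might look like the following sketch. Several choices here are assumptions rather than part of the derivation above: the test statistic (squared correlation between the residualized trait and a residualized marker column), the function name `permutation_pvalue`, the add-one p-value formula, and the random toy vectors standing in for $y^{***}$ and the transformed marker.

```python
import numpy as np

def permutation_pvalue(y_res, g_res, n_perm=1000, seed=0):
    """Permutation p-value for association between y_res and g_res.

    y_res: residualized, whitened trait (plays the role of y***)
    g_res: whitened marker column with the intercept projected out (assumed)
    """
    rng = np.random.default_rng(seed)

    def stat(y):
        # squared correlation; y @ y is unchanged by permutation
        r = (y @ g_res) / np.sqrt((y @ y) * (g_res @ g_res))
        return r * r

    observed = stat(y_res)
    null_stats = np.array([stat(rng.permutation(y_res)) for _ in range(n_perm)])
    # add-one correction keeps the p-value strictly positive
    p = (1 + np.sum(null_stats >= observed)) / (1 + n_perm)
    return observed, null_stats, p

# Toy stand-ins (hypothetical)
rng = np.random.default_rng(3)
N = 50
g_res = rng.normal(size=N)

y_null = rng.normal(size=N)                     # no marker effect
obs_null, null_stats, p_null = permutation_pvalue(y_null, g_res, n_perm=999)

y_alt = g_res + 0.01 * rng.normal(size=N)       # strong marker effect
obs_alt, _, p_alt = permutation_pvalue(y_alt, g_res, n_perm=999)
```

With a strong marker effect the observed statistic should exceed essentially all permuted ones, so `p_alt` sits near its minimum possible value of `1 / (1 + n_perm)`.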