# Statistical Power

## Concept

A quick review of the possible outcomes of a statistical test:

- **Type I error**: $H_0$ is true, but we reject $H_0$. Its probability is the significance level $\alpha$ (the threshold the p-value is compared against, not the p-value itself). **False positive**.
- **Type II error**: $H_0$ is false, but we don't reject $H_0$. Its probability is $\beta$. **False negative**.
- **True negative**: $H_0$ is true, and we don't reject $H_0$. Its probability is $1 - \alpha$.
- **True positive**: $H_0$ is false, and we reject $H_0$. Its probability is $1 - \beta$. **This is statistical power**.

**Statistical power** is

- P(reject $H_0$ | $H_0$ is false).
- The probability of finding an effect when it is really there.
- Expressed as a probability from 0% to 100%.
- Something we want to be **as high as possible**.
- The area under the $H_A$ distribution **beyond the critical value** set by the significance level $\alpha$.
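The four outcomes above can be checked empirically. The following is a minimal simulation sketch, assuming a one-sided, one-sample z-test with known $\sigma$; the function name `rejection_rate` and all parameter values (true mean 0.5, $n = 25$, 4000 trials) are illustrative choices, not taken from any particular study.

```python
import math
import random
from statistics import NormalDist


def rejection_rate(true_mean, mu0=0.0, sigma=1.0, n=25, alpha=0.05,
                   trials=4000, seed=0):
    """Fraction of simulated one-sided z-tests that reject H0: mean == mu0."""
    rng = random.Random(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha)  # one-sided critical value
    standard_error = sigma / math.sqrt(n)
    rejections = 0
    for _ in range(trials):
        # Draw one sample of size n and compute its z statistic.
        sample_mean = sum(rng.gauss(true_mean, sigma) for _ in range(n)) / n
        z = (sample_mean - mu0) / standard_error
        if z > z_crit:
            rejections += 1
    return rejections / trials


# H0 true: every rejection is a false positive, so the rate should be near alpha.
type_i_rate = rejection_rate(true_mean=0.0)
# H0 false (hypothetical true mean of 0.5): every rejection is a true positive.
power = rejection_rate(true_mean=0.5)

print(f"Type I error rate (alpha) ~ {type_i_rate:.3f}")
print(f"Power (1 - beta)          ~ {power:.3f}")
print(f"Type II error rate (beta) ~ {1 - power:.3f}")
```

Under these assumptions the first rate should land close to 5% and the second close to 80%, up to simulation noise.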

Statistical power increases when we

- Increase the sample size
- Increase the effect size
- Raise $\alpha$, meaning use a looser significance level like 10% instead of 5% (but this is not a good idea)

Statistical power decreases when

- The data has larger variability
- We lower $\alpha$, meaning use a stricter significance level like 1% instead of 5%
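These directions can be verified numerically. The sketch below assumes a one-sided, one-sample z-test with known $\sigma$ and uses the analytic power of that test; `z_test_power` and the baseline values are hypothetical, chosen only to make the comparison visible.

```python
import math
from statistics import NormalDist


def z_test_power(effect, sigma, n, alpha=0.05):
    """Power of a one-sided, one-sample z-test (sketch; known sigma assumed)."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha)     # rejection threshold under H0
    shift = effect * math.sqrt(n) / sigma      # how far H_A sits from H0, in standard errors
    return 1 - std_normal.cdf(z_crit - shift)  # area of H_A beyond the threshold


baseline = z_test_power(effect=0.5, sigma=1.0, n=25, alpha=0.05)
scenarios = {
    "larger sample (n=50)": z_test_power(0.5, 1.0, 50, 0.05),
    "larger effect (0.8)": z_test_power(0.8, 1.0, 25, 0.05),
    "looser alpha (0.10)": z_test_power(0.5, 1.0, 25, 0.10),
    "more variability (sigma=2)": z_test_power(0.5, 2.0, 25, 0.05),
    "stricter alpha (0.01)": z_test_power(0.5, 1.0, 25, 0.01),
}

print(f"baseline: {baseline:.3f}")
for name, value in scenarios.items():
    direction = "up" if value > baseline else "down"
    print(f"{name}: {value:.3f} ({direction})")
```

The first three scenarios should move power up relative to the baseline, and the last two should move it down.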

The problem with power is

- Increasing $\alpha$ can increase statistical power, but it also increases the **Type I error** rate, because that rate is $\alpha$.
- Some factors influencing statistical power are often out of our control, for example the achievable sample size and the variability in the given data.

## Compute statistical power

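For a one-sample z-test with known $\sigma$, the test statistic is

$$
z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}
$$

Statistical power increases with $z$, although it is not strictly proportional to it. Here $\bar{x} - \mu_0$ is the **effect size** (in raw units) and $\sigma / \sqrt{n}$ is the **standard error**. Rearranging,

$$
z = \frac{(\bar{x} - \mu_0) \sqrt{n}}{\sigma}
$$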

This tells us that statistical power increases when

- The effect size $\bar{x} - \mu_0$ gets bigger
- The sample size $n$ gets bigger (it enters through $\sqrt{n}$)
- The variability of data $\sigma$ gets smaller

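With this $z$ (where $\bar{x}$ stands in for the mean assumed under $H_A$), and taking a one-sided test at significance level $\alpha$ with critical value $z_{1-\alpha}$, statistical power $1 - \beta$ is

$$
1 - \beta = P(Z > z_{1-\alpha} - z) = \Phi(z - z_{1-\alpha})
$$

where $Z$ is a standard normal variable and $\Phi$ is its cumulative distribution function.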
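As a worked example with made-up numbers, assume an effect size $\bar{x} - \mu_0 = 0.5$, $\sigma = 1$, $n = 25$, and a one-sided test at $\alpha = 0.05$, so that power is the standard normal probability above $z_{1-\alpha} - z$:

$$
z = \frac{0.5 \sqrt{25}}{1} = 2.5, \qquad z_{1-\alpha} = z_{0.95} \approx 1.645
$$

$$
1 - \beta = P(Z > 1.645 - 2.5) = \Phi(0.855) \approx 0.80
$$

So under these assumptions the test detects the effect about 80% of the time.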

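We can find the sample size by solving the formula for $z$ for $n$

$$
n = \left(\frac{z \sigma}{\bar{x} - \mu_0}\right)^2
$$

To reach power $1 - \beta$ in a one-sided test at level $\alpha$, $z$ must equal $z_{1-\alpha} + z_{1-\beta}$, the sum of the corresponding standard normal quantiles.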
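A minimal sketch of this sample-size calculation, assuming a one-sided z-test with known $\sigma$ and taking $z = z_{1-\alpha} + z_{1-\beta}$ as the value needed to hit the target power. The function name `required_sample_size` and the inputs are hypothetical.

```python
import math
from statistics import NormalDist


def required_sample_size(effect, sigma, alpha=0.05, power=0.80):
    """Smallest n reaching the target power in a one-sided z-test (sketch)."""
    std_normal = NormalDist()
    z_alpha = std_normal.inv_cdf(1 - alpha)  # critical value for the test
    z_power = std_normal.inv_cdf(power)      # quantile for the desired power
    n = ((z_alpha + z_power) * sigma / effect) ** 2
    return math.ceil(n)                      # round up to a whole observation


# Effect size and sigma would come from prior studies or pilot data.
print(required_sample_size(effect=0.5, sigma=1.0))   # 25
print(required_sample_size(effect=0.25, sigma=1.0))  # 99
```

Halving the effect size roughly quadruples the required sample size, because $n$ scales with the inverse square of the effect.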

This formula exposes a difficulty: before collecting data, how can you know $\bar{x}$ or $\sigma$? One answer is **a priori power** analysis: take the effect size and standard deviation from published studies or from pilot data, then compute the required sample size. We can also compute **post-hoc power** from our own study after the fact. But post-hoc power is a direct function of the p-value we already have (the smaller the p-value, the higher the post-hoc power), so computing it provides no new information.
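To illustrate why post-hoc power adds nothing, the sketch below computes it from the p-value alone, assuming a one-sided z-test in which the observed effect is plugged in as if it were the true effect. The function name `observed_power` is hypothetical.

```python
from statistics import NormalDist


def observed_power(p_value, alpha=0.05):
    """Post-hoc power of a one-sided z-test, computed from the p-value only."""
    std_normal = NormalDist()
    z_observed = std_normal.inv_cdf(1 - p_value)  # z statistic implied by the p-value
    z_crit = std_normal.inv_cdf(1 - alpha)        # critical value of the test
    return std_normal.cdf(z_observed - z_crit)


for p in (0.001, 0.01, 0.05, 0.20, 0.50):
    print(f"p = {p:<5} -> post-hoc power = {observed_power(p):.3f}")
```

No sample size, effect size, or $\sigma$ is needed as input: the p-value and $\alpha$ determine the result completely. In particular, a p-value exactly equal to $\alpha$ always gives a post-hoc power of 50% under these assumptions.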

