# Hypothesis Testing

In this session we will take a look at some basic hypothesis tests and how we can carry them out with pandas data.

---
## Definitions

### Null and Alternative Hypotheses

The *null hypothesis* $H_0$ and *alternative hypothesis* $H_1$ are always two rival hypotheses, e.g.
* $H_0$: $\mu = 0$
* $H_1$: $\mu \ne 0$

The null hypothesis is the one that can be modelled: it fixes the parameter values (e.g. $\mu = 0$), so we know how the data should behave if it is true.

### Test statistic
A quantity calculated from the sample, used to measure how far the data depart from what $H_0$ predicts.

### p-value
The probability of obtaining a test statistic at least as extreme as the one actually observed, assuming that the null hypothesis is true, e.g. $p = 0.03$.

The smaller the p-value is, the more unlikely the observation would be to occur if $H_0$ were true.
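
As a minimal sketch of how a p-value follows from a test statistic, the example below runs a two-sided z-test of $H_0$: $\mu = 0$ against $H_1$: $\mu \ne 0$. The choice of a z-test, the known standard deviation `sigma = 1.0`, the function name `two_sided_z_test`, and the sample values are all assumptions made for illustration; they are not taken from any real data set.

```python
from math import sqrt
from statistics import NormalDist, mean


def two_sided_z_test(sample, mu0, sigma):
    """Return (z, p) for H0: mu = mu0 against H1: mu != mu0.

    Assumes the population standard deviation `sigma` is known.
    """
    n = len(sample)
    # Test statistic: distance of the sample mean from mu0, in standard errors.
    z = (mean(sample) - mu0) / (sigma / sqrt(n))
    # p-value: probability, under H0, of a statistic at least as far from 0 as |z|.
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p


# Made-up measurements, for illustration only.
sample = [0.8, -0.3, 1.1, 0.4, 0.9, -0.1, 0.6, 1.3, 0.2, 0.7]
z, p = two_sided_z_test(sample, mu0=0.0, sigma=1.0)
print(f"z = {z:.3f}, p = {p:.4f}")
```

Values stored in a pandas column can be passed in the same way after converting them to a plain list.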

### Significance level
The significance level, $\alpha$, is the threshold against which the p-value is compared, and it must be selected before the hypothesis test is carried out.

We will *reject $H_0$* when $p < \alpha$. 

From the definition of the p-value, $\alpha$ is the probability of incorrectly rejecting $H_0$ if it is true. This is known as the probability of a *Type-I error*.

By choosing a smaller $\alpha$, we can specify a more conservative test.
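
The link between $\alpha$ and the Type-I error rate can be checked with a small simulation. The sketch below repeatedly draws samples for which $H_0$ really is true and counts how often a two-sided z-test rejects it. The test, the sample size, the number of trials, and the function name `type_one_error_rate` are illustrative assumptions.

```python
import random
from math import sqrt
from statistics import NormalDist


def type_one_error_rate(alpha, n_trials=4000, n=20, seed=0):
    """Estimate how often a true H0 (mu = 0, sigma = 1) is rejected at level alpha."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    standard_normal = NormalDist()
    rejections = 0
    for _ in range(n_trials):
        # Draw a sample from a population in which H0 holds.
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        z = (sum(sample) / n) * sqrt(n)  # sigma = 1, mu0 = 0
        p = 2 * (1 - standard_normal.cdf(abs(z)))
        if p < alpha:
            rejections += 1  # H0 is true, so this is a Type-I error
    return rejections / n_trials


rate_05 = type_one_error_rate(0.05)
rate_01 = type_one_error_rate(0.01)
print(f"alpha = 0.05: rejected in {rate_05:.3f} of trials")
print(f"alpha = 0.01: rejected in {rate_01:.3f} of trials")
```

The estimated rejection rates should land close to the chosen $\alpha$ values, and the smaller $\alpha$ rejects the true $H_0$ less often.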


---
## Hypothesis testing procedure

There are three stages:

### Planning
1. Propose a research question.
2. Formulate the null hypothesis $H_0$ and alternative hypothesis $H_1$.
3. Choose an appropriate statistical test.
4. Choose an appropriate significance level, $\alpha$.

If possible, it is considered best practice to formulate your hypotheses and choose your significance level **before** looking at the data, because looking at the data might influence your choices and therefore invalidate your hypothesis test.

However, we often need to use data exploration to help us formulate interesting hypotheses. One way to deal with this problem might be to explore a small sample of the whole data set, then test the arising hypotheses on the remainder of the data.
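
One possible way to set aside an exploration sample is sketched below. The 20% fraction, the fixed seed, and the helper name `split_for_exploration` are assumptions chosen for illustration; the records here are just placeholder integers.

```python
import random


def split_for_exploration(records, explore_fraction=0.2, seed=42):
    """Randomly split records into an exploration set and a held-out test set."""
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    shuffled = list(records)   # copy, so the caller's data is left untouched
    rng.shuffle(shuffled)
    n_explore = int(len(shuffled) * explore_fraction)
    return shuffled[:n_explore], shuffled[n_explore:]


records = list(range(100))  # placeholder for 100 real observations
explore, holdout = split_for_exploration(records)
print(len(explore), len(holdout))  # prints "20 80"
```

Hypotheses would then be formulated using only `explore` and tested on `holdout`. With a pandas `DataFrame`, a similar split can be made with `DataFrame.sample(frac=0.2, random_state=42)` followed by dropping the sampled index from the original frame.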

### Experiment
5. Collect the data! 
6. Verify the data with some visualisation. 
7. If necessary, clean up the data set and remove any unreliable data points.

Real data is often messy, and we must be confident that the data going into the hypothesis test is correct. Sometimes a recorded value is clearly incorrect and must be discarded (e.g. a negative value for blood pressure).

Of course, this does not mean that we can choose to throw away any extreme "outlier" values just because they look unusual! They may be important evidence that our current understanding of the process generating the data needs revising. This is what science is all about :)
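
The distinction between an impossible value and an unusual one can be sketched as follows. The readings are made up for illustration, and the rule "blood pressure must be positive" is the only validity check assumed here.

```python
# Made-up systolic blood pressure readings in mmHg, for illustration only.
readings = [118.0, 125.0, -4.0, 131.0, 210.0, 122.0]


def is_physically_possible(value):
    """A blood pressure reading cannot be zero or negative."""
    return value > 0


# Discard only values that cannot be real measurements.
cleaned = [r for r in readings if is_physically_possible(r)]
discarded = [r for r in readings if not is_physically_possible(r)]

print(cleaned)    # [118.0, 125.0, 131.0, 210.0, 122.0]
print(discarded)  # [-4.0]
```

The reading of 210 is extreme but possible, so it stays in the data set; only the negative value is removed.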

### Analysis
8. Calculate the test statistic and the p-value.
9. If $p < \alpha$, reject $H_0$.
10. If $p \ge \alpha$, we report that we "cannot reject the null hypothesis".

In the philosophy of science, it is considered impossible to demonstrate that the null hypothesis is **true**. We can only look for evidence against it. Hence we should be careful about how we phrase the results of our tests.
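
The decision rule and its careful wording can be captured in a small helper. This is only a sketch: the function name `report_result` and the exact phrasing of the messages are assumptions, not a standard API.

```python
def report_result(p_value, alpha):
    """Phrase the outcome of a test without claiming that H0 is true."""
    if not 0 < alpha < 1:
        raise ValueError("alpha must lie strictly between 0 and 1")
    if p_value < alpha:
        return f"p = {p_value:.3f} < alpha = {alpha}: reject H0"
    # Not rejecting H0 is not the same as showing that H0 is true.
    return f"p = {p_value:.3f} >= alpha = {alpha}: cannot reject H0"


print(report_result(0.03, 0.05))  # p = 0.030 < alpha = 0.05: reject H0
print(report_result(0.20, 0.05))  # p = 0.200 >= alpha = 0.05: cannot reject H0
```

Note that the second message never says "accept $H_0$"; it only reports that the evidence against it was insufficient at the chosen level.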



---