**Abstract**. Every quantity that is estimated from the data, such as the mean or the variance of a Gaussian variable, is subject to statistical fluctuations of the measurements. For this reason such quantities are referred to as statistics. If a different sample of measurements is collected, statistical fluctuations will certainly give rise to a different set of measurements, even if the experiments are performed under the same conditions. Measuring the same statistic on different data samples determines the sampling distribution of the statistic, which describes the expected range of values for that quantity. In this blog post we derive the distribution of a few fundamental statistics that play a central role in data analysis, such as the $\chi^2$ statistic. The distribution of each statistic can be used for a variety of tests, including the acceptance or rejection of the fit to a model.

Hypothesis testing is the process that establishes whether the measurement of a given statistic, such as the sample mean, is consistent with its theoretical distribution. The process requires considerable care in defining the hypothesis to test and in drawing conclusions. Hypothesis tests can be divided into two main schools of thought: Bayesian inference and statistical inference (the Neyman-Pearson method) [3].

A hypothesis $H$ to be tested must be formulated in the following form:
* $H_0$: the statement about the phenomenon is false
* $H$: the statement about the phenomenon is true

# Bayesian Inference

The problem can be formulated in the following way: we want to know the conditional probability that a hypothesis is true, given a certain outcome $O$ of an experiment or measurement:

$$P(H|O)=\dfrac{P(O|H)P(H)}{P(O|H)P(H)+P(O|H_0)P(H_0)}$$

where
* $P(H_0)$ is the prior probability of the null hypothesis being true
* $P(H)$ is the prior probability of the alternative hypothesis being true
* $P(*|O)$ is the probability of the respective hypothesis ($H$ or $H_0$) given the outcome $O$

After fixing a critical value $\alpha$, one checks whether the probability $P(H|O)$ is greater or smaller than $\alpha$. The hypothesis is accepted if $P(H|O)\geq\alpha$.

**Example**. Consider a coin that has been tossed 100 times. Given that the number of tails is 70, is this coin fair?

```python
# Todo: Add coin example here
```
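One possible way to work through the coin example with the Bayes formula above is sketched below. It is not the author's intended solution: it assumes equal priors $P(H)=P(H_0)=1/2$, takes $H$ to be "the coin is fair", and models $H_0$ ("the coin is not fair") with a uniform prior on the unknown tail probability, in which case the likelihood of any outcome is $1/(n+1)$. The function name `posterior_fair` is hypothetical.

```python
from math import comb


def posterior_fair(n_tosses, n_tails, prior_fair=0.5):
    """Posterior probability P(H|O) that the coin is fair, via Bayes' theorem."""
    # Likelihood P(O|H) under H (fair coin): binomial with p = 1/2.
    like_fair = comb(n_tosses, n_tails) * 0.5 ** n_tosses
    # Likelihood P(O|H_0) under H_0 (not fair). ASSUMPTION: the tail
    # probability is uniform on [0, 1]; the binomial likelihood then
    # integrates to 1 / (n + 1) for any number of tails.
    like_biased = 1.0 / (n_tosses + 1)
    prior_biased = 1.0 - prior_fair
    evidence = like_fair * prior_fair + like_biased * prior_biased
    return like_fair * prior_fair / evidence


posterior = posterior_fair(100, 70)
print(f"P(fair | 70 tails in 100 tosses) = {posterior:.4f}")
```

Under these assumptions the posterior probability of a fair coin comes out well below 1%, so the hypothesis would be rejected for any critical value $\alpha$ above that. A different model for the biased coin or different priors would change the number.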

# Statistical Inference

As described above, hypothesis testing establishes whether the measurement of a given statistic is consistent with its theoretical distribution. The method can be divided into the following steps.

1. Define the hypothesis to test. For example, the test may decide whether the parameter under test differs from a given value. The statement to be tested is the null hypothesis.

2. Determine the statistic to use for the null hypothesis. The choice of statistic means that we are in a position to use the theoretical distribution function for that statistic to tell whether the actual measurements are consistent with its expected distribution, according to the null hypothesis.

3. Determine the probability or confidence level for the agreement between the statistic and its expected distribution under the null hypothesis. This confidence level defines a range of values of the statistic that are consistent with its expected distribution. This range is called the *acceptable region* for the statistic. Values of the statistic outside of the acceptable region define the *rejection region*.
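The steps above can be sketched for one simple case. The example below assumes Gaussian measurements with a known standard deviation, so that the standardized sample mean follows $\mathcal{N}(0,1)$ under the null hypothesis. The function name `z_test` and the sample values are made up for illustration.

```python
from math import sqrt
from statistics import NormalDist, fmean


def z_test(sample, mu0, sigma, confidence=0.95):
    """Two-sided test of H_0: mean == mu0, for Gaussian data with known sigma."""
    # Step 2: the statistic. Under H_0 it follows N(0, 1).
    z = (fmean(sample) - mu0) / (sigma / sqrt(len(sample)))
    # Step 3: acceptable region [-z_crit, z_crit] that contains a
    # fraction `confidence` of the N(0, 1) probability.
    z_crit = NormalDist().inv_cdf(0.5 + confidence / 2.0)
    return z, z_crit, abs(z) <= z_crit


# Step 1: H_0 states that the mean equals 10.0.
# Hypothetical measurements, invented for this illustration only.
sample = [10.2, 9.8, 10.5, 10.1, 9.9, 10.4, 10.3, 9.7]
z, z_crit, accepted = z_test(sample, mu0=10.0, sigma=0.3)
print(f"z = {z:.3f}, acceptable region = [-{z_crit:.3f}, {z_crit:.3f}], accepted = {accepted}")
```

For this sample the statistic falls inside the acceptable region, so the null hypothesis is not rejected at the 95% confidence level.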

For a statistic described by a $\mathcal{N}(0,1)$, 

**Definition**. Let $x_1,x_2\in\mathbb{R}^n$. The notation $x_1\neq x_2$ means that there exists an index $i\in\mathbb{N}_{\leq n}$ such that $x_{1i}\neq x_{2i}$.

Let $E=\text{span}\{x\}$ denote the sample space. Every hypothesis $H$ consists of an assumption about the probability density [2]
$$ f(x;\lambda)\;, $$
where $\lambda\in\mathbb{R}^n$.

The hypothesis $H_0$ is said to be the null hypothesis if, for a given $\lambda_0\in\mathbb{R}^n$,
$$ H_0(\lambda=\lambda_0)\;. $$
The alternative hypothesis is formulated as
$$ H_1(\lambda\neq\lambda_0)\;. $$

Since the null hypothesis makes a statement about the probability density in the sample space, it also predicts the probability for observing a point $X$. The critical region $S_c$ with a significance level $\alpha$ is given by
$$ P(X\in S_c|H_0)=\alpha $$

In other words, we determine $S_c$ such that the probability to observe a point $X\in E$ within $S_c$ is $\alpha$, under the assumption that $H_0$ is true. If the point $X$ from the sample actually falls into the region $S_c$, then the hypothesis $H_0$ is rejected. Note that the above equation does not define the critical region $S_c$ uniquely.
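The non-uniqueness of $S_c$ can be made concrete with a small sketch. It assumes, purely for illustration, a one-dimensional sample space with an $\mathcal{N}(0,1)$ density under $H_0$ and $\alpha=0.05$, and builds three different regions that all satisfy $P(X\in S_c|H_0)=\alpha$.

```python
from statistics import NormalDist

std_normal = NormalDist()  # ASSUMED density of X under H_0
alpha = 0.05

# Three different critical regions with the same significance level.
upper_cut = std_normal.inv_cdf(1 - alpha)          # S_c = (upper_cut, +inf)
two_sided_cut = std_normal.inv_cdf(1 - alpha / 2)  # S_c = (-inf, -c) U (c, +inf)
central_cut = std_normal.inv_cdf(0.5 + alpha / 2)  # S_c = (-c, c), around zero

# Probability of each region under H_0.
prob_upper = 1 - std_normal.cdf(upper_cut)
prob_two_sided = 2 * (1 - std_normal.cdf(two_sided_cut))
prob_central = std_normal.cdf(central_cut) - std_normal.cdf(-central_cut)

for name, prob in [("upper tail", prob_upper),
                   ("two tails", prob_two_sided),
                   ("central band", prob_central)]:
    print(f"{name:12s} P(X in S_c | H_0) = {prob:.4f}")
```

All three regions have probability $\alpha$ under $H_0$, yet they reject very different observations. The defining equation alone cannot choose between them.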

In practice, the set $E$ is not available due to the lack of knowledge of the population. Instead one constructs a test statistic
$$ T(X) $$
and determines a region $U$ of the variable $T$ such that it corresponds to the critical region $S_c$, i.e.,
$$X\mapsto T(X), S_c(X)\mapsto U(X).$$

The null hypothesis is rejected whenever $T\in U$.
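A minimal simulation can illustrate the mapping from the sample $X$ to a test statistic $T$ and a region $U$. It assumes $n$ Gaussian measurements with known standard deviation and uses the standardized sample mean as $T$. The names `statistic_t` and `in_region_u` are hypothetical, and all parameter values are arbitrary choices.

```python
import random
from math import sqrt
from statistics import NormalDist, fmean

alpha, lambda0, sigma, n = 0.05, 0.0, 1.0, 10
# Boundary of U, chosen so that P(T in U | H_0) = alpha.
cut = NormalDist().inv_cdf(1 - alpha / 2)


def statistic_t(sample):
    """T(X): maps the n-dimensional sample X to a single number."""
    return (fmean(sample) - lambda0) / (sigma / sqrt(len(sample)))


def in_region_u(t):
    """U: the region |T| > cut on the axis of the test statistic."""
    return abs(t) > cut


# Simulate many experiments in which H_0 is true and count rejections.
rng = random.Random(0)
n_experiments = 20000
rejections = sum(
    in_region_u(statistic_t([rng.gauss(lambda0, sigma) for _ in range(n)]))
    for _ in range(n_experiments)
)
rejection_rate = rejections / n_experiments
print(f"fraction of rejections under H_0: {rejection_rate:.4f} (alpha = {alpha})")
```

The fraction of simulated experiments with $T\in U$ should be close to $\alpha$, which is what the correspondence between $U$ and $S_c$ requires.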

The following is from [2]

Related reading: Neyman–Pearson Lemma

## F-Test on Equality of Variances

## Student's Test: Comparison of Means

# Errors

Because of the statistical nature of the sample, it is clearly possible that the null hypothesis could be true, even though it was rejected since $X \in S_c$. The probability for such an error, an error of the first kind, is equal to $\alpha$.

There is a second way to make a wrong decision: one does not reject the hypothesis $H_0$ because $X$ was not in the critical region $S_c$, even though the hypothesis was actually false and an alternative hypothesis $H_1$ was true. This is an error of the second kind. Its probability is
$$P(X\notin S_c|H_1)=\beta\;.$$

This connection with the alternative hypothesis $H_1$ provides us with a method to specify the critical region $S_c$. A test is clearly most reasonable if for a given significance level $\alpha$ the critical region is chosen such that the probability $\beta$ for an error of the second kind is a minimum. The critical region and therefore the test itself naturally depend on the alternative hypothesis under consideration.
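The dependence of $\beta$ on the choice of critical region can be sketched numerically. The example assumes a single observation $X\sim\mathcal{N}(\lambda,1)$, a null hypothesis $\lambda=0$, and, for simplicity, a specific alternative $\lambda=1$ rather than the general $\lambda\neq\lambda_0$. These values are illustrative choices only.

```python
from statistics import NormalDist

alpha = 0.05
h0 = NormalDist(mu=0.0, sigma=1.0)  # density under H_0 (assumed)
h1 = NormalDist(mu=1.0, sigma=1.0)  # density under the ASSUMED alternative

c_one = h0.inv_cdf(1 - alpha)      # cut for a one-sided region
c_two = h0.inv_cdf(1 - alpha / 2)  # cut for the two-sided region

# beta = P(X not in S_c | H_1) for three regions with the same alpha.
beta_upper = h1.cdf(c_one)                      # S_c = (c_one, +inf)
beta_two_sided = h1.cdf(c_two) - h1.cdf(-c_two)  # S_c = |X| > c_two
beta_lower = 1 - h1.cdf(-c_one)                 # S_c = (-inf, -c_one)

for name, beta in [("upper tail", beta_upper),
                   ("two tails", beta_two_sided),
                   ("lower tail", beta_lower)]:
    print(f"{name:10s} beta = {beta:.3f}")
```

For this alternative the upper-tail region gives the smallest $\beta$ of the three. If the alternative were $\lambda=-1$ instead, the lower-tail region would be preferable, which shows how the test depends on the alternative hypothesis.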

Once the critical region has been determined, we can consider the probability for rejecting the null hypothesis as a function of the "true" hypothesis, or rather as a function of the parameters that describe it:
$$ M(S_c,\lambda)=P(X\in S_c|H)=P(X\in S_c|\lambda)\;.$$
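As an illustration, $M(S_c,\lambda)$ can be evaluated in closed form for a simple case. The sketch assumes a single observation $X\sim\mathcal{N}(\lambda,\sigma^2)$ and the two-sided critical region $|X-\lambda_0|>c$. The function name `power` and the default values are hypothetical.

```python
from statistics import NormalDist


def power(lam, alpha=0.05, lambda0=0.0, sigma=1.0):
    """M(S_c, lambda) for the two-sided region |X - lambda0| > c."""
    # c is fixed by the significance level under H_0.
    c = NormalDist(lambda0, sigma).inv_cdf(1 - alpha / 2) - lambda0
    # Probability of landing in S_c when the true parameter is `lam`.
    truth = NormalDist(lam, sigma)
    return truth.cdf(lambda0 - c) + (1 - truth.cdf(lambda0 + c))


for lam in (0.0, 0.5, 1.0, 2.0, 3.0):
    print(f"lambda = {lam:.1f}  M = {power(lam):.3f}")
```

At $\lambda=\lambda_0$ the function equals $\alpha$, the probability of an error of the first kind. For any other $\lambda$ it equals $1-\beta$, so it grows toward 1 as the true parameter moves away from $\lambda_0$.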

# Import

## Modules

# References

[1] M. Bonamente, "Statistics and Analysis of Scientific Data", Springer, 2017

[2] S. Brandt, "Data Analysis", Springer, 2014

[3] L.-G. Johansson, "Philosophy of Science for Scientists", Springer, 2016