# Statistical Hypothesis

**Definition 18.1.**
* A statistical hypothesis H is a conjecture about the distribution $f(x; \theta)$ of a population X. 
* This conjecture is usually about the parameter $\theta$ when one is dealing with **parametric statistics**; otherwise it is about the form of the distribution of X.

**Definition 18.2.**
* A hypothesis H is said to be a **simple hypothesis** if H
completely specifies the density $f(x; \theta)$ of the population; 
* otherwise it is called a **composite hypothesis**.

**Definition 18.3.**
* The hypothesis to be tested is called the **null hypothesis**.
* The negation of the null hypothesis is called the **alternative hypothesis**. 
* The null and alternative hypotheses are denoted by $H_0$ and $H_\alpha$, respectively.

**Definition: Critical region**
* A critical region $C$ is a subset of the sample space of $(X_1, X_2, ..., X_n)$ in which we accept the alternative hypothesis to be true.  
* That is, when we obtain a sample from a sequence of random variables $(X_1, X_2, ..., X_n)$, 
  * If the outcome of $(X_1, X_2, ..., X_n)$ turns out to be an element of $C$, then we decide to accept $H_\alpha$; otherwise we accept $H_0$.
  * We usually use a test statistic $W(X_1, X_2, ..., X_n)$ to determine whether the outcome is in $C$.

**Remark: hypothesis test**
* A hypothesis test is a rule that tells us for which sample values we should decide to accept $H_0$ as true and for which sample values we should decide to reject $H_0$ and accept $H_\alpha$ as true. 
* Typically, a hypothesis test is specified in terms of a test statistic $W$. 
  * For example, a test might specify that $H_0$ is to be rejected if the sample total $\sum_{k=1}^n X_k$ is less than 8. In this case the critical region $C$ is the set $\{(x_1, x_2, ..., x_n)|\: x_1 + x_2 + \cdots + x_n < 8\}$.
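
The rule in this example can be written as a small predicate. This is only a sketch: the function name `in_critical_region`, the `threshold` argument, and the two sample lists are illustrative assumptions, not part of the source.

```python
def in_critical_region(sample, threshold=8):
    """Return True when the sample falls in C, i.e. when H_0 is rejected."""
    # Test statistic W: the sample total.
    w = sum(sample)
    # C = {(x_1, ..., x_n) : x_1 + ... + x_n < threshold}
    return w < threshold


# Hypothetical samples, chosen only for illustration.
reject = in_critical_region([1, 2, 3])   # total 6 is below 8, so the sample is in C
accept = in_critical_region([3, 3, 3])   # total 9 is not below 8, so it is outside C
```

Expressing the test through a statistic $W$ means the decision depends on the sample only through one number, which is why the critical region can be described by a single inequality.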

**Definition 18.4.**
* A hypothesis test is an ordered sequence $(X_1, X_2, ..., X_n; H_0, H_\alpha; C)$, where
  * $X_1, X_2, ..., X_n$ is a random sample from a population X with the probability density function $f(x; \theta)$, 
  * $H_0$ and $H_\alpha$ are hypotheses concerning the
parameter $\theta$ in $f(x; \theta)$, and 
  * C is a *Borel set* in $\mathbf{R}^n$.

**Remark 18.**
* Let X be some set, and let ${\mathcal {P}}(X)$ represent its power set. Then a subset $\Sigma \subseteq {\mathcal {P}}(X)$ is called a σ-algebra if it satisfies the following three properties:
  * X is in Σ, and X is considered to be the universal set in the following context.
  * Σ is closed under complementation: If A is in Σ, then so is its complement, X \ A.
  * Σ is closed under countable unions: If A1, A2, A3, ... are in Σ, then so is A = A1 ∪ A2 ∪ A3 ∪ … .
* From these properties, it follows that the σ-algebra is also closed under countable intersections (by applying De Morgan's laws).
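
For a finite universal set the three properties can be checked mechanically, since countable unions reduce to finite unions. The sketch below assumes a hypothetical helper `is_sigma_algebra` and small example families chosen for illustration.

```python
from itertools import combinations


def is_sigma_algebra(universe, family):
    """Check the three sigma-algebra properties on a finite universe."""
    sets = {frozenset(s) for s in family}
    whole = frozenset(universe)
    # Property 1: the universal set belongs to the family.
    if whole not in sets:
        return False
    # Property 2: closed under complementation.
    if any(whole - a not in sets for a in sets):
        return False
    # Property 3: closed under unions (pairwise closure gives all finite unions).
    return all(a | b in sets for a, b in combinations(sets, 2))


universe = {1, 2, 3, 4}
good = is_sigma_algebra(universe, [set(), {1, 2}, {3, 4}, {1, 2, 3, 4}])
bad = is_sigma_algebra(universe, [set(), {1}, {1, 2, 3, 4}])  # complement of {1} is missing
```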

**Remark**
* Borel sets are the members of the smallest σ-algebra containing all open sets in $\mathcal{R}^n$.
* Two examples of Borel sets in $\mathcal{R}^n$ are sets obtained by
  * countable union of closed intervals in $\mathcal{R}^n$, and 
  * countable intersection of open sets in $\mathcal{R}^n$.

## Methods of finding tests

There are several methods to find test procedures and they are: 
1. Likelihood Ratio Tests, 
2. Invariant Tests, 
3. Bayesian Tests, and 
4. Union-Intersection and Intersection-Union Tests.

## Likelihood Ratio Tests

**Definition 18.5.**
* The likelihood ratio test statistic for testing the simple
null hypothesis $H_0 : \theta \in \Omega_0$ against the composite alternative hypothesis $H_\alpha : \theta \notin \Omega_0$ based on a set of random sample data $x_1, x_2, ..., x_n$ is defined as
$$\displaystyle W(x_1, x_2, ..., x_n) = 
\frac{\max\limits_{\theta \in \Omega_0}L(\theta, x_1, x_2, ..., x_n)}
{\max\limits_{\theta \in \Omega}L(\theta, x_1, x_2, ..., x_n) }
$$
* where $\Omega$ denotes the parameter space, and $L(\theta, x_1, x_2, ..., x_n)$ denotes the likelihood function of the random sample, that is
$$ L(\theta, x_1, x_2, ..., x_n) = \prod _{i=1}^n f(x_i; \theta)$$
* If $H_0 : \theta = \theta_0$ and $H_\alpha : \theta = \theta_\alpha$ are both simple hypotheses, then the likelihood ratio test statistic is defined as
$$W(x_1, x_2, ..., x_n) = \frac{L(\theta_0, x_1, x_2, ..., x_n)}
{L(\theta_\alpha, x_1, x_2, ..., x_n)}.$$

**Remark**
* A likelihood ratio test (LRT) is any test that has a critical region $C$ (i.e., rejection region) of the form
$$ C = \{(x_1, x_2, ..., x_n) | W(x_1, x_2, ..., x_n) \le k\} ,$$
where $k$ is a number in the unit interval $[0, 1]$.

**Example 18.1.**
* Let $X_1, X_2, X_3$ denote three independent observations from
a distribution with density 
$$ \begin{align}f(x; \theta) = \begin{cases}
(1 + \theta) x^\theta & \text{ if } 0 \le x \le 1 \\
0 & \text{otherwise.}
\end{cases} \end{align}$$
What is the form of the LRT critical region for testing $H_0 : \theta = 1$ versus $H_\alpha : \theta = 2$?
* Answer: The critical region is given by:
$$\begin{align}
C &= \Bigl\{(x_1, x_2, x_3) \in \mathcal{R}^3 \Bigl|
  \frac{L(\theta_0, x_1, x_2, x_3)}{L(\theta_\alpha, x_1, x_2, x_3)} \le k \Bigr\} \\
  &= \Bigl\{(x_1, x_2, x_3) \in \mathcal{R}^3 \Bigl|
  \frac{(1 + \theta_0)^3 \prod^3_{i=1} x_i^{\theta_0} }
  {(1 + \theta_\alpha)^3 \prod^3_{i=1} x_i^{\theta_\alpha}} \le k \Bigr\}  \\
  &= \Bigl\{(x_1, x_2, x_3) \in \mathcal{R}^3 \Bigl|
  \frac{8x_1x_2x_3}{27x^2_1 x^2_2 x^2_3} \le k \Bigr\} \\
  &= \Bigl\{(x_1, x_2, x_3) \in \mathcal{R}^3 \Bigl| 
  \frac{1}{x_1x_2x_3} \le \frac{27}{8} k \Bigr\} \\
  &= \Bigl\{(x_1, x_2, x_3) \in \mathcal{R}^3 \Bigl| x_1x_2x_3 \ge a\Bigr\},
\end{align}$$
where $a = \frac{8}{27k}$ is some constant. Hence the likelihood ratio test is of the form:
“Reject $H_0$ if $\prod_{i=1}^3 X_i \ge a$.”
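
A quick numerical check of this reduction is sketched below. The helper names and the two sample points are assumptions made for illustration, and the observations are taken to be strictly positive so that the ratio is defined.

```python
import math


def likelihood(theta, xs):
    """L(theta; x) = prod (1 + theta) * x_i**theta, for 0 <= x_i <= 1."""
    return math.prod((1 + theta) * x ** theta for x in xs)


def power_density_lr(xs, theta0=1, theta_a=2):
    """Likelihood ratio statistic W = L(theta0) / L(theta_a)."""
    return likelihood(theta0, xs) / likelihood(theta_a, xs)


def power_density_closed_form(xs):
    """The simplified ratio 8 / (27 * x_1 * x_2 * x_3) from the derivation."""
    return 8 / (27 * math.prod(xs))


xs_small = (0.2, 0.5, 0.4)    # hypothetical sample with a small product
xs_large = (0.9, 0.8, 0.95)   # hypothetical sample with a large product
w_small = power_density_lr(xs_small)
w_large = power_density_lr(xs_large)
```

Because the statistic decreases as the product grows, the condition $W \le k$ corresponds to a large product, which matches the form "reject when $\prod X_i \ge a$".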

**Example 18.2.**
* Let $X_1, X_2, ..., X_{12}$ be a random sample from a normal
population with mean zero and variance $\sigma^2$. What is the form of the LRT
critical region for testing the null hypothesis $H_0 : \sigma^2 = 10$ versus $H_\alpha : \sigma^2 = 5$?
* Answer: 
$$\begin{align}
C &= \Bigl\{(x_1, x_2, ...,x_{12}) \in \mathcal{R}^{12} \Bigl|
\frac{L(\sigma_0^2, x_1, x_2, ..., x_{12})}
{L(\sigma_\alpha^2, x_1, x_2, ..., x_{12})} \le k \Bigr\} \\
  &= \Bigl\{(x_1, x_2, ...,x_{12}) \in \mathcal{R}^{12} \Bigl|
  \prod_{i=1}^{12}\frac{\frac{1}{\sqrt{2\pi\sigma_0^2}} e^{-\frac{1}{2}(\frac{x_i}{\sigma_0})^2}}
  {\frac{1}{\sqrt{2\pi\sigma_\alpha^2}} e^{-\frac{1}{2}(\frac{x_i}{\sigma_\alpha})^2}}    \le k\Bigr\} \\
  &= \Bigl\{(x_1, x_2, ...,x_{12}) \in \mathcal{R}^{12} \Bigl|
  \Bigl(\frac{1}{2}\Bigr)^6 e^{\frac{1}{20} \sum_{i=1}^{12} x_i^2}  \le k \Bigr\} \\
  &= \Bigl\{(x_1, x_2, ...,x_{12}) \in \mathcal{R}^{12} \Bigl|
  \ \sum_{i=1}^{12} x_i^2  \le a \Bigr\}
\end{align}$$
where $a$ is some constant.
Hence the likelihood ratio test is of the form:
“Reject $H_0$ if $\sum^{12}_{i=1} X^2_i \le a$.”
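
The simplification to $\bigl(\frac{1}{2}\bigr)^6 e^{\frac{1}{20}\sum x_i^2}$ can be checked numerically. The sketch below uses hypothetical helper names and an arbitrary sample of 12 values; none of them come from the source.

```python
import math


def normal_likelihood(var, xs):
    """Likelihood of a N(0, var) sample."""
    return math.prod(
        math.exp(-x * x / (2 * var)) / math.sqrt(2 * math.pi * var) for x in xs
    )


def variance_lr(xs, var0=10.0, var_a=5.0):
    """Likelihood ratio statistic W = L(var0) / L(var_a)."""
    return normal_likelihood(var0, xs) / normal_likelihood(var_a, xs)


def variance_lr_closed_form(xs):
    """(1/2)**6 * exp(sum(x_i**2) / 20), valid for n = 12, var0 = 10, var_a = 5."""
    return 0.5 ** 6 * math.exp(sum(x * x for x in xs) / 20)


# Arbitrary illustrative sample of size 12.
sample = [1.0, -2.0, 0.5, 3.0, -1.5, 2.5, 0.0, -0.5, 1.5, -3.0, 2.0, -1.0]
w = variance_lr(sample)
```

Since the statistic increases with $\sum x_i^2$, small values of $W$ correspond to a small sum of squares, in line with a smaller variance under the alternative.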

**Example 18.3.**
* Suppose that X is a random variable about which the hypothesis $H_0 : X \sim UNIF(0, 1)$ against $H_\alpha : X \sim N(0, 1)$ is to be tested.
What is the form of the LRT critical region based on one observation of X?
* Answer: In this example, $L_0(x) = 1$ for $0 < x < 1$ and 
$L_\alpha(x) = \frac{1}{\sqrt{2\pi}} e^{-\frac{1}{2}x^2}$. By the above
definition, the form of the critical region is given by
$$\begin{align}
C &= \Bigl\{ x \in \mathcal{R} \Bigr|\frac{L_0(x)}{L_\alpha(x)} \le k \Bigr\} \\
  &= \Bigl\{ x \in \mathcal{R} \Bigr| 
  \sqrt{2\pi} e^{\frac{1}{2}x^2} \le k \Bigr\} \\
  &= \Bigl\{ x \in \mathcal{R} \Bigr| 
  x^2 \le 2\ln \Bigl( \frac{k}{\sqrt{2\pi}} \Bigr) \Bigr\} \\
  &= \Bigl\{ x \in \mathcal{R} \Bigr| x \le a \Bigr\}
\end{align}$$
where $a$ is some constant. The last step uses $x > 0$ on the support of $UNIF(0, 1)$, where $x^2 \le a^2$ is equivalent to $x \le a$. Hence the likelihood ratio test is of the form:
“Reject $H_0$ if $X \le a$.”

### If the hypothesis is composite

If the alternative hypothesis is composite, then the following algorithm can be used to construct the likelihood ratio critical region:
1. Find the likelihood function $L(\theta, x_1, x_2, ..., x_n)$ for the given sample.
2. Find $L(\theta_0, x_1, x_2, ..., x_n).$
3. Find $\max\limits_{\theta\in\Omega}L(\theta, x_1, x_2, ..., x_n)$.
4. Rewrite $\frac{L(\theta_0,x_1,x_2,...,x_n)}
{\max\limits_{\theta\in\Omega}L(\theta, x_1, x_2, ..., x_n)}$ in a “suitable form”.
5. Use step (4) to construct the critical region.

**Example 18.4.**
* Let X be a single observation from a population with probability density
$$f(x; \theta) = \begin{cases}
\frac{\theta^x e^{-\theta}}{x!} & \text{for } x = 0, 1, 2, \ldots \\
 0 & \text{otherwise,}
\end{cases}$$
where $\theta \ge 0$. Find the likelihood ratio critical region for testing the null hypothesis $H_0 : \theta = 2$ against the composite alternative $H_a : \theta \ne 2$.

**Answer:** 
* The likelihood under the null hypothesis, $L(\theta_0, x)$, is given by $L(2, x) = \frac{2^x e^{-2}}{x!}$.
* The next step is to find 
$\max\limits_{\theta\ge0}L(\theta, x)$. For this, we differentiate $L(\theta, x)$, set the derivative to $0$, and solve for $\theta$.
$$ \frac{dL(\theta, x)}{d\theta} 
= \frac{1}{x!}[x\theta^{x-1}e^{-\theta} - \theta^x e^{-\theta}] = 0
$$
We get $\theta = x$, thus
$$ \max\limits_{\theta\ge0}L(\theta, x)
= \frac{x^x e^{-x}}{x!}
$$
Now we perform step (4), and 
$$\frac{L(2,x)}
{\max\limits_{\theta\in\Omega}L(\theta, x)}
= \frac{\frac{2^x e^{-2}}{x!}}{\frac{x^x e^{-x}}{x!}}
= \Bigl(\frac{2e}{x}\Bigr)^x e^{-2}
$$
Thus, the likelihood ratio critical region is given by
$$ C = \Bigl\{ x \in \mathcal{R} \Bigr| \Bigl(\frac{2e}{x}\Bigr)^x e^{-2} \le k \Bigr\} 
  = \Bigl\{ x \in \mathcal{R} \Bigr| \Bigl(\frac{2e}{x}\Bigr)^x \le a \Bigr\}
  $$
where $a$ is some constant. The likelihood ratio test is of the form: “Reject
$H_0$ if $\Bigl(\frac{2e}{x}\Bigr)^x \le a$.”
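
To see which counts this region contains, the statistic can be tabulated. The sketch below is illustrative only: the function name `poisson_lr` is an assumption, and the case $x = 0$ is handled separately because the likelihood $e^{-\theta}$ is then maximized at $\theta = 0$.

```python
import math


def poisson_lr(x, theta0=2.0):
    """W(x) = L(theta0; x) / max_theta L(theta; x) for a single Poisson count x."""
    if x == 0:
        # For x = 0 the likelihood is exp(-theta), whose maximum over theta >= 0 is 1.
        return math.exp(-theta0)
    # The x! terms cancel and the unrestricted maximum is attained at theta = x.
    return (theta0 * math.e / x) ** x * math.exp(-theta0)


# Statistic for the counts 0, 1, ..., 7.
lr_values = {x: poisson_lr(x) for x in range(8)}
```

The statistic equals 1 at $x = 2$ and falls off on both sides, so a region of the form $W \le k$ collects counts far from 2 in either direction, as expected for a two-sided alternative.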


## Evaluating tests (Goodness of tests)

There are several criteria to evaluate the goodness of a test procedure.
Some well known criteria are: 
1. Powerfulness, 
2. Unbiasedness and Invariancy, and 
3. Local Powerfulness. 

In order to examine some of these criteria, we need to define the following terms: **error probabilities, power functions, type I error, and type II error.**

In a hypothesis test, depending on whether the null hypothesis is true and whether we accept it, there are four possible situations:

  Situations |$H_0$ is true |$H_0$ is false
---|---|---
Accept $H_0$ |Correct Decision |Type II Error (false acceptance)
Reject $H_0$ |Type I Error (false rejection)  |Correct Decision


**Definition 18.6.**
* Let $H_0 : \theta \in \Omega_0$ and $H_\alpha : \theta \notin \Omega_0$ be the null and alternative hypotheses to be tested based on a random sample $X_1, X_2, ..., X_n$ from a population $X$ with density $f(x; \theta)$, where $\theta$ is a parameter. The **significance level** of the hypothesis test, denoted by $\alpha$, is defined as
$$\alpha = P (\text{Type I Error}).$$
Thus, by the significance level of a hypothesis test we mean the probability of
rejecting a true null hypothesis, that is
$$\alpha = P (\text{Reject }H_0 \mid H_0 \text{ is true}).$$

**Definition 18.7.**
* Let $H_0 : \theta \in \Omega_0$ and $H_\alpha : \theta \notin \Omega_0$ be the null and alternative hypotheses to be tested based on a random sample $X_1, X_2, ..., X_n$ from a population $X$ with density $f(x; \theta)$, where $\theta$ is a parameter.  The probability of type II error of the hypothesis test, denoted by $\beta$, is defined as:
$$ \beta = P (\text{Accept } H_0 \mid H_0 \text{ is false}).$$

**Example 18.5.**
* Let $X_1, X_2, ..., X_{20}$ be a random sample from a distribution with probability density function 
$$f(x; p) = \begin{cases}
p^x(1 - p)^{1-x} & \text{if } x = 0, 1 \\
0 & \text{otherwise,}
\end{cases}$$
where $0 < p \le \frac{1}{2}$ is a parameter. The hypothesis $H_0 : p = \frac{1}{2}$ is to be tested against $H_\alpha : p < \frac{1}{2}$. 
* If $H_0$ is rejected when $\sum^{20}_{i=1} X_i \le 6$, then what is the
probability of type I error?

**Answer:**
* Since each observation $X_i \sim BER(p)$, the sum of the observations
$\sum^{20}_{i=1} X_i \sim BIN(20, p)$. 
* The probability of type I error is given by
$$\begin{align}
\alpha &= P(\text{Type I Error}) \\
  &= P (\text{Reject } H_0 | H_0 \text{ is true}) \\
  &= P \Bigl( \sum^{20}_{i=1} X_i \le 6 \Bigm| H_0 \text{ is true} \Bigr)\\
  &= P \Bigl( \sum^{20}_{i=1} X_i \le 6 \Bigm| p = \frac{1}{2}\Bigr) \\
  &= \sum^6_{k=0} \binom{20}{k}\Bigl(\frac{1}{2}\Bigr)^k\Bigl(1-\frac{1}{2}\Bigr)^{20-k} \\
  & = 0.0577  \quad \text{(from binomial table)}
\end{align}$$
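
Instead of a binomial table, the sum can be evaluated directly. This is a minimal sketch; the helper name `binomial_cdf` is an assumption made for illustration.

```python
from math import comb


def binomial_cdf(c, n, p):
    """P(X <= c) for X ~ BIN(n, p), summed term by term."""
    return sum(comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(c + 1))


# Type I error probability: reject when the sum is at most 6, with p = 1/2.
alpha = binomial_cdf(6, 20, 0.5)   # approximately 0.0577
```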

**Example 18.6.**
* Let $p$ represent the proportion of defectives in a manufacturing process. To test $H_0 : p \le \frac{1}{4}$ versus $H_a : p > \frac{1}{4}$, a random sample of
size 5 is taken from the process. 
* If the number of defectives is 4 or more, the null hypothesis is rejected. 
* What is the probability of rejecting $H_0$ if $p = \frac{1}{5}$?

**Answer:**
* Let $X$ denote the number of defectives out of a random sample of
size 5. Then $X$ is a binomial random variable with $n = 5$ and $p = \frac{1}{5}$. 
* Hence, the probability of rejecting $H_0$ is given by
$$\begin{align}
\alpha &= P (\text{Reject } H_0 | H_0 \text{ is true}) \\
  &= P (X \ge 4 | p = \frac{1}{5}) \\
  &= \binom{5}{4}p^4(1-p)^1   +\binom{5}{5} p^5(1-p)^0 \\
  &= \Bigl( \frac{1}{5} \Bigr)^5[20+1] \\
  &= \frac{21}{3125}
\end{align}$$
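
The same probability can be reproduced with exact rational arithmetic. In this sketch the function name `rejection_probability` is an illustrative assumption; the second call evaluates the rule at the boundary value $p = \frac{1}{4}$ of the null hypothesis.

```python
from fractions import Fraction
from math import comb


def rejection_probability(p, n=5, cutoff=4):
    """P(X >= cutoff) for X ~ BIN(n, p); exact when p is a Fraction."""
    return sum(comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(cutoff, n + 1))


prob_at_fifth = rejection_probability(Fraction(1, 5))      # Fraction(21, 3125)
prob_at_boundary = rejection_probability(Fraction(1, 4))   # Fraction(1, 64)
```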

**Example 18.7.**
* A random sample of size 4 is taken from a normal distribution with unknown mean $\mu$ and variance $\sigma^2 > 0$. 
* To test $H_0 : \mu = 0$ against $H_a : \mu < 0$ the following test is used: “Reject $H_0$ if and only if $X_1 + X_2 + X_3 + X_4 < -20$.” 
* Find the value of $\sigma$ so that the significance level of this test will be close to 0.14.

**Answer:**
* Since the significance level $\alpha \approx 0.14$, we have:
$$\begin{align}
0.14 &= P (\text{Type I Error}) \\
  &= P (\text{Reject } H_0 | H_0 \text{ is true}) \\
  &= P (X_1 + X_2 + X_3 + X_4 < -20 \mid H_0 : \mu = 0) \\
  &= P(\overline{X} < -5 \mid H_0 : \mu = 0) \\
  &= P\Bigl(\frac{\overline{X}-0}{\frac{\sigma}{2}} < \frac{-5 -0}{\frac{\sigma}{2}}\Bigr)\\
  &= P\Bigl(Z < -\frac{10}{\sigma}\Bigr) \quad \text{(where $Z$ is a standard normal random variable)}
\end{align}$$
We get from the standard normal table
$$ 1.08 = \frac{10}{\sigma} $$
Therefore
$$ \sigma = 10/1.08 = 9.26$$
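
The table lookup can be replaced by the inverse normal CDF from Python's standard library. This is a sketch under the example's assumptions; the variable names are illustrative.

```python
from statistics import NormalDist

# Lower 14% point of the standard normal distribution (about -1.08).
z = NormalDist().inv_cdf(0.14)

# Solve -10 / sigma = z for sigma.
sigma = -10 / z

# Check: under H_0 the sum of four observations is N(0, 4 * sigma**2),
# so its standard deviation is 2 * sigma.
alpha_check = NormalDist(mu=0, sigma=2 * sigma).cdf(-20)
```

The value of `sigma` agrees with 9.26 to two decimal places; the small difference comes from rounding the table value to 1.08.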

## Skipped before example 18.8

### Probability of a wrong decision
While deciding to accept $H_0$ or $H_a$, we may make a wrong decision. 
* The probability $\gamma$ of a wrong decision can be computed as follows:
$$\begin{align}
\gamma &= P(H_a\text{ accepted and } H_0 \text{ is true}) + P(H_0 \text{ accepted and } H_a \text{ is true}) \\
  &= P(H_a\text{ accepted} | H_0 \text{ is true}) P(H_0 \text{ is true}) 
+ P(H_0 \text{ accepted} |H_a \text{ is true}) P (H_a \text{ is true}) \\
  & = \alpha P (H_0 \text{ is true}) +  \beta P (H_a\text{ is true})\end{align}$$
* In most cases, the probabilities $P(H_0 \text{ is true})$ and $P(H_a \text{ is true})$ are not known. Therefore, it is in general not possible to determine the exact numerical value of the probability $\gamma$ of making a wrong decision. However, since $\gamma$ is a weighted sum of $\alpha$ and $\beta$, and $P(H_0 \text{ is true}) + P(H_a \text{ is true}) = 1$,
we have
$$ \gamma \le \max\{\alpha, \beta\}.$$
A good decision rule (or a good test) is one that yields the smallest $\gamma$.
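
The bound can be illustrated numerically. In the sketch below the error probabilities 0.05 and 0.20 and the prior values are hypothetical numbers chosen only to show that the weighted sum never exceeds the larger of $\alpha$ and $\beta$.

```python
def wrong_decision_probability(alpha, beta, prior_h0):
    """gamma = alpha * P(H_0 is true) + beta * P(H_a is true)."""
    return alpha * prior_h0 + beta * (1 - prior_h0)


# Hypothetical error probabilities and priors, for illustration only.
alpha, beta = 0.05, 0.20
gammas = [wrong_decision_probability(alpha, beta, w) for w in (0.0, 0.25, 0.5, 1.0)]
```

Each entry of `gammas` is a convex combination of `alpha` and `beta`, so it lies between the two values whatever the unknown prior probability is.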

### The power of a test
* The alternative hypothesis is mostly a composite hypothesis. 
* Thus, it is not possible to find a single value for the probability of type II error, $\beta$. 
* For a composite alternative, $\beta$ is a function of $\theta$. That is,
$$\beta: \Omega^c_0 \rightarrow [0, 1].$$
Here $\Omega^c_0$ denotes the complement of the set $\Omega_0$ in the parameter space $\Omega$. 
* In hypothesis testing, instead of $\beta$, one usually considers the **power of the test**, $1-\beta(\theta)$.
  * A small $\beta$ (probability of type II error) is equivalent to a large power of the test.

**Definition 18.8. Power function**
* Let $H_0 : \theta \in \Omega_0$ and $H_a : \theta \notin \Omega_0$ be the null and alternative hypotheses to be tested, based on a random sample $X_1, X_2, ..., X_n$ from a population X with density $f(x; \theta)$, where $\theta$ is a parameter. The power function of the hypothesis test is a function $\pi : \Omega \rightarrow [0, 1]$ defined by
$$
\pi(\theta) = \begin{cases}
P (\text{Type I Error}) & \text{if } H_0 \text{ is true} \\
1 - P (\text{Type II Error}) & \text{if } H_a \text{ is true}.
\end{cases}$$

**Example 18.9.**
* A manufacturing firm needs to test the null hypothesis $H_0$ that the probability $p$ of a defective item is 0.1 or less, against the alternative
hypothesis $H_a : p > 0.1$. 
* The procedure is to select two items at random. 
  * If both are defective, $H_0$ is rejected; 
  * otherwise, a third is selected. If the third item is defective, $H_0$ is rejected. 
  * In all other cases, $H_0$ is accepted.
* What is the power of the test in terms of $p$ (if $H_0$ is true)?

**Answer:**
* Let p be the probability of a defective item. 
* We want to calculate the power of the test at the null hypothesis. 
* The power function of the test is given by
$$ \pi(p) = \begin{cases}
  P(\text{Type I Error}) &\text{if }p \le 0.1 \\
  1-P(\text{Type II Error}) &\text{if }p > 0.1
\end{cases}$$
* Hence, we have
$$\begin{align}
\pi(p) &= P (\text{Reject } H_0 \mid H_0 \text{ is true}) \\
  &= P (\text{Reject } H_0 \mid p) \\
  &= P(\text{first two items are both defective} \mid p) \\
  &\quad + P(\text{at least one of the first two items is not defective and third is} \mid p) \\
  &= p^2 + (1-p)^2 p + \binom{2}{1}p(1-p)p \\
  &= p + p^2 - p^3.
\end{align}$$
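
The closed form $p + p^2 - p^3$ can be cross-checked by enumerating all defective / non-defective outcomes for three items. This is a sketch with a hypothetical function name; the third item is enumerated even when the procedure would stop after two, which does not change the rejection probability.

```python
from itertools import product


def power_by_enumeration(p):
    """Rejection probability of the sequential rule, summed over all outcomes."""
    total = 0.0
    for first, second, third in product([True, False], repeat=3):
        # Probability of this particular outcome for independent items.
        prob = 1.0
        for defective in (first, second, third):
            prob *= p if defective else 1 - p
        # Reject if the first two are both defective, or else if the third is.
        if (first and second) or third:
            total += prob
    return total


power_at_boundary = power_by_enumeration(0.1)   # 0.1 + 0.01 - 0.001 = 0.109
```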

**Remark 18.4.**
* If X denotes the number of independent trials needed to obtain the first success, then $X \sim GEO(p)$, and
$$ P(X = k) = (1 - p)^{k-1} p,$$
where $k = 1, 2, 3, \ldots$.
* Further $P(X \le n) = 1 - (1 - p)^n$
  * since
  $$\begin{align}
    \sum^n_{k=1} (1 - p)^{k-1} p &= p \sum^n_{k=1} (1-p)^{k-1} \\
    &= p \frac{1-(1-p)^n}{1-(1-p)} \\
    &= 1-(1-p)^n.
\end{align}$$

**Example 18.10.**
* Let $X$ be the number of independent trials required to obtain a success, where $p$ is the probability of success on each trial. The hypothesis $H_0 : p = 0.1$ is to be tested against the alternative $H_a : p = 0.3$.
* The hypothesis is rejected if $X \le 4$. What is the power of the test if $H_a$ is true?

**Answer:**
* The power function is given by
$$\pi(p) = \begin{cases}
P(\text{Type I Error}) &\text{if } p = 0.1\\
1-P(\text{Type II Error}) &\text{if } p = 0.3.
\end{cases}$$
* Hence, we have
$$\begin{align}
\pi(0.3) &= 1 - P (\text{Accept } H_0 \mid H_0 \text{ is false}) \\
  &= P(\text{Reject } H_0 \mid H_a \text{ is true}) \\
  &= P (X \le 4 \mid H_a \text{ is true}) \\
  &= P (X \le 4 \mid p = 0.3) \\
  &= \sum^4_{k=1} P (X = k \mid p = 0.3) \\
  &= \sum^4_{k=1}(1-p)^{k-1}p \\
  &= 1-(1-p)^4 \\
  &= 1 - (0.7)^4 \\
  &= 0.7599.
\end{align}$$
Hence, the power of the test at the alternative is 0.7599.
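
Both values of the power function for this test can be computed from the geometric sum in Remark 18.4. The helper name `geometric_cdf` is an illustrative assumption.

```python
def geometric_cdf(n, p):
    """P(X <= n) for X ~ GEO(p), summed term by term."""
    return sum((1 - p) ** (k - 1) * p for k in range(1, n + 1))


power_at_alternative = geometric_cdf(4, 0.3)   # 1 - 0.7**4 = 0.7599
type_one_error = geometric_cdf(4, 0.1)         # 1 - 0.9**4 = 0.3439
```

The second value shows that this rejection rule has a fairly large probability of type I error, which is the price paid for the high power at $p = 0.3$.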

**Example 18.11.**
* Let $X_1, X_2, ..., X_{25}$ be a random sample of size 25 drawn from a normal distribution with unknown mean $\mu$ and variance $\sigma^2 = 100$.
* It is desired to test the null hypothesis $\mu = 4$ against the alternative $\mu = 6$.
* Rejection rule: reject $\mu = 4$ if $\sum^{25}_{i=1} X_i \ge 125$.
* What is the power of the test at $\mu = 6$?

**Answer:**
* The power of the test at the alternative is
$$\begin{align}
\pi(6) &= 1 - P(\text{Type II Error}) \\
&= 1-P(\text{Accept } H_0 \mid H_0 \text{ is false})\\
&= P(\text{Reject } H_0 \mid H_a \text{ is true})\\
&= P\Bigl(\sum^{25}_{i=1} X_i \ge 125 \Bigm| H_a : \mu = 6\Bigr) \\
&= P(\overline{X} \ge 5 \mid H_a: \mu = 6) \\
&= P\Bigl(\frac{\overline{X}-6}{\frac{10}{\sqrt{25}}} \ge \frac{5-6}{\frac{10}{\sqrt{25}}}\Bigm|\mu=6\Bigr)\\
&= P\Bigl(Z \ge -\frac{1}{2}\Bigr) \quad \text{where $Z$ is standard normal} \\
&= 0.6915.
\end{align}$$
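
The power at any mean can be computed with the standard library's normal distribution. This is a sketch under the example's assumptions ($n = 25$, $\sigma = 10$, reject when the total is at least 125); the function name `normal_test_power` is illustrative.

```python
from statistics import NormalDist


def normal_test_power(mu, n=25, sigma=10.0, cutoff_total=125.0):
    """P(sum of X_i >= cutoff_total) when each X_i ~ N(mu, sigma**2)."""
    # The sample mean is N(mu, sigma**2 / n); the rule is "sample mean >= cutoff_total / n".
    mean_dist = NormalDist(mu=mu, sigma=sigma / n ** 0.5)
    return 1 - mean_dist.cdf(cutoff_total / n)


power_at_6 = normal_test_power(6)   # approximately 0.6915
size_at_4 = normal_test_power(4)    # approximately 0.3085
```

Evaluating the same function at $\mu = 4$ gives the probability of type I error for this rule.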