## Chapter 07
# Hypothesis Testing with One Sample

Adapted from ["Elementary Statistics - Picturing the World" 6th edition](https://www.amazon.com/Elementary-Statistics-Picturing-World-6th/dp/0321911210/)

In [1]:
from notebook.services.config import ConfigManager
cm = ConfigManager()
cm.update('livereveal', {
        'scroll': True,
        'width': "100%",
        'height': "100%",
})

{'scroll': True, 'width': '100%', 'height': '100%'}


## 7.1 <br/>Introduction to Hypothesis Testing

### Hypothesis Tests

- A **hypothesis test** is a process that uses sample statistics to test a claim about the value of a population parameter.
- Researchers in fields such as medicine, psychology, and business rely on hypothesis testing to make informed decisions about new medicines, treatments, and marketing strategies.

### Stating a Hypothesis

- A statement about a population parameter is called a **statistical hypothesis**.
- To test a population parameter, you should carefully state a pair of hypotheses—one that represents the claim and the other, its complement.
- When one of these hypotheses is false, the other must be true. Either hypothesis—the **null hypothesis** or the **alternative hypothesis** —may represent the original claim.
- The term **null hypothesis** was introduced by Ronald Fisher.
- If the statement in the **null hypothesis** is not true, then the **alternative hypothesis** must be true.

### Stating a Hypothesis: Definition

- A **null hypothesis $H_{0}$** is a statistical hypothesis that contains a statement of equality, such as $\le$, $=$, or $\ge$.
- The **alternative hypothesis $H_{a}$** is the complement of the null hypothesis. It is a statement that must be true if $H_{0}$ is false and it contains a statement of strict inequality, such as $\gt$, $\ne$, or $\lt$.

### Stating a Hypothesis

- To write the null and alternative hypotheses, translate the claim made about the population parameter from a verbal statement to a mathematical statement. Then, write its complement. 
- For instance, if the claim value is $k$ and the population parameter is $\mu$, then some possible pairs of null and alternative hypotheses are:
 - $\left\{ \begin{array}{}
       H_{0}:\mu \le k \\
       H_{a}:\mu \gt k 
     \end{array}\right.$
     
 - $\left\{ \begin{array}{}
       H_{0}:\mu \ge k \\
       H_{a}:\mu \lt k 
     \end{array}\right.$

 - $\left\{ \begin{array}{}
       H_{0}:\mu = k \\
       H_{a}:\mu \ne k 
     \end{array}\right.$

- Regardless of which of the three pairs of hypotheses we use, we always assume $\mu = k$ and examine the sampling distribution on the basis of this assumption. 
- Within this sampling distribution, we will determine whether or not a sample statistic is unusual.

### Stating a Hypothesis

- The table shows the relationship between possible verbal statements about the parameter $\mu$ and the corresponding null and alternative hypotheses.
- Similar statements can be made to test other population parameters, such as $p$, $\sigma$, or $\sigma^{2}$.

![](./image/7_1_hypothesis_table.png)

### Stating the Null and Alternative Hypotheses [example 1]

Write the claim as a mathematical statement. State the null and alternative hypotheses, and identify which represents the claim.

- Q1: A school publicizes that the proportion of its students who are involved in at least one extracurricular activity is $61\%$.
- Q2: A car dealership announces that the mean time for an oil change is less than $15$ minutes.
- Q3: A company advertises that the mean life of its furnaces is more than $18$ years.

### Stating the Null and Alternative Hypotheses [solution]

#### Q1: A school publicizes that the proportion of its students who are involved in at least one extracurricular activity is $61\%$.
- The claim can be written as $p = 0.61$. 
- Its complement is $p \ne 0.61$. 
- Because $p = 0.61$ contains the statement of equality, it becomes the **null hypothesis**. 
- In this case, the **null hypothesis** represents the **claim**.
- $\left\{ \begin{array}{}
       H_{0}:p = 0.61\\
       H_{a}:p \ne 0.61 
     \end{array}\right.$
![](./image/7_1_ex_1_q_1_hypothesis.png)

#### Q2: A car dealership announces that the mean time for an oil change is less than $15$ minutes.
- The claim can be written as $\mu \lt 15$.
- Its complement is $\mu \ge 15$. 
- Because $\mu \ge 15$ contains the statement of equality, it becomes the **null hypothesis**. 
- In this case, the **alternative hypothesis** represents the **claim**.
- $\left\{ \begin{array}{}
       H_{0}:\mu \ge 15\\
       H_{a}:\mu \lt 15 
     \end{array}\right.$
![](./image/7_1_ex_1_q_2_hypothesis.png)
 
#### Q3: A company advertises that the mean life of its furnaces is more than $18$ years.
- The claim can be written as $\mu \gt 18$.
- Its complement is $\mu \le 18$. 
- Because $\mu \le 18$ contains the statement of equality, it becomes the null hypothesis. 
- In this case, the **alternative hypothesis** represents the **claim**.
- $\left\{ \begin{array}{}
       H_{0}:\mu \le 18\\
       H_{a}:\mu \gt 18 
     \end{array}\right.$
![](./image/7_1_ex_1_q_3_hypothesis.png)
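
The rule used in the three solutions above (the statement containing equality becomes $H_{0}$, and its complement becomes $H_{a}$) can be sketched in a few lines of Python. The function name `state_hypotheses` and the plain-text symbols (`!=`, `<=`, `>=`) are illustrative choices, not part of the textbook.

```python
# Complement of each relational symbol (plain-text stand-ins for =, ≠, <, ≥, >, ≤).
COMPLEMENT = {'=': '!=', '!=': '=', '<': '>=', '>=': '<', '>': '<=', '<=': '>'}
# Symbols that contain a statement of equality and therefore belong in H0.
HAS_EQUALITY = {'=', '<=', '>='}

def state_hypotheses(parameter, symbol, value):
    """Return H0, Ha, and which of the two represents the claim."""
    claim = f'{parameter} {symbol} {value}'
    complement = f'{parameter} {COMPLEMENT[symbol]} {value}'
    if symbol in HAS_EQUALITY:
        return {'H0': claim, 'Ha': complement, 'claim': 'H0'}
    return {'H0': complement, 'Ha': claim, 'claim': 'Ha'}

print(state_hypotheses('p', '=', 0.61))   # Q1
print(state_hypotheses('mu', '<', 15))    # Q2
print(state_hypotheses('mu', '>', 18))    # Q3
```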

### Types of Errors and Level of Significance

- No matter which hypothesis represents the claim, we always begin a hypothesis test by assuming that the equality condition in the null hypothesis is true. 
- When we perform a hypothesis test, we make one of two decisions:
  1. reject the null hypothesis or
  2. accept (fail to reject) the null hypothesis.
- Because your decision is based on a sample rather than the entire population, there is always the possibility you will make the wrong decision.
- The only way to be absolutely certain of whether $H_{0}$ is true or false is to test the entire population. 
- Because your decision — to reject $H_{0}$ or to accept (fail to reject) $H_{0}$ — is based on a sample, you must accept the fact that your decision might be incorrect.

### Types of Errors: Definition


- A **type I error** occurs if the null hypothesis is rejected when it is true. **Failed to accept**.
- A **type II error** occurs if the null hypothesis is not rejected when it is false. **Failed to reject**.

![](./image/7_1_type_of_errors_table.png)

### Types of Errors 

Hypothesis testing is sometimes compared to the legal system used in the United States. Under this system, these steps are used:

1. A carefully worded accusation is written.
2. The defendant is assumed innocent ($H_{0}$) until proven guilty. The burden of proof lies with the prosecution. If the evidence is not strong enough, then there is no conviction. A “not guilty” verdict does not prove that a defendant is innocent.
3. The evidence needs to be conclusive beyond a reasonable doubt. The system assumes that more harm is done by convicting the innocent (**type I error**) than by not convicting the guilty (**type II error**).

![](./image/7_1_types_of_errors_legal.png)

### Identifying Type I and Type II Errors [example 2]

- The USDA limit for salmonella contamination for chicken is $20\%$. 
- A meat inspector reports that the chicken produced by a company exceeds the USDA limit. 
- You perform a hypothesis test to determine whether the meat inspector’s claim is true. 
- When will a type I or type II error occur? 
- Which error is more serious?

### Identifying Type I and Type II Errors [solution]

- Let $p$ represent the proportion of the chicken that is contaminated. 
- The meat inspector’s claim is “more than $20\%$ is contaminated.” 
- We can write the null and alternative hypotheses as:

$\left\{ \begin{array}{}
       H_{0}:p \le 0.2\\
       H_{a}:p \gt 0.2 
     \end{array}\right.$

- In this case, the **alternative hypothesis** represents the **claim**.

![](./image/7_1_ex_2_contaminated_chicken.png)

- A **type I error** will occur when the actual proportion of contaminated chicken is less than or equal to $0.2$, but you reject $H_{0}$ . 
- A **type II error** will occur when the actual proportion of contaminated chicken is greater than $0.2$, but you do not reject $H_{0}$.
- With a type I error, you might create a health scare and hurt the sales of chicken producers who were actually meeting the USDA limits.
- With a type II error, you could be allowing chicken that exceeded the USDA contamination limit to be sold to consumers. 
- A type II error is more serious because it could result in sickness or even death.

### Level of Significance

- We will reject the null hypothesis when the sample statistic from the sampling distribution is unusual. 
- We have already identified unusual events to be those that occur with a probability of $0.05$ or less. 
- When statistical tests are used, an unusual event is sometimes required to have a probability of $0.10$ or less, $0.05$ or less, or $0.01$ or less. 
- Because there is variation from sample to sample, there is always a possibility that you will reject a null hypothesis when it is actually true. 
- In other words, although the null hypothesis is true, your sample statistic is determined to be an unusual event in the sampling distribution. 
- We can decrease the probability of this happening by lowering the **level of significance**.

### Level of Significance: Definition

- In a hypothesis test, the level of significance is your maximum allowable probability of making a **type I error**. It is denoted by $\alpha$, the lowercase Greek letter **alpha**.
- The probability of a **type II error** is denoted by $\beta$, the lowercase Greek letter **beta**.

- By setting the level of significance at a small value, we are saying that you want the probability of rejecting a true null hypothesis to be small. 
- Three commonly used levels of significance are $\alpha = 0.10$, $\alpha = 0.05$, and $\alpha = 0.01$.
- When we decrease $\alpha$, we are likely to be increasing $\beta$.
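
The trade-off between $\alpha$ and $\beta$ can be illustrated with a small sketch. It assumes a right-tailed $z$-test and a hypothetical true mean that lies 2 standard errors above the value stated in $H_{0}$; both the scenario and the function name `type_ii_error` are assumptions made for illustration only.

```python
from scipy import stats

def type_ii_error(alpha, shift_in_se):
    """Beta for a right-tailed z-test when the true mean lies
    `shift_in_se` standard errors above the hypothesized mean."""
    z_crit = stats.norm.ppf(1 - alpha)           # boundary of the rejection region
    # We fail to reject when z < z_crit; under the true mean, z is centered at shift_in_se.
    return stats.norm.cdf(z_crit - shift_in_se)

for alpha in (0.10, 0.05, 0.01):
    beta = type_ii_error(alpha, shift_in_se=2.0)
    print(f'alpha = {alpha:.2f} -> beta = {beta:.4f}')
```

In this hypothetical setting, each smaller value of $\alpha$ produces a larger value of $\beta$.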

### Statistical Tests and $P$-Values

- After stating the null and alternative hypotheses and specifying the level of significance, the next step in a hypothesis test is to obtain a random sample from the population and calculate the sample statistic (such as $\bar{x}$, $\hat{p}$, or $s^{2}$) corresponding to the parameter in the null hypothesis (such as $\mu$, $p$, or $\sigma^{2}$).
- This sample statistic is called the **test statistic**. 
- With the assumption that the null hypothesis is true, the test statistic is then converted to a **standardized test statistic**, such as $z$, $t$, or $\chi^{2}$. 
- The standardized test statistic is used in making the decision about the null hypothesis.

### Statistical Tests and $P$-Values

- If the null hypothesis is true, then a **$P$-value** (or **probability value**) of a hypothesis test is the probability of obtaining a sample statistic with a value as extreme or more extreme than the one determined from the sample data.
- The $P$-value of a hypothesis test depends on the nature of the test. 
- There are three types of hypothesis tests — **left-tailed**, **right-tailed**, and **two-tailed**.
- The type of test depends on the location of the region of the sampling distribution that favors a rejection of $H_{0}$. 
- This region is indicated by the alternative hypothesis.

### Statistical Tests and $P$-Values

1. If the **alternative hypothesis** $H_{a}$ contains the **less-than** inequality symbol ($\lt$), then the hypothesis test is a **left-tailed test**.

![](./image/7_1_left_tailed_test_plot.png)

2. If the **alternative hypothesis** $H_{a}$ contains the **greater-than** inequality symbol ($\gt$), then the hypothesis test is a **right-tailed test**.

![](./image/7_1_right_tailed_test_plot.png)

3. If the **alternative hypothesis** $H_{a}$ contains the **not-equal-to** symbol ($\ne$), then the hypothesis test is a **two-tailed test**. In a two-tailed test, each tail has an area of $\frac{1}{2}P$.

![](./image/7-1_two_tailed_test_plot.png)

#### How to interpret

- The smaller the $P$-value of the test, the more evidence there is to reject the null hypothesis. 
- A very small $P$-value indicates an unusual event. 
- However, even a very low $P$-value does not constitute proof that the null hypothesis is false, only that it is probably false.

### Identifying the Nature of a Hypothesis Test [example 3]

For each claim, state $H_{0}$ and $H_{a}$ in words and in symbols. Then determine whether the hypothesis test is a left-tailed test, right-tailed test, or two-tailed test. Sketch a normal sampling distribution and shade the area for the $P$-value.

- Q1: A school publicizes that the proportion of its students who are involved in at least one extracurricular activity is $61\%$.
- Q2: A car dealership announces that the mean time for an oil change is less than $15$ minutes.
- Q3: A company advertises that the mean life of its furnaces is more than $18$ years.

### Identifying the Nature of a Hypothesis Test [solution]

#### Q1: A school publicizes that the proportion of its students who are involved in at least one extracurricular activity is $61\%$.

- The proportion of students who are involved in at least one extracurricular activity is $61\%$.
  - $H_{0}: p = 0.61$

- The proportion of students who are involved in at least one extracurricular activity is not $61\%$.
  - $H_{a}: p \ne 0.61$
  
- Because $H_{a}$ contains the $\ne$ symbol, the test is a two-tailed hypothesis test.
- The figure below shows the normal sampling distribution with a shaded area for the $P$-value.

![](./image/7_1_ex_3_q_1_p_value_plut.png)

#### Q2: A car dealership announces that the mean time for an oil change is less than $15$ minutes.

- The mean time for an oil change is greater than or equal to $15$ minutes.
  - $H_{0}: \mu \ge 15$

- The mean time for an oil change is less than $15$ minutes.
  - $H_{a}: \mu \lt 15$
  
- Because $H_{a}$ contains the $\lt$ symbol, the test is a left-tailed hypothesis test.
- The figure below shows the normal sampling distribution with a shaded area for the $P$-value.

![](./image/7_1_ex_3_q_2_p_value_plot.png)

#### Q3: A company advertises that the mean life of its furnaces is more than $18$ years.

- The mean life of the furnaces is less than or equal to $18$ years.
  - $H_{0}: \mu \le 18$

- The mean life of the furnaces is more than $18$ years.
  - $H_{a}: \mu \gt 18$

- Because $H_{a}$ contains the $\gt$ symbol, the test is a right-tailed hypothesis test.
- The figure below shows the normal sampling distribution with a shaded area for the $P$-value.

![](./image/7_1_ex_3_q_3_p_value_plot.png)

### Making a Decision and Interpreting the Decision

- To conclude a hypothesis test, we make a decision and interpret that decision.
- For any hypothesis test, there are two possible outcomes:  
  1. reject the null hypothesis or 
  2. accept (fail to reject) the null hypothesis.
  
- To use a $P$-value to make a decision in a hypothesis test, compare the $P$-value with $\alpha$.
  - If $P \le \alpha$, then reject $H_{0}$.
  - If $P \gt \alpha$, then accept (fail to reject) $H_{0}$.
  
- Accepting (failing to reject) the null hypothesis does not mean that we have accepted the null hypothesis as true. It simply means that there is not enough evidence to reject the null hypothesis. 
- To support a claim, state it so that it becomes the alternative hypothesis. 
- To reject a claim, state it so that it becomes the null hypothesis. 

![](./image/7_1_h0_table.png)
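
The wording of the conclusion depends on two things: the decision about $H_{0}$ and whether the claim is $H_{0}$ or $H_{a}$. A minimal sketch of that lookup follows; the function name `interpret` and the generic sentence templates are illustrative assumptions.

```python
def interpret(reject_h0, claim_is_h0):
    """Return the conclusion sentence for a decision about H0."""
    if claim_is_h0:
        # A claim stated as H0 can only be rejected or not rejected.
        if reject_h0:
            return 'There is enough evidence to reject the claim.'
        return 'There is not enough evidence to reject the claim.'
    # A claim stated as Ha can only be supported or not supported.
    if reject_h0:
        return 'There is enough evidence to support the claim.'
    return 'There is not enough evidence to support the claim.'

for claim_is_h0 in (True, False):
    for reject_h0 in (True, False):
        print(f'claim is H0: {claim_is_h0}, reject H0: {reject_h0} -> '
              f'{interpret(reject_h0, claim_is_h0)}')
```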

### Making a Decision and Interpreting the Decision [example 4]

You perform a hypothesis test for each claim. How should you interpret your decision if you reject $H_{0}$? If you fail to reject $H_{0}$?

- Q1: $H_{0}$ (Claim): A school publicizes that the proportion of its students who are involved in at least one extracurricular activity is $61\%$.
- Q2: $H_{a}$ (Claim): A car dealership announces that the mean time for an oil change is less than $15$ minutes.

### Making a Decision and Interpreting the Decision [solution]

#### Q1: $H_{0}$ (Claim): A school publicizes that the proportion of its students who are involved in at least one extracurricular activity is $61\%$.

- The claim is represented by $H_{0}$. 
- If you reject $H_{0}$, then you should conclude "there is enough evidence to reject the school's claim that the proportion of students who are involved in at least one extracurricular activity is $61\%$."
- If you fail to reject $H_{0}$, then you should conclude “there is not enough evidence to reject the school’s claim that the proportion of students who are involved in at least one extracurricular activity is $61\%$.”

#### Q2: $H_{a}$ (Claim): A car dealership announces that the mean time for an oil change is less than $15$ minutes.

- The claim is represented by $H_{a}$, so the null hypothesis is “the mean time for an oil change is greater than or equal to 15 minutes.” 
- If we reject $H_{0}$, then we should conclude “there is enough evidence to support the dealership’s claim that the mean time for an oil change is less than $15$ minutes.” 
- If we fail to reject $H_{0}$, then we should conclude “there is not enough evidence to support the dealership’s claim that the mean time for an oil change is less than $15$ minutes.”

### Steps for Hypothesis Testing

1. State the claim mathematically and verbally. Identify the null and alternative hypotheses.
  - $H_0: ?$, $H_a: ?$
  
2. Specify the level of significance.
  - $\alpha = ?$
  
3. Determine the standardized sampling distribution and sketch its graph.
![](./image/7_1_standardized_sampling_distribution.png)

4. Calculate the test statistic and its corresponding standardized test statistic. Add it to your sketch.
![](./image/7_1_standardized_test_statistics.png)

5. Find the $P$-value.
6. Use this decision rule.
![](./image/7_1_decision_rule.png)

7. Write a statement to interpret the decision in the context of the original claim.

### Strategies for Hypothesis Testing

- The strategy that we will use in hypothesis testing should depend on whether we are trying to support or reject a claim. 
- Remember that we cannot use a hypothesis test to support our claim when our claim is the null hypothesis.
- So, as a researcher, to perform a hypothesis test where the possible outcome will support a claim, word the claim so it is the alternative hypothesis. 
- To perform a hypothesis test where the possible outcome will reject a claim, word it so the claim is the null hypothesis.

### Writing the Hypotheses [example 5]

A medical research team is investigating the benefits of a new surgical treatment. One of the claims is that the mean recovery time for patients after the new treatment is less than $96$ hours.

- Q1: How would you write the null and alternative hypotheses when you are on the research team and want to support the claim?
- Q2: How would you write the null and alternative hypotheses when you are on an opposing team and want to reject the claim?

### Writing the Hypotheses [solution]

#### Q1

To answer the question, first think about the context of the claim. Because you want to support this claim, make the alternative hypothesis state that the mean recovery time for patients is less than $96$ hours.
So, $H_a : \mu < 96$ hours. Its complement, $H_0 : \mu \ge 96$ hours, would be the
null hypothesis.
- $H_0 : \mu \ge 96$
- $H_a : \mu < 96$ (Claim)

#### Q2

First think about the context of the claim. As an opposing researcher, you do not want the recovery time to be less than $96$ hours. Because you want to reject this claim, make it the null hypothesis. So, $H_0 : \mu \le 96$ hours.
Its complement, $H_a : \mu > 96$ hours, would be the alternative hypothesis.
- $H_0 : \mu \le 96$ (Claim)
- $H_a : \mu > 96$


## 7.2 <br/>Hypothesis Testing for the Mean ($\sigma$ Known)

### Using $P$-Values to Make Decisions

To use a $P$-value to make a decision in a hypothesis test, compare the $P$-value with $\alpha$.

1. If $P \le \alpha$, then reject $H_0$.
2. If $P > \alpha$, then fail to reject $H_0$.

### Interpreting a $P$-Value [example 1]

The $P$-value for a hypothesis test is $P = 0.0237$. What is your decision when the level of significance is 

- Q1: $\alpha = 0.05$ and 
- Q2: $\alpha = 0.01$


### Interpreting a $P$-Value [solution]

1. Because $0.0237 < 0.05$, we reject the null hypothesis.
2. Because $0.0237 > 0.01$, we fail to reject the null hypothesis.

- The lower the $P$-value, the more evidence there is in favor of rejecting $H_0$. 
- The $P$-value gives you the lowest level of significance for which the sample statistic allows you to reject the null hypothesis.
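
The decision rule can be checked in code. This is a minimal sketch; the helper name `decide` is an illustrative choice.

```python
def decide(p_value, alpha):
    """Reject H0 when P <= alpha; otherwise fail to reject H0."""
    return 'reject H0' if p_value <= alpha else 'fail to reject H0'

p_value = 0.0237
for alpha in (0.05, 0.01):
    print(f'alpha = {alpha}: {decide(p_value, alpha)}')
```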

### Finding the $P$-Value for a Hypothesis Test

After determining the hypothesis test's standardized test statistic and the standardized test statistic's corresponding area, do one of the following to find the $P$-value.

1. For a left-tailed test, $P =$ (Area in left tail).
2. For a right-tailed test, $P =$ (Area in right tail).
3. For a two-tailed test, $P = 2$ (Area in tail of standardized test statistic).
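
The three cases can be sketched with `scipy.stats.norm`, where `cdf` gives the area to the left of $z$ and `sf` (the survival function) gives the area to the right. The function name `p_value_from_z` and the string labels for the tails are illustrative assumptions.

```python
from scipy import stats

def p_value_from_z(z, tail):
    """P-value for a standardized test statistic z from a standard normal distribution."""
    if tail == 'left':
        return stats.norm.cdf(z)           # area in the left tail
    if tail == 'right':
        return stats.norm.sf(z)            # area in the right tail
    if tail == 'two':
        return 2 * stats.norm.sf(abs(z))   # twice the area in the tail beyond |z|
    raise ValueError("tail must be 'left', 'right', or 'two'")

print(p_value_from_z(-2.23, 'left'))
print(p_value_from_z(2.14, 'two'))
```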

### Finding a $P$-Value for a Left-Tailed Test [example 2]

Find the $P$-value for a left-tailed hypothesis test with a standardized test statistic of $z = -2.23$. Decide whether to reject $H_0$ when the level of significance is $\alpha = 0.01$.

### Finding a $P$-Value for a Left-Tailed Test [solution]

![](./image/7_2_ex_2_left_tailed_test.png)

The figure shows the standard normal curve with a shaded area to the left of $z = -2.23$. For a left-tailed test,

$P =$ (Area in left tail).

- The area corresponding to $z = -2.23$ is $0.0129$, which is the area in the left tail. 
- So, the $P$-value for a left-tailed hypothesis test with a standardized test statistic of $z = -2.23$ is $P = 0.0129$.

#### Interpretation  
Because the $P$-value of $0.0129$ is greater than $0.01$, we fail to reject $H_0$.

In [2]:
from scipy import stats

z = -2.23
P = stats.norm.cdf(z)
print(f'P: {P}')

P: 0.012873721438602014


### Finding a $P$-Value for a Two-Tailed Test [example 3]

Find the $P$-value for a two-tailed hypothesis test with a standardized test statistic of $z = 2.14$. Decide whether to reject $H_0$ when the level of significance is $\alpha = 0.05$.

### Finding a $P$-Value for a Two-Tailed Test [solution]

![](./image/7_2_ex_3_two_tailed_test.png)

The figure shows the standard normal curve with shaded areas to the left of $z = -2.14$ and to the right of $z = 2.14$. For a two-tailed test, 

$P = 2$ (Area in tail of standardized test statistic).

- The area corresponding to $z = 2.14$ is $0.9838$. 
- The area in the right tail is $1 - 0.9838 = 0.0162$. 
- So, the $P$-value for a two-tailed hypothesis test with a standardized test statistic of $z = 2.14$ is $P = 2 \times (0.0162) = 0.0324$.

#### Interpretation  
Because the $P$-value of $0.0324$ is less than $0.05$, you reject $H_0$ .

In [3]:
from scipy import stats

z = 2.14
# Two-tailed test: double the area in the tail beyond |z|.
P = 2 * stats.norm.cdf(-abs(z))
print(f'P: {P}')

P: 0.03235476674433216


### Using $P$-values for a $z$-Test

- The $z$-test for a mean $\mu$ is a statistical test for a population mean. 
- The test statistic is the sample mean $\bar{x}$. 
- The standardized test statistic is: 

$z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}$

- This standardized test statistic applies when these conditions are met:
  1. The sample is random.
  2. At least one of the following is true: The population is normally distributed or $n \ge 30$.

- Recall that $\frac{\sigma}{\sqrt{n}}$ is the standard error of the mean, $\sigma_{\bar{x}}$.

### Guidelines: Using $P$-Values for a $z$-Test for a Mean $\mu$ ($\sigma$ Known)

1. Verify that $\sigma$ is known, the sample is random, and either the population is normally distributed or $n \ge 30$.

2. State the claim mathematically and verbally. Identify the null and alternative hypotheses.
  - State $H_0$ and $H_a$
  
3. Specify the level of significance.
  - Identify $\alpha$
  
4. Find the standardized test statistic.
  - $z = \frac{\bar{x}-\mu}{\sigma/\sqrt{n}}$
  
5. Find the area that corresponds to $z$.
  - Use `scipy.stats`
  
6. Find the $P$-value.
  - For a left-tailed test, $P$ = (Area in left tail).
  - For a right-tailed test, $P$ = (Area in right tail).
  - For a two-tailed test, $P$ = 2 (Area in tail of standardized test statistic).
  
7. Make a decision to reject or fail to reject the null hypothesis.
  - If $P \le \alpha$, then reject $H_0$. Otherwise, fail to reject $H_0$.

8. Interpret the decision in the context of the original claim.
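
Steps 4 through 7 of these guidelines can be collected into one function. This is a sketch under the stated conditions ($\sigma$ known, random sample, normal population or $n \ge 30$); the name `z_test` and its parameters are illustrative assumptions, and the call at the end uses the pit stop figures from the example that follows.

```python
import math
from scipy import stats

def z_test(x_bar, mu_0, sigma, n, tail, alpha):
    """z-test for a mean with sigma known.

    Returns the standardized test statistic, the P-value,
    and whether H0 is rejected at level alpha.
    """
    z = (x_bar - mu_0) / (sigma / math.sqrt(n))   # step 4
    if tail == 'left':                            # steps 5 and 6
        p = stats.norm.cdf(z)
    elif tail == 'right':
        p = stats.norm.sf(z)
    elif tail == 'two':
        p = 2 * stats.norm.sf(abs(z))
    else:
        raise ValueError("tail must be 'left', 'right', or 'two'")
    return z, p, bool(p <= alpha)                 # step 7

z, p, reject = z_test(x_bar=12.9, mu_0=13, sigma=0.19, n=32, tail='left', alpha=0.01)
print(f'z: {z:.2f}, P: {p:.4f}, reject H0: {reject}')
```

Step 8, the interpretation, still depends on whether the claim is $H_0$ or $H_a$, so it is left to the reader.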

### Hypothesis Testing Using a $P$-Value [example 4]

- In auto racing, a pit stop is where a racing vehicle stops for new tires, fuel, repairs, and other mechanical adjustments. 
- The efficiency of a pit crew that makes these adjustments can affect the outcome of a race. 
- A pit crew claims that its mean pit stop time (for $4$ new tires and fuel) is less than $13$ seconds.
- A random sample of $32$ pit stop times has a sample mean of $12.9$ seconds.
- Assume the population standard deviation is $0.19$ second. 
- Is there enough evidence to support the claim at $\alpha = 0.01$? Use a $P$-value.

### Hypothesis Testing Using a $P$-Value [solution]

- Because $\sigma$ is known ($\sigma = 0.19$), the sample is random, and $n = 32 \ge 30$, we can use the $z$-test. 
- The claim is "the mean pit stop time is less than $13$ seconds." So, the null and alternative hypotheses are:
  - $H_0 : \mu \ge 13$ seconds 
  - $H_a : \mu < 13$ seconds. (Claim)
- The level of significance is $\alpha = 0.01$. The standardized test statistic is:
  - $z = \frac{\bar{x}-\mu}{\sigma/\sqrt{n}} = \frac{12.9 - 13}{0.19/\sqrt{32}} \approx -2.98$
- The area corresponding to $z = -2.98$ is $0.0014$ (we can use Scipy).
- Because this test is a left-tailed test, the $P$-value is equal to the area to the left of $z = -2.98$, as shown in the figure. So, $P = 0.0014$. 
- Because the $P$-value is less than $\alpha = 0.01$, we reject the null hypothesis.

![](./image/7_2_ex_4_lest_tailed_test.png)

#### Interpretation  
There is enough evidence at the $1\%$ level of significance to support the claim that the mean pit stop time is less than $13$ seconds.

In [4]:
import math
from scipy import stats

sigma = 0.19
n = 32
mu = 13
x_bar = 12.9
alpha = 0.01

z = (x_bar - mu) / (sigma / math.sqrt(n))
print(f'z: {z}')

P = stats.norm.cdf(z)
print(f'P: {P}')

z: -2.977291710259137
P: 0.0014540358484991462


### Hypothesis Testing Using a $P$-Value [example 5]

- According to a study, the mean cost of bariatric (weight loss) surgery is $\$21,500$. 
- You think this information is incorrect. 
- You randomly select $25$ bariatric surgery patients and find that the mean cost for their surgeries is $\$20,695$. 
- From past studies, the population standard deviation is known to be $\$2,250$ and the population is normally distributed. 
- Is there enough evidence to support your claim at $\alpha = 0.05$? Use a $P$-value.

### Hypothesis Testing Using a $P$-Value [solution]

- Because $\sigma$ is known ($\sigma = \$2,250$), the sample is random, and the population is normally distributed, you can use the $z$-test. 
- The claim is "the mean is different from $\$21,500$." So, the null and alternative hypotheses are:
  - $H_0 : \mu = \$21,500$        
  - $H_a : \mu \ne \$21,500$. (Claim)
- The level of significance is $\alpha = 0.05$. The standardized test statistic is:
  - $z = \frac{\bar{x}-\mu}{\sigma/\sqrt{n}} = \frac{20,695 - 21,500}{2,250/\sqrt{25}} \approx -1.79$
- The area corresponding to $z = -1.79$ is $0.0367$ (we can use Scipy). 
- Because the test is a two-tailed test, the $P$-value is equal to twice the area to the left of $z = -1.79$, as shown in the figure. So:
  - $P = 2 \times (0.0367) = 0.0734$.
- Because the $P$-value is greater than $\alpha = 0.05$, you fail to reject the null hypothesis.

![](./image/7_2_ex_5_two_tailed_test.png)

#### Interpretation  
There is not enough evidence at the $5\%$ level of significance to support the claim that the mean cost of bariatric surgery is different from $\$21,500$.

In [5]:
import math
from scipy import stats

sigma = 2250
n = 25
mu = 21500
x_bar = 20695
alpha = 0.05

z = (x_bar - mu) / (sigma / math.sqrt(n))
print(f'z: {z}')

P = 2 * stats.norm.cdf(z)
print(f'P: {P}')

z: -1.788888888888889
P: 0.07363271157315165


### Rejection Regions and Critical Values

- Another method to decide whether to reject the null hypothesis is to determine whether the standardized test statistic falls within a range of values called the **rejection region** of the sampling distribution.
- A rejection region (or critical region) of the sampling distribution is the range of values for which the null hypothesis is not probable. 
- If a standardized test statistic falls in this region, then the null hypothesis is rejected. 
- A critical value $z_0$ separates the rejection region from the nonrejection region.

### Finding Critical Values in the Standard Normal Distribution [guidelines]

1. Specify the level of significance $\alpha$.
2. Determine whether the test is left-tailed, right-tailed, or two-tailed.
3. Find the critical value(s) $z_0$ . When the hypothesis test is:
  - *left-tailed*, find the $z$-score that corresponds to an area of $\alpha$.
  - *right-tailed*, find the $z$-score that corresponds to an area of $1 - \alpha$.
  - *two-tailed*, find the $z$-scores that correspond to $\frac{1}{2}\alpha$ and $1 - \frac{1}{2}\alpha$.
4. Sketch the standard normal distribution. Draw a vertical line at each critical value and shade the rejection region(s). (See the figures below.)

Note that a standardized test statistic that falls in a rejection region is considered an **unusual event**.

| ![](./image/7_2_left_tailed_test.png) | ![](./image/7_2_right_tailed_test.png) | 
|:-:|:-:|

![](./image/7_2_two_tailed_test.png)
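
Step 3 of the guidelines can be sketched with `scipy.stats.norm.ppf`, the inverse of the cumulative distribution function: it returns the $z$-score that has a given area to its left. The function name `critical_values` and the tail labels are illustrative assumptions.

```python
from scipy import stats

def critical_values(alpha, tail):
    """Critical value(s) z_0 of the standard normal distribution."""
    if tail == 'left':
        return (stats.norm.ppf(alpha),)        # area alpha to the left
    if tail == 'right':
        return (stats.norm.ppf(1 - alpha),)    # area 1 - alpha to the left
    if tail == 'two':
        z_0 = stats.norm.ppf(1 - alpha / 2)    # area 1 - alpha/2 to the left
        return (-z_0, z_0)
    raise ValueError("tail must be 'left', 'right', or 'two'")

print(critical_values(0.01, 'left'))
print(critical_values(0.05, 'two'))
```

Unlike a printed table, `ppf` is not limited to two decimal places, so its results can differ slightly from table values in the third decimal.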

### Finding a Critical Value for a Left-Tailed Test [example 7]

Find the critical value and rejection region for a left-tailed test with $\alpha = 0.01$.

### Finding a Critical Value for a Left-Tailed Test [solution]

- The figure shows the standard normal curve with a shaded area of $0.01$ in the left tail. 
- The $z$-score that is closest to an area of $0.01$ is $-2.33$ (we can use Scipy). 
- So, the critical value is
  - $z_0 = -2.33$
- The rejection region is to the left of this critical value.

![](./image/7_2_ex_7_left_tailed_test.png)

### Finding Critical Values for a Two-Tailed Test [example 8]

Find the critical values and rejection regions for a two-tailed test with $\alpha = 0.05$.

### Finding Critical Values for a Two-Tailed Test [solution]

- The figure shows the standard normal curve with shaded areas of $\frac{1}{2} \alpha = 0.025$ in each tail. 
- The area to the left of $-z_0$ is $\frac{1}{2} \alpha = 0.025$, and the area to the left of $z_0$ is $1 - \frac{1}{2} \alpha = 0.975$. 
- The $z$-scores that correspond to the areas $0.025$ and $0.975$ are $-1.96$ and $1.96$, respectively. 
- So, the critical values are $-z_0 = -1.96$ and $z_0 = 1.96$. 
- The rejection regions are to the left of $-1.96$ and to the right of $1.96$.

![](./image/7_2_ex_8_two_tailed_test.png)

### Using Rejection Regions for a $z$-Test

- To use a rejection region to conduct a hypothesis test, calculate the standardized test statistic $z$. 
- If the standardized test statistic $z$:
  1. is in the rejection region, then reject $H_0$.
  2. is not in the rejection region, then fail to reject $H_0$.
  
![](./image/72_decision_rule_based_on_rejection_region.png)
  
Remember, failing to reject the null hypothesis does not mean that you have accepted the null hypothesis as true. It simply means that there is not enough evidence to reject the null hypothesis.

### Using Rejection Regions for a $z$-Test for a Mean $\mu$ ($\sigma$ Known) [guidelines]

1. Verify that $\sigma$ is known, the sample is random, and either the population is normally distributed or $n \ge 30$.

2. State the claim mathematically and verbally. Identify the null and alternative hypotheses.
  - State $H_0$ and $H_a$
  
3. Specify the level of significance.
  - Identify $\alpha$

4. Determine the critical value(s) (we can use Scipy)

5. Determine the rejection region(s).

6. Find the standardized test statistic and sketch the sampling distribution.
  - $z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}$

7. Make a decision to reject or fail to reject the null hypothesis.
  - If $z$ is in the rejection region, then reject $H_0$. Otherwise, fail to reject $H_0$.
  
8. Interpret the decision in the context of the original claim.

### Hypothesis Testing Using a Rejection Region [example 9]

- Employees at a construction and mining company claim that the mean salary of the company’s mechanical engineers is less than that of one of its competitors, which is $\$68,000$. 
- A random sample of $20$ of the company’s mechanical engineers has a mean salary of $\$66,900$. 
- Assume the population standard deviation is $\$5,500$ and the population is normally distributed. 
- At $\alpha = 0.05$, test the employees’ claim.

### Hypothesis Testing Using a Rejection Region [solution]

- Because $\sigma$ is known ($\sigma = \$5,500$), the sample is random, and the population is normally distributed, you can use the $z$-test. 
- The claim is "the mean salary is less than $\$68,000$." 
- So, the null and alternative hypotheses can be written as:
  - $H_0 : \mu \ge \$68,000$       
  - $H_a : \mu < \$68,000$. (Claim)
- Because the test is a left-tailed test and the level of significance is $\alpha = 0.05$, the critical value is $z_0 = -1.645$ and the rejection region is $z < -1.645$. 
- The standardized test statistic is:
  - $z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}} = \frac{66900 - 68000}{5500 / \sqrt{20}} \approx -0.89$
  
![](./image/7_2_ex_9_left_tailed_test.png)

- The figure shows the location of the rejection region and the standardized test statistic $z$. 
- Because $z$ is not in the $\alpha = 0.05$ rejection region, we fail to reject the null hypothesis.

#### Interpretation  
- There is not enough evidence at the $5\%$ level of significance to support the employees’ claim that the mean salary is less than $\$68,000$.
- Be sure you understand the decision made in this example. Even though our sample has a mean of $\$66,900$, we cannot (at a $5\%$ level of significance) support the claim that the mean of all the mechanical engineers’ salaries is less than $\$68,000$. 
- The difference between your test statistic ($\bar{x} = \$66,900$) and the hypothesized mean ($\mu = \$68,000$) is probably due to sampling error.
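
The same calculation can be reproduced with SciPy, in the style of the earlier cells. The variable names `z_0` and `reject` are illustrative; the critical value comes from `stats.norm.ppf`, so it carries more decimal places than the $-1.645$ used above.

```python
import math
from scipy import stats

sigma = 5500
n = 20
mu = 68000
x_bar = 66900
alpha = 0.05

# Left-tailed test: the critical value has an area of alpha to its left.
z_0 = stats.norm.ppf(alpha)
z = (x_bar - mu) / (sigma / math.sqrt(n))

# H0 is rejected only when z falls in the rejection region z < z_0.
reject = z < z_0
print(f'z_0: {z_0:.3f}')
print(f'z: {z:.2f}')
print(f'reject H0: {reject}')
```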

### Hypothesis Testing Using Rejection Regions [example 10]

- A researcher claims that the mean annual cost of raising a child (age $2$ and under) by husband-wife families in the U.S. is $\$13,960$. 
- In a random sample of husband-wife families in the U.S., the mean annual cost of raising a child (age $2$ and under) is $\$13,725$. 
- The sample consists of $500$ children. 
- Assume the population standard deviation is $\$2,345$. 
- At $\alpha = 0.10$, is there enough evidence to reject the claim?

### Hypothesis Testing Using Rejection Regions [solution]

- Because $\sigma$ is known ($\sigma = \$2,345$), the sample is random, and $n = 500 \ge 30$, we can use the $z$-test. 
- The claim is "the mean annual cost is $\$13,960$." 
- So, the null and alternative hypotheses are:
  - $H_0 : \mu = \$13,960$  (Claim)
  - $H_a : \mu \ne \$13,960$.