In [1]:
import numpy as np
import os
import sys

sys.path.append(os.path.join(sys.path[0], os.path.pardir))

from utils import NormalDistribution

### **10.1.** TESTING A HYPOTHESIS ABOUT SAT SCORES

#### **Hypothesized Sampling Distribution:**
- If the null hypothesis is true, then the distribution of sample means will be centered about the population mean.
- When the sampling distribution of the mean is centered about the hypothesized population mean, it is referred to as the <i>hypothesized</i> sampling distribution.
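
A minimal simulation sketch of a sampling distribution centered about the population mean, using only the standard library. The population values (mean 500, standard deviation 110), the sample size, and the number of samples are illustrative assumptions, not figures taken from the text.

```python
import random
import statistics

# Hypothetical population parameters, chosen only for illustration.
POP_MEAN = 500
POP_SD = 110
SAMPLE_SIZE = 100
NUM_SAMPLES = 2000

rng = random.Random(0)  # fixed seed so the run is repeatable

def sample_mean(n):
    """Mean of one random sample of size n from the hypothetical population."""
    return statistics.fmean(rng.gauss(POP_MEAN, POP_SD) for _ in range(n))

sample_means = [sample_mean(SAMPLE_SIZE) for _ in range(NUM_SAMPLES)]

center = statistics.fmean(sample_means)  # expected to land close to POP_MEAN
spread = statistics.stdev(sample_means)  # expected to land close to POP_SD / sqrt(n)

print(f"mean of sample means: {center:.1f}")
print(f"standard deviation of sample means: {spread:.1f}")
```

If the null hypothesis were true for this made-up population, the sample means would cluster about 500 with a spread near 110 / 10 = 11, which is what the hypothesized sampling distribution describes.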

#### **Common Outcomes:**
- Definition:
    + An observed sample mean qualifies as a common outcome if the difference between its value and that of the hypothesized population mean is small enough to be viewed as a probable outcome under the null hypothesis.

#### **Rare Outcomes:**
- Definition:
    + An observed sample mean qualifies as a rare outcome if the difference between its value and that of the hypothesized population mean is too large to be reasonably viewed as a probable outcome under the null hypothesis.

#### **Boundaries for Common and Rare Outcomes:**

### **10.2.** z TEST FOR A POPULATION MEAN

- Definition:
    + Sampling Distribution of z: the distribution of z values that would be obtained if a value of z were calculated for each sample mean for all possible random samples of a given size from some population.
- The Sampling Distribution of z is the basis for conducting a standardized hypothesis test.

#### **Reminder: Convert a Raw Score to z**
- Following the convention for standard z scores, each sample mean is converted to a standard score as: <br><br>
<center>$\Large standard\,\, score = \frac{raw\,\, score\, -\, mean}{standard\,\, deviation}$</center> <br>
- Based on the Sampling Distribution of z, an observed mean is regarded as a common or a rare outcome according to where its z score lies:
    + In the interval of [-1.96, 1.96]: the Null Hypothesis is retained
    + In the tail of either side, (-$\infty$, -1.96) or (1.96, $\infty$): the Null Hypothesis is rejected. <br>
![image.png](attachment:b0a78e5b-dfd5-4e8a-8530-c844c832bd9f.png)

#### **Converting a Sample Mean to z:**
<center><b>z RATIO FOR A POPULATION MEAN</b></center>
<center>$\Large z = \frac{\overline{X}\, -\, \mu_{hyp}}{\sigma_{\overline{X}}}$</center>

#### **Assumptions of z Test:**
- Definition:
    + z Test for a Population Mean: A hypothesis test that evaluates how far the observed sample mean deviates, in standard error units, from the hypothesized population mean.
- The z Test is accurate only if:
    + (1): the population is normally distributed or the sample size is large enough to satisfy the requirements of the central limit theorem
    + (2): the population standard deviation is known.
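
The z ratio above can be sketched in a few lines. The helper `z_ratio` and its argument order are assumptions for illustration (the order mirrors how `NormalDistribution.mean_zscore` appears to be called in this notebook, but this is not that function's source). The standard error of the mean is computed as $\sigma / \sqrt{n}$.

```python
import math

def z_ratio(mu_hyp, sigma, sample_mean, n):
    """z ratio for a population mean (hypothetical helper, not utils.NormalDistribution)."""
    standard_error = sigma / math.sqrt(n)  # sigma_Xbar = sigma / sqrt(n)
    return (sample_mean - mu_hyp) / standard_error

# Illustrative values: hypothesized mean 560, sigma 30, observed mean 566, n = 36.
z = z_ratio(mu_hyp=560, sigma=30, sample_mean=566, n=36)
print(round(z, 2))
```

Because the function needs `sigma`, it only applies when the population standard deviation is known, which is assumption (2) above.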

#### **Progress Check 10.1:** Calculate the value of the z test for each of the following situations:

In [16]:
# Solution:
# a):
NormalDistribution.mean_zscore(560, 30, 566, 36)

1.2

In [21]:
# b):
NormalDistribution.mean_zscore(24, 4, 25, 64)

2.0

In [18]:
# c):
NormalDistribution.mean_zscore(75, 14, 82, 49)

3.5

In [19]:
# d):
NormalDistribution.mean_zscore(146, 15, 136, 25)

-3.33

### **10.3.** STEP-BY-STEP PROCEDURE

### **10.4.** STATEMENT OF THE RESEARCH PROBLEM

![image.png](attachment:361cc35b-dfad-485a-b3dd-52bd095ebd0b.png)

### **10.5.** NULL HYPOTHESIS $(H_0)$

- Notation: <br>
<center>$\Large H_0 : \mu = \mu_{hyp}$</center>

- Definition:
    + Null Hypothesis ($H_0$): A statistical hypothesis that usually asserts that nothing special is happening with respect to some characteristic of the underlying population.

- Because the hypothesis testing procedure requires that the hypothesized sampling distribution of the mean be centered about a single number, the Null Hypothesis equals a single number.
- Furthermore, the Null Hypothesis always makes a precise statement about a characteristic of a population, never about a sample.
- The goal of a hypothesis test is to determine whether a particular outcome, such as an observed sample mean, could have reasonably originated from a population with a hypothesized characteristic.

#### **Finding the Single Number for $H_0$:**
- The single number for $H_0$ varies from problem to problem. It is never obtained arbitrarily; it is derived using some other technique.

### **10.6.** ALTERNATIVE HYPOTHESIS $(H_1)$

- Notation: <br>
<center>$\Large H_1 : \mu \neq \mu_{hyp}$</center>

- Definition:
    + Alternative Hypothesis ($H_1$): The opposite of the Null Hypothesis.

- $H_1$ is usually identified with the <b>research hypothesis</b>, <i>the informal hypothesis or hunch that, by implying the presence of something special in the underlying population, serves as inspiration for the entire investigation</i>.

#### **Progress Check 10.2:** Indicate what’s wrong with each of the following statistical hypotheses:

a) The single numbers in $H_0$ and $H_1$ (which represent the hypothesized population mean) are not the same, so the two hypotheses cannot belong to a single hypothesis test.

b) The notation for the mean is incorrect: a hypothesis makes a statement about a population mean, not about a sample mean.

#### **Progress Check 10.3:** First using words, then symbols, identify the null hypothesis for each of the following situations. (Don’t concern yourself about the precise form of the alternative hypothesis at this point.)

(a) A school administrator wishes to determine whether sixth-grade boys in her school district differ, on average, from the national norms of 10.2 pushups for sixth-grade boys. <br>
Null Hypothesis: The average number of pushups that sixth-grade boys in the administrator's school district can do is equal to the national norm for sixth-grade boys. <br>
Alternative Hypothesis: There is a real difference between the average number of pushups that the local boys can do and the national norm. <br>
$H_0$: $\mu = 10.2$ <br>
$H_1$: $\mu \neq 10.2$.

(b) A consumer group investigates whether, on average, the true weights of packages of ground beef sold by a large supermarket chain differ from the specified 16 ounces. <br>
Null Hypothesis: The average weight of packages of ground beef sold by the supermarket chain equals 16 ounces. <br>
Alternative Hypothesis: The average weight of packages of ground beef sold by the supermarket chain differs from 16 ounces. <br>
$H_0$: $\mu = 16$ <br>
$H_1$: $\mu \neq 16$.

(c) A marriage counselor wishes to determine whether, during a standard conflict-resolution session, his clients differ, on average, from the 11 verbal interruptions reported for "well adjusted couples". <br>
Null Hypothesis: The average number of verbal interruptions by the counselor's clients equals 11, the number reported for "well adjusted couples". <br>
Alternative Hypothesis: The average number of verbal interruptions by the counselor's clients differs from the number reported for "well adjusted couples". <br>
$H_0$: $\mu = 11$ <br>
$H_1$: $\mu \neq 11$

### **10.7.** DECISION RULE

- Definition:
    + Decision Rule: Specifies precisely when $H_0$ should be rejected (because the observed z qualifies as a rare outcome).

#### **Critical z Scores:**
- Definition:
    + Critical z Score: A z score that separates common from rare outcomes and hence dictates if $H_0$ should be retained or rejected.
- The range [-1.96, 1.96] defines the boundaries for the middle 0.95 proportion of the total area (1.00) under the hypothesized sampling distribution.

#### **Level of Significance ($\alpha$)**
- Definition:
    + Level of Significance ($\alpha$): The degree of rarity required of an observed outcome in order to reject the Null Hypothesis ($H_0$).
- The Level of Significance corresponds to an area under the hypothesized sampling distribution. At the .05 level, the rare outcomes occupy a combined area of 0.05 in the two tails, so an outcome that leads to rejection has a probability of 5% or less of occurring when $H_0$ is true.
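
The link between the level of significance and the critical z scores can be checked with the standard library's `statistics.NormalDist`. The helper name `critical_z` is an assumption for illustration; it splits $\alpha$ equally between the two tails, matching the two-sided boundaries of ±1.96 used here.

```python
from statistics import NormalDist

standard_normal = NormalDist()  # mean 0, standard deviation 1

def critical_z(alpha):
    """Positive critical z for a two-tailed test; alpha is split between both tails."""
    return standard_normal.inv_cdf(1 - alpha / 2)

z_crit = critical_z(0.05)
middle_area = standard_normal.cdf(1.96) - standard_normal.cdf(-1.96)

print(round(z_crit, 2))       # 1.96
print(round(middle_area, 2))  # 0.95
```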

### **10.8.** CALCULATIONS

### **10.9.** DECISION

#### **Retain or Reject $H_0$:**
- The decision to retain or reject $H_0$ depends on whether the z score of the observed mean qualifies as a common or a rare outcome.

#### **Progress Check 10.4:** For each of the following situations, indicate whether $H_0$ should be retained or rejected and justify your answer by specifying the precise relationship between observed and critical z scores. Should $H_0$ be retained or rejected, given a hypothesis test with critical z scores of ± 1.96 and

(a) z = 1.74. <br>
Retained, since z = 1.74 is positive and less than 1.96, the upper critical z score. <br>
(b) z = 0.13. <br>
Retained, since z = 0.13 lies between the critical z scores of -1.96 and 1.96. <br>
(c) z = –2.51. <br>
Rejected, since z = -2.51 is more negative than -1.96, the lower critical z score.
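
The same comparisons can be sketched as a small function. The name `decide` is hypothetical, and treating a z score exactly equal to ±1.96 as retained is an assumption that follows the closed interval [-1.96, 1.96] stated earlier in this notebook.

```python
def decide(z, critical=1.96):
    """Two-tailed decision rule: reject H0 when z lies beyond either critical z score."""
    # Boundary handling (z exactly equal to +/-critical is retained) is an assumption.
    return "reject H0" if abs(z) > critical else "retain H0"

for z in (1.74, 0.13, -2.51):
    print(f"z = {z:+.2f}: {decide(z)}")
```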

### **10.10.** INTERPRETATION

#### **Progress Check 10.5:** According to the American Psychological Association, members with a doctorate and a full-time teaching appointment earn, on the average, $\$82,500$ per year, with a standard deviation of $\$6,000$. An investigator wishes to determine whether $\$82,500$ is also the mean salary for all female members with a doctorate and a full-time teaching appointment. Salaries are obtained for a random sample of 100 women from this population, and the mean salary equals $\$80,100$.

(a) Someone claims that the observed difference between $\$80,100$ and $\$82,500$ is large enough by itself to support the conclusion that female members earn less than male members. Explain why it is important to conduct a hypothesis test. <br>
Without accounting for variability, a conclusion about the difference could be biased and would disregard the effect of chance. The hypothesis test, on the other hand, permits us to evaluate the effect of chance by measuring the observed difference relative to the standard error of the mean.

(b) The investigator wishes to conduct a hypothesis test for what population? <br>
For the population of all female APA members with a doctorate and a full-time teaching appointment.

(c) What is the null hypothesis, $H_0$? <br>
$H_0$ : $\mu = 82,500$ <br>
(d) What is the alternative hypothesis, $H_1$? <br>
$H_1$ : $\mu \neq 82,500$

(e) Specify the decision rule, using the .05 level of significance. <br>
Reject $H_0$ at the .05 level of significance if the observed z is more positive than 1.96 or more negative than -1.96; otherwise, retain $H_0$.

(f) Calculate the value of z. (Remember to convert the standard deviation to a standard error.) <br>
-4.0

In [25]:
NormalDistribution.mean_zscore(82500, 6000, 80100, 100)

-4.0

(g) What is your decision about $H_0$? <br>
The null hypothesis is rejected, since z = -4.0 is more negative than -1.96.
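
Steps (e) through (g) can be traced end to end in a short sketch. The variable names are illustrative, and the strict comparison at the boundary is an assumption that follows the interval [-1.96, 1.96] used earlier in this notebook.

```python
import math

# Values from Progress Check 10.5.
mu_hyp = 82_500       # hypothesized population mean salary
sigma = 6_000         # population standard deviation
sample_mean = 80_100  # observed mean salary for the sample of women
n = 100               # sample size
critical = 1.96       # critical z for the .05 level of significance, two-tailed

standard_error = sigma / math.sqrt(n)        # 6000 / 10 = 600
z = (sample_mean - mu_hyp) / standard_error  # -2400 / 600 = -4.0
decision = "reject H0" if abs(z) > critical else "retain H0"

print(z, decision)
```

Converting the standard deviation to a standard error first is what makes the \$2,400 difference large: it is four standard errors below the hypothesized mean.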

### **REVIEW QUESTIONS**