

### Statistical Estimation: Making Inferences About the Unknown

In statistics, our goal is often to understand a characteristic of a large group, called a **population**. This characteristic, like the true average height of all students in a country, is called a **population parameter (e.g., μ)**.

Since it's usually impossible to measure everyone in a population, we take a smaller subset, called a **sample**, and calculate a characteristic for that sample. This is called a **sample statistic (e.g., x̄, the sample mean)**.

An **estimate** is a specific, observed numerical value calculated from sample data that is used to guess the value of an unknown population parameter. There are two main types of estimates we can make.

---

### 1. Point Estimate

A point estimate is a **single numerical value** used to provide our "best guess" for an unknown population parameter.

*   **Definition:** It is one single number that represents our most likely value for the parameter.
*   **Key Example:** The most common example is using the **sample mean (x̄)** as a point estimate for the **population mean (μ)**.

**Illustration:**
*   You want to know the true average score (μ) of all students on a national exam (the population). This is the unknown parameter.
*   You take a sample of students and find their average score is **60 (x̄)**.
*   This single value, 60, is your **point estimate** for the true average score of *all* students.

**Common Point Estimates:**
*   **Sample Mean (x̄)** estimates the **Population Mean (μ)**.
*   **Sample Proportion (p̂)** estimates the **Population Proportion (p)**.
*   **Sample Standard Deviation (s)** estimates the **Population Standard Deviation (σ)**.
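
As a minimal sketch of the three point estimates listed above, the snippet below computes each one from a small sample. The scores and the pass mark of 60 are hypothetical values chosen only for illustration; they do not come from a real exam.

```python
import statistics

# Hypothetical exam scores for a small sample (illustrative data only).
scores = [52, 61, 58, 70, 64, 55, 60]
passed = [score >= 60 for score in scores]  # assumed pass mark of 60

x_bar = statistics.mean(scores)      # point estimate of the population mean
s = statistics.stdev(scores)         # point estimate of sigma (n - 1 denominator)
p_hat = sum(passed) / len(passed)    # point estimate of the population proportion

print(f"x_bar = {x_bar:.2f}, s = {s:.2f}, p_hat = {p_hat:.2f}")
```

Note that `statistics.stdev` divides by `n - 1`, which is the usual convention when a sample is used to estimate σ.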

**The Major Limitation of Point Estimates:**
While a point estimate gives a precise "best guess," it has a very high probability of being wrong. It's extremely unlikely that the sample mean you calculated (60) is *exactly* the same as the true population mean. Furthermore, the point estimate gives us **no information about our uncertainty**. We don't know if the true mean is likely to be very close to 60 or very far away.

---

### 2. Interval Estimate

An interval estimate addresses the uncertainty of a point estimate by providing a **range of plausible values** that is likely to contain the unknown population parameter.

*   **Definition:** Instead of a single number, it is an interval (e.g., "between 55 and 65") that is used to estimate the parameter.
*   **Key Concept:** The most common type of interval estimate is called a **Confidence Interval**.

**Illustration:**
Instead of just saying our best guess for the true average score is 60, we can construct an interval estimate.
*   Our point estimate (sample mean) is 60.
*   We calculate a margin of error based on our sample's variability and size.
*   We then state that we are "95% confident that the true population mean (μ) lies somewhere **between 55 and 65**."

This range, `[55, 65]`, is the **interval estimate**.
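
The margin-of-error step can be sketched in code. The text does not state the sample size or standard deviation behind the `[55, 65]` interval, so the values `sigma = 15` and `n = 36` below are assumptions picked to give a similar width. The sketch also assumes σ is known, so a Z-based interval applies.

```python
from math import sqrt
from statistics import NormalDist

def z_confidence_interval(x_bar, sigma, n, confidence=0.95):
    """Return (lower, upper) for a mean when the population sigma is known."""
    alpha = 1 - confidence
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96 for 95%
    margin = z_crit * sigma / sqrt(n)             # margin of error
    return x_bar - margin, x_bar + margin

# Sample mean 60 is from the text; sigma = 15 and n = 36 are hypothetical.
low, high = z_confidence_interval(60, sigma=15, n=36)
print(f"95% CI: [{low:.1f}, {high:.1f}]")
```

With these assumed inputs the margin of error is a little under 5, so the interval comes out close to the one quoted in the illustration.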

**Why is an Interval Estimate More Useful?**
An interval estimate is much more informative because it provides two key pieces of information:
1.  **A "Best Guess" Range:** It gives us a set of plausible values for the population parameter.
2.  **A Measure of Confidence:** It tells us how confident we are that the true parameter lies within that range. The width of the interval also indicates the precision of our estimate—a narrow interval suggests a more precise estimate than a wide one.

### Summary: Point vs. Interval Estimate

| Feature                | Point Estimate                                   | Interval Estimate                                         |
|------------------------|--------------------------------------------------|-----------------------------------------------------------|
| **What it is**         | A single numerical value.                        | A range of numerical values.                              |
| **Example**            | "The estimated average score is 60."             | "We are 95% confident the average score is between 55 and 65." |
| **Information Provided** | A single "best guess."                           | A range of plausible values and a level of confidence.    |
| **Precision**          | Appears precise, but is very likely to be wrong. | Acknowledges uncertainty and provides a measure of it.      |
| **Utility**            | A good starting point.                           | More useful for making informed decisions and conclusions. |


### Hypothesis Testing & Statistical Analysis: An Overview

Hypothesis testing is the formal procedure for using sample data to draw conclusions about a population. The goal is to determine if there is enough evidence to reject a default assumption (the null hypothesis). The choice of statistical test depends on the type of data and the research question.

These notes cover some of the most fundamental tests:
*   **Z-test & t-test:** Used to compare the **average** (mean) of a group to a known value or to compare the averages of two groups.
*   **Chi-Square (χ²) test:** Used to analyze **categorical data** to see if there is a relationship between two variables (e.g., is there a relationship between gender and voting preference?).
*   **ANOVA (Analysis of Variance):** Used to compare the **averages** of three or more groups simultaneously. It analyzes the **variance** between and within groups.

This section focuses on the **Z-test**; the t-test is covered afterward.

---

### The Z-test

The Z-test is a statistical test used to determine whether a sample mean differs significantly from a population mean (or whether two population means differ) when the population variance is known and the sample size is large.

**When to use a Z-test:**
1.  You are comparing a sample mean to a population mean.
2.  The **population standard deviation (σ) is known**.
3.  The **sample size (n) is large (typically n > 30)**, or the population is known to be normally distributed.

The output of a Z-test is a **Z-score** (or Z-statistic), which is then used to calculate a **p-value**.

---

### Example 1: Average Heights (A Two-Tailed Test)

This is a classic example of a two-tailed test, where we are looking for any significant *difference* (either higher or lower) from the population mean.

**Scenario:** The average height of all residents in a city is **168 cm** (μ) with a population standard deviation of **3.9 cm** (σ). A doctor measures a sample of **36 individuals** (n) and finds their average height to be **169.5 cm** (x̄).

**Question:** At a 95% confidence level (α = 0.05), is there enough evidence to say the true average height is different from 168 cm?

#### Step 1: State the Null and Alternate Hypotheses
*   **Null Hypothesis (H₀):** `μ = 168 cm` (The true average height is 168 cm).
*   **Alternate Hypothesis (H₁):** `μ ≠ 168 cm` (The true average height is *not* 168 cm).

#### Step 2: Establish the Decision Boundary (Critical Value Method)
*   **Confidence Level:** 95%
*   **Significance Level (α):** `1 - 0.95 = 0.05`.
*   **Test Type:** Two-tailed test, because we are testing for "not equal to." We split the alpha between the two tails: `0.05 / 2 = 0.025` in each tail.
*   **Critical Z-values:** We look up the Z-score that corresponds to the cumulative area of `1 - 0.025 = 0.975`. This gives us a critical Z-value of **±1.96**.

**Decision Rule:** If our calculated Z-score is **less than -1.96** or **greater than +1.96**, we will reject the null hypothesis.
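
Instead of reading a Z-table, the critical value can be looked up programmatically. This is a small sketch using Python's standard-library `statistics.NormalDist`; the helper name `two_tailed_critical_z` is made up for illustration.

```python
from statistics import NormalDist

def two_tailed_critical_z(alpha):
    """Positive critical value; the rejection region is |Z| > this value."""
    # Half of alpha goes in each tail, so look up the 1 - alpha/2 quantile.
    return NormalDist().inv_cdf(1 - alpha / 2)

z_crit = two_tailed_critical_z(0.05)
print(f"Reject H0 if Z < {-z_crit:.2f} or Z > {z_crit:.2f}")
```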

#### Step 3: Calculate the Z-statistic
`Z = (x̄ - μ) / (σ / √n)`
`Z = (169.5 - 168) / (3.9 / √36)`
`Z = 1.5 / (3.9 / 6)`
`Z = 1.5 / 0.65`
`Z ≈ 2.31`

#### Step 4: Make a Conclusion
*   **Using the Critical Value Method:** Our Z-score of **2.31** is greater than the critical value of **1.96**. Therefore, it falls in the rejection region.
*   **Using the P-value Method:**
    1.  Find the probability associated with a Z-score of 2.31. The area to the right of 2.31 is approximately **0.01044**.
    2.  Since this is a two-tailed test, we must double this value: `p-value = 2 * 0.01044 = 0.02088`.
    3.  Compare the p-value to alpha: `0.02088 < 0.05`.

**Final Conclusion:** Both methods lead to the same conclusion: We **reject the null hypothesis**. The evidence suggests that the average height of the residents is significantly different from 168 cm. Because the sample mean (169.5 cm) lies above 168 cm, the true average appears to be higher.
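
The whole example can be reproduced with a short sketch. The function name `z_test_two_tailed` is hypothetical. Because the code keeps Z at full precision rather than rounding it to 2.31, its p-value (about 0.021) differs slightly from the hand-calculated 0.02088; the decision is the same.

```python
from math import sqrt
from statistics import NormalDist

def z_test_two_tailed(x_bar, mu0, sigma, n):
    """Return (z, p_value) for H0: mu = mu0 against H1: mu != mu0."""
    z = (x_bar - mu0) / (sigma / sqrt(n))
    # Double the upper-tail area beyond |z| to cover both tails.
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value

z, p = z_test_two_tailed(x_bar=169.5, mu0=168, sigma=3.9, n=36)
alpha = 0.05
print(f"Z = {z:.2f}, p-value = {p:.4f}, reject H0: {p < alpha}")
```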

---

### Example 2: Light Bulb Warranty (A One-Tailed Test)

This is an example of a one-tailed test, where we are looking for a difference in a *specific direction* (in this case, "less than").

**Scenario:** A factory produces bulbs with an average warranty of **5 years** (μ) and a standard deviation of **0.5 years** (σ). A worker tests a sample of **40 bulbs** (n) and finds the average life is **4.8 years** (x̄).

**Question:** At a 2% significance level (α = 0.02), is there enough evidence to support the idea that the bulbs malfunction in *less than* 5 years?

#### Step 1: State the Hypotheses
*   **Null Hypothesis (H₀):** `μ = 5 years` (The average warranty is 5 years).
*   **Alternate Hypothesis (H₁):** `μ < 5 years` (The average warranty is *less than* 5 years). This is what the worker believes.

#### Step 2: Establish the Decision Boundary
*   **Significance Level (α):** 0.02.
*   **Test Type:** One-tailed test (left-tailed, because we are testing for "less than"). The entire rejection region of 2% is in the left tail.
*   **Critical Z-value:** We look for the Z-score corresponding to an area of 0.02 in the left tail, which is approximately **-2.05**.

**Decision Rule:** If our calculated Z-score is **less than -2.05**, we will reject H₀.

#### Step 3: Calculate the Z-statistic
`Z = (x̄ - μ) / (σ / √n)`
`Z = (4.8 - 5.0) / (0.50 / √40)`
`Z = -0.2 / (0.50 / 6.32)`
`Z = -0.2 / 0.079`
`Z ≈ -2.53`

#### Step 4: Make a Conclusion
*   **Using the Critical Value Method:** The calculated Z-score of **-2.53** is less than the critical value of **-2.05**. It falls in the rejection region.
*   **Using the P-value Method:**
    1.  The p-value is the area to the left of our calculated Z-score of -2.53. This area is **0.0057**.
    2.  Compare the p-value to alpha: `0.0057 < 0.02`.

**Final Conclusion:** Both methods agree. Since `0.0057 < 0.02`, we **reject the null hypothesis**. There is strong evidence to support the worker's belief that the bulbs last, on average, for less than 5 years. The warranty should likely be revised.
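
A left-tailed version of the test can be sketched the same way. The function name `z_test_left_tailed` is an illustrative assumption; the inputs are the values from the scenario above.

```python
from math import sqrt
from statistics import NormalDist

def z_test_left_tailed(x_bar, mu0, sigma, n, alpha):
    """Return (z, z_crit, p_value) for H0: mu = mu0 against H1: mu < mu0."""
    std_normal = NormalDist()
    z = (x_bar - mu0) / (sigma / sqrt(n))
    z_crit = std_normal.inv_cdf(alpha)  # all of alpha sits in the left tail
    p_value = std_normal.cdf(z)         # area to the left of z
    return z, z_crit, p_value

z, z_crit, p = z_test_left_tailed(x_bar=4.8, mu0=5.0, sigma=0.5, n=40, alpha=0.02)
print(f"Z = {z:.2f}, critical Z = {z_crit:.2f}, p-value = {p:.4f}")
print("Reject H0" if z < z_crit else "Fail to reject H0")
```

Unlike the two-tailed case, the p-value is not doubled here, because only the left tail counts as evidence against H₀.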



### The Student's t-Distribution: Handling the Unknown

In statistical analysis, the Z-test is a powerful tool for hypothesis testing. However, it relies on a critical piece of information that is often unavailable in real-world scenarios: the **population standard deviation (σ)**.

The fundamental question then becomes:
> "How do we perform a reliable analysis when we don't know the population's true standard deviation?"

The answer is the **Student's t-distribution**.

#### When to Use the t-distribution Instead of the Z-distribution

You should use the t-distribution (and a **t-test**) under the following conditions:
1.  The **population standard deviation (σ) is unknown**.
2.  The **sample size is small (typically n < 30)**. With larger samples the t-test still applies, and its results closely match those of the Z-test.
3.  The population from which the sample is drawn is assumed to be approximately normally distributed.

Instead of the unknown `σ`, we use the **sample standard deviation (s)** as an estimate.

#### The t-statistic vs. the Z-statistic

The formulas for the two statistics look very similar, but their underlying distributions are different.

*   **Z-statistic:**
    `Z = (x̄ - μ) / (σ / √n)`
    This formula uses the *true population parameter* `σ`. The Z-statistic follows a standard normal distribution (Z-distribution).

*   **t-statistic:**
    `t = (x̄ - μ) / (s / √n)`
    This formula uses the *sample statistic* `s` as an estimate for `σ`. The t-statistic follows a Student's t-distribution.

The key difference is that `s` (the sample standard deviation) will vary from sample to sample, while `σ` is a fixed constant for the population. This extra variability introduced by estimating the standard deviation is what makes the t-distribution different from the Z-distribution.

#### Characteristics of the t-distribution

The t-distribution is a family of distributions that resembles the normal distribution in shape but differs in its tails and depends on an extra parameter:
*   **Shape:** It is symmetric and bell-shaped, just like the normal distribution.
*   **Tails:** The t-distribution has **"fatter" or "heavier" tails** than the normal distribution. This means it assigns more probability to extreme values. This accounts for the extra uncertainty we have because we are using `s` to estimate `σ`.
*   **Degrees of Freedom (df):** The exact shape of the t-distribution is determined by a parameter called the **degrees of freedom (df)**.

As the **degrees of freedom increase** (which happens as the sample size `n` gets larger), the t-distribution gets closer and closer to the standard normal distribution. By the time `n` is over 30, the two are practically identical.
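
The heavier tails can be seen in a small simulation. This is a rough sketch, not a proof: it draws repeated samples from a standard normal population (an assumed setup), computes both statistics for each sample, and counts how often each exceeds 1.96 in absolute value. The function name, trial count, and seed are arbitrary choices.

```python
import random
from math import sqrt

def tail_fraction(n, trials=20000, cutoff=1.96, seed=0):
    """Return the fractions of samples with |Z| > cutoff and with |t| > cutoff."""
    rng = random.Random(seed)
    mu, sigma = 0.0, 1.0  # assumed true population values
    z_hits = t_hits = 0
    for _ in range(trials):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        x_bar = sum(sample) / n
        s = sqrt(sum((x - x_bar) ** 2 for x in sample) / (n - 1))
        z = (x_bar - mu) / (sigma / sqrt(n))  # uses the true sigma
        t = (x_bar - mu) / (s / sqrt(n))      # uses the sample s
        z_hits += int(abs(z) > cutoff)
        t_hits += int(abs(t) > cutoff)
    return z_hits / trials, t_hits / trials

for n in (5, 30):
    z_frac, t_frac = tail_fraction(n)
    print(f"n = {n:2d}: |Z| > 1.96 in {z_frac:.3f} of samples, |t| > 1.96 in {t_frac:.3f}")
```

For the Z-statistic the fraction should stay near 5% at any sample size. For the t-statistic it should be clearly larger at `n = 5` and move toward 5% as `n` grows.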

#### Understanding Degrees of Freedom (df)

Degrees of freedom represent the number of independent pieces of information available to estimate a parameter. For a one-sample t-test, the formula is:

**df = n - 1**

where `n` is the sample size.

**Intuitive Explanation:**
Imagine you have a sample of 3 people (`n=3`), and you know the average height of these 3 people is 170 cm.
*   The first person's height can be anything—it's free to vary.
*   The second person's height can also be anything—it's free to vary.
*   However, once you know the heights of the first two people, the height of the **third person is fixed**. It must be a specific value to make the average of all three equal to 170 cm.

So, in a sample of 3, only 2 values are "free" to vary. Therefore, the degrees of freedom is `3 - 1 = 2`.
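
The same idea in code: once the mean and all but one value are known, the last value is forced. The two heights below are hypothetical; only the mean of 170 cm comes from the explanation above.

```python
def last_value_given_mean(known_values, target_mean):
    """Return the one remaining value that makes the overall mean equal target_mean."""
    n = len(known_values) + 1
    return n * target_mean - sum(known_values)

# Hypothetical heights for the first two people (free to vary).
first_two = [165, 172]
third = last_value_given_mean(first_two, target_mean=170)
print(f"Third height must be {third} cm; degrees of freedom = {len(first_two)}")
```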

When performing a t-test, you will use a **t-table**, which requires you to know both your desired significance level (alpha) and the degrees of freedom to find the critical t-value for your hypothesis test.


### One-Sample t-Test: The Goal

The one-sample t-test is a statistical procedure used to determine if the **mean of a single sample** is significantly different from a **known or hypothesized population mean**. We use a t-test instead of a Z-test because the **population standard deviation (σ) is unknown**, and we must use the **sample standard deviation (s)** as an estimate.

---

### The Scenario: Testing a New Medication

**Problem:** The average IQ in the general population is **100**. A research team wants to know if a new medication affects intelligence. They give the medication to a sample of **30 participants** and find that their average IQ is **140**, with a sample standard deviation of **20**.

**Question:** Using a 95% confidence level (α = 0.05), is there enough evidence to conclude that the medication has an effect (either positive or negative) on intelligence?

---

### The Hypothesis Testing Procedure

We follow the standard 5-step process for hypothesis testing.

#### Step 1: State the Null and Alternate Hypotheses

First, we state our assumptions. The null hypothesis represents "no effect," while the alternate hypothesis represents what the researchers are trying to find evidence for.

*   **Null Hypothesis (H₀): `μ = 100`**
    This assumes the medication has **no effect**, and the average IQ of the group taking it is the same as the general population mean.

*   **Alternate Hypothesis (H₁): `μ ≠ 100`**
    This assumes the medication **does have an effect**, causing the average IQ to be different from the population mean. This is a **two-tailed test** because we are interested in any difference, whether it's an increase or a decrease in IQ.

#### Step 2: Set the Significance Level (α)

This is our threshold for what we consider "statistically significant."
*   **Significance Level (α) = 0.05**
    This means we are willing to accept a 5% risk of incorrectly rejecting the null hypothesis when it is actually true (a Type I error).

#### Step 3: Determine the Degrees of Freedom (df)

The shape of the t-distribution depends on the sample size, which is accounted for by the degrees of freedom.
*   **Formula:** `df = n - 1`
*   **Calculation:** `df = 30 - 1 = 29`

#### Step 4: Establish the Decision Rule (Find the Critical Value)

Based on our `α` and `df`, we find the critical t-values from a t-distribution table. These values create the boundaries for our "rejection regions."

*   **Significance Level (α):** 0.05
*   **Test Type:** Two-tailed, so we split the alpha: `0.05 / 2 = 0.025` in each tail.
*   **Degrees of Freedom (df):** 29
*   Looking up these values in a t-table gives us a critical t-value of **±2.045**.

**Decision Rule:** If our calculated t-statistic is **less than -2.045** or **greater than +2.045**, it will fall into the rejection region, and we will reject the null hypothesis.

#### Step 5: Calculate the Test Statistic (t-statistic)

Now, we calculate the t-statistic for our sample. This value tells us how many standard errors our sample mean is away from the population mean.

*   **Formula:** `t = (x̄ - μ) / (s / √n)`
    *   x̄ = Sample mean (140)
    *   μ = Population mean (100)
    *   s = Sample standard deviation (20)
    *   n = Sample size (30)

*   **Calculation:**
    `t = (140 - 100) / (20 / √30)`
    `t = 40 / (20 / 5.477)`
    `t = 40 / 3.65`
    `t ≈ 10.96`

---

### Conclusion and Interpretation

Now we compare our calculated t-statistic to our critical value from Step 4.

*   **Calculated t-statistic = 10.96**
*   **Critical t-value = ±2.045**

**Decision:**
Since **10.96 is much greater than 2.045**, our result falls deep inside the rejection region. Therefore, we **Reject the Null Hypothesis (H₀)**.
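
The calculation and decision can be checked with a short sketch. Python's standard library has no t-distribution, so the critical value 2.045 is taken from the t-table lookup in Step 4 rather than computed; the function name `one_sample_t` is hypothetical. At full precision the statistic comes out near 10.95, slightly below the hand-rounded 10.96, which does not change the decision.

```python
from math import sqrt

def one_sample_t(x_bar, mu0, s, n):
    """Return (t, df) for a one-sample t-test of H0: mu = mu0."""
    t = (x_bar - mu0) / (s / sqrt(n))
    return t, n - 1

t_stat, df = one_sample_t(x_bar=140, mu0=100, s=20, n=30)
t_crit = 2.045  # two-tailed, alpha = 0.05, df = 29 (table value from Step 4)

print(f"t = {t_stat:.2f}, df = {df}")
print("Reject H0" if abs(t_stat) > t_crit else "Fail to reject H0")
```

If SciPy is available, `scipy.stats.t.ppf(0.975, df=29)` should give the same critical value without a table.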

**Interpretation in Plain English:**
The results from our sample are highly statistically significant. There is very strong evidence to suggest that the medication has an effect on intelligence. Since the sample mean (IQ of 140) is significantly higher than the population mean (IQ of 100), we can conclude that the medication appears to substantially increase intelligence scores.