# Hypothesis Testing

Businesses make many decisions every day. Forming hypotheses and testing them with experiments is a more data-driven approach to decision-making than relying on guesswork.

---

## What is Hypothesis Testing?

- Hypothesis testing is a statistical mechanism used to make decisions or inferences about population parameters based on sample data.  
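- It helps to assess whether the sample data provide enough evidence to **reject or retain the claim** being tested; it does not prove a claim with certainty.  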
- It provides a structured framework to define the problem and make data-centric decisions.  

---

### **Statistical Hypothesis**

A **statistical hypothesis** is a formal, testable statement of the idea or assumption a researcher holds about the outcome before doing an experiment. Stating it explicitly provides a structured framework for testing and decision-making.

There are two types of hypotheses:

1. **Null Hypothesis (H₀)** – assumes no change or effect; old beliefs hold true.  
2. **Alternative Hypothesis (Hₐ)** – assumes there is a new effect, difference, or relationship.

---

### **Example 1: Census of Height**

Suppose we want to test whether the average height of a population differs from 160.

$$H_{0}: \mu = 160$$  
$$H_{a}: \mu \neq 160$$

---

### **Example 2: Fish Farm**

Suppose we want to test if the average weight of fish is greater than 2 kg.

$$H_{0}: \mu = 2$$  
$$H_{a}: \mu > 2$$

In general, any new claim or proposed change is defined in the **alternative hypothesis**.

---

## Types of Hypothesis Tests

### 1. **Two-Tailed Test**
Used when we are checking for any difference (increase or decrease).

$$H_{0}: \mu = 163$$  
$$H_{a}: \mu \neq 163$$

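Rejecting H₀ in a two-tailed test only shows that the mean differs from 163. Determining the direction of the difference requires further investigation, for example by checking the sign of the test statistic.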

---

### 2. **One-Tailed Test**
Used when we expect the difference to be in a specific direction (greater or smaller).

$$H_{0}: \mu = 2$$  
$$H_{a}: \mu > 2$$

Here the researcher believes the true mean is greater than the hypothesized value of 2. A test in the other direction would instead use $H_{a}: \mu < 2$.
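The difference between the two test types shows up in the cutoff used for the decision. The sketch below is a minimal illustration, assuming the test statistic follows a standard normal distribution (as in a z-test) and a significance level of 0.05; the helper name `critical_z` is hypothetical.

```python
from statistics import NormalDist

def critical_z(alpha: float, two_tailed: bool) -> float:
    """Return the positive critical z-value for a given significance level."""
    # A two-tailed test splits alpha equally between both tails;
    # a one-tailed test puts all of alpha in a single tail.
    tail_area = alpha / 2 if two_tailed else alpha
    return NormalDist().inv_cdf(1 - tail_area)

two_tailed_cutoff = critical_z(0.05, two_tailed=True)
one_tailed_cutoff = critical_z(0.05, two_tailed=False)
print(f"two-tailed: ±{two_tailed_cutoff:.3f}, one-tailed: {one_tailed_cutoff:.3f}")
# prints "two-tailed: ±1.960, one-tailed: 1.645"
```

Because the one-tailed cutoff is smaller, a one-tailed test rejects H₀ more easily in the chosen direction, but it cannot detect a difference in the opposite direction.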

---

## Interpreting Results

- If the **null hypothesis (H₀)** is rejected in favor of the **alternative hypothesis (Hₐ)**, we say the result is **statistically significant**.  
- This means a result at least as extreme as the one observed would be unlikely to occur by random chance alone if H₀ were true.

Example:  
If a sample mean of 2.1 kg is found to be statistically higher than 2 kg, the result is statistically significant, but a 0.1 kg difference is not necessarily **practically significant** for business use.

---

## Steps of Performing a Hypothesis Test

1. **State the hypotheses** – H₀ and Hₐ  
2. **Choose the significance level (α)** – usually 0.05  
3. **Select the appropriate statistical test** (z-test, t-test, etc.)  
4. **Collect the sample data**  
5. **Compute the test statistic and compare with the critical value** (or compute the p-value and compare it with α)  
6. **Make a decision** – reject or fail to reject H₀  

---

# Type I and Type II Errors

- **Type I Error (α):** Rejecting a true null hypothesis.  
- **Type II Error (β):** Failing to reject a false null hypothesis.  

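A researcher cannot commit both errors in the same test, because H₀ is either true or false.

- A **Type I error** can occur only when H₀ is rejected; its probability is **α (alpha)**, the significance level.  
- A **Type II error** can occur only when H₀ is not rejected but is false; its probability is **β (beta)**.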
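One way to see what the Type I error rate means is to simulate many experiments in which H₀ is true and count how often a two-tailed z-test rejects it anyway. The sketch below is illustrative only: the population values (mean 170, standard deviation 10), the sample size, and the function name `type_i_error_rate` are assumptions chosen for the example.

```python
import random
from math import sqrt
from statistics import mean

def type_i_error_rate(trials=2000, n=30, mu=170.0, sigma=10.0, z_crit=1.96, seed=0):
    """Estimate how often a true H0 is rejected by a two-tailed z-test."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    rejections = 0
    for _ in range(trials):
        # Draw a sample from a population where H0 (mean == mu) really holds.
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        z = (mean(sample) - mu) / (sigma / sqrt(n))
        if abs(z) > z_crit:
            rejections += 1  # H0 is true but was rejected: a Type I error
    return rejections / trials

rate = type_i_error_rate()
print(f"Observed Type I error rate: {rate:.3f}")
```

With a critical value of ±1.96, the observed rate should land close to 0.05, which is the α the test was designed for.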

---

## Example of a Statistical Test (When Population Information is Known)

The **Z-test formula**:

$$z = \frac{\bar{x} - \mu}{\frac{\sigma}{\sqrt{n}}}$$

Where:
- $\bar{x}$ = sample mean  
- $\mu$ = population mean (hypothesized)  
- $\sigma$ = population standard deviation  
- $n$ = sample size  

Example Hypothesis:

$$H_{0}: \mu = 170$$  
$$H_{a}: \mu \neq 170$$

This is a **two-tailed test**.

The critical z-value for a 0.05 significance level (two-tailed) is **±1.96**.

If the calculated z < -1.96 or > +1.96 → **Reject H₀**
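The decision rule can be sketched in a few lines of Python. The numbers passed in below (sample mean 168.5, σ = 6, n = 64) are hypothetical values chosen for illustration, not data from this document, and `z_test_two_tailed` is an assumed helper name.

```python
from math import sqrt

def z_test_two_tailed(sample_mean, mu0, sigma, n, z_crit=1.96):
    """Return the z statistic and whether H0: mu == mu0 is rejected."""
    standard_error = sigma / sqrt(n)
    z = (sample_mean - mu0) / standard_error
    # Reject when z falls in either tail beyond the critical value.
    reject = abs(z) > z_crit
    return z, reject

# Hypothetical inputs for H0: mu = 170 versus Ha: mu != 170.
z, reject = z_test_two_tailed(sample_mean=168.5, mu0=170, sigma=6, n=64)
print(f"z = {z:.2f}, reject H0: {reject}")
# prints "z = -2.00, reject H0: True"
```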

---

### Example Result:

- Observed z-value: -2.45  
- Critical value: ±1.96  
- Decision: **Reject H₀**

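If the true mean is actually not equal to 170, rejecting H₀ was the correct decision and we did **not** make a Type I error. In practice the true mean is unknown, so α = 0.05 is the Type I error risk we accepted in advance.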

---

# p-value (Observed Significance Level)

Another approach to making a decision is using the **p-value**.

The p-value represents the smallest α level at which H₀ can be rejected.

- If **p < 0.05** → strong evidence against H₀ → **Reject H₀**  
- If **p > 0.05** → weak evidence → **Fail to reject H₀**  
- If **p ≈ 0.05** → borderline evidence → decision uncertain  

Example:  
If z = 2.45 in a two-tailed test, p ≈ 0.014  
→ Reject H₀ at α = 0.05  
→ Fail to reject H₀ at α = 0.001  
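The p-value for a z statistic can be computed directly from the standard normal distribution. The sketch below uses Python's standard-library `statistics.NormalDist`; the helper name `p_value_from_z` is hypothetical, and the one-tailed result assumes the observed z lies in the direction stated by Hₐ.

```python
from statistics import NormalDist

def p_value_from_z(z, two_tailed=True):
    """Return the p-value for a z statistic under the standard normal."""
    # Area in the tail beyond |z| on one side of the distribution.
    upper_tail = 1 - NormalDist().cdf(abs(z))
    # A two-tailed test counts both tails.
    return 2 * upper_tail if two_tailed else upper_tail

p_two = p_value_from_z(2.45)
p_one = p_value_from_z(2.45, two_tailed=False)
print(f"two-tailed p = {p_two:.4f}, one-tailed p = {p_one:.4f}")

for alpha in (0.05, 0.001):
    decision = "Reject H0" if p_two < alpha else "Fail to reject H0"
    print(f"alpha = {alpha}: {decision}")
```

For z = 2.45 the two-tailed p-value is roughly 0.014 and the one-tailed p-value roughly 0.007, so H₀ is rejected at α = 0.05 but not at α = 0.001 in either case.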

---

# t-test for Mean Estimation of Population

Used when population standard deviation (σ) is **unknown**.

$$t = \frac{\bar{x} - \mu}{\frac{s}{\sqrt{n}}}$$

Where:  
- $\bar{x}$ = sample mean  
- $\mu$ = population mean (expected)  
- $s$ = sample standard deviation  
- $n$ = sample size  
- **Degrees of freedom (df)** = n - 1  

If **p < 0.05**, reject the null hypothesis.
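A minimal sketch of the one-sample t-test using only the standard library follows. The ten height values are hypothetical, and `one_sample_t` is an assumed helper name. Because the standard library has no t distribution, the example compares |t| with a critical value read from a t-table (2.262 for df = 9, two-tailed, α = 0.05) instead of computing a p-value; a statistics package such as SciPy can compute the p-value directly.

```python
from math import sqrt
from statistics import mean, stdev

def one_sample_t(sample, mu0):
    """Return the t statistic and degrees of freedom for H0: mu == mu0."""
    n = len(sample)
    s = stdev(sample)  # sample standard deviation (divides by n - 1)
    t = (mean(sample) - mu0) / (s / sqrt(n))
    return t, n - 1

# Hypothetical sample of 10 heights, tested against H0: mu = 170.
heights = [168, 172, 165, 170, 169, 171, 166, 167, 173, 164]
t, df = one_sample_t(heights, mu0=170)

T_CRIT_DF9 = 2.262  # two-tailed critical value for df = 9, alpha = 0.05
reject = abs(t) > T_CRIT_DF9
print(f"t = {t:.3f}, df = {df}, reject H0: {reject}")
# prints "t = -1.567, df = 9, reject H0: False"
```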

---

### Example Results:

| Test | Hypothesized Mean | t-Statistic | p-Value | Decision |
|------|--------------------|-------------|----------|-----------|
| t-test 1 | 170 | -2.26 | 0.026 | Reject H₀ |
| t-test 2 | 168 | -0.68 | 0.49 | Fail to reject H₀ |
| t-test 3 | 169 | -1.47 | 0.14 | Fail to reject H₀ |

---

### Summary:

- **Reject H₀** → p < 0.05 (statistically significant)  
- **Fail to Reject H₀** → p > 0.05 (not significant)  

---

**Conclusion:**
Hypothesis testing allows data-driven decisions by providing a structured way to test claims.  
However, statistical significance should always be interpreted with **business context and practical importance** in mind.

