# Chi-Square Goodness of Fit Test

## Introduction
The Chi-Square Goodness of Fit test checks whether the observed frequencies of a single categorical variable are consistent with a specified (hypothesized) distribution.

## Hypotheses
- Null Hypothesis (H0): The data follow the specified distribution; any gap between observed and expected frequencies is due to chance.
- Alternative Hypothesis (H1): The data do not follow the specified distribution.

## Formula
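\[ \chi^2 = \sum_{i=1}^{k} \frac{(O_i - E_i)^2}{E_i} \]

Here \(O_i\) and \(E_i\) are the observed and expected counts in category \(i\), and \(k\) is the number of categories. When the expected distribution is fully specified in advance, the statistic is compared against a chi-square distribution with \(k - 1\) degrees of freedom.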

## Example
Suppose you roll a six-sided die 60 times and want to test whether it is fair, that is, whether each face is equally likely.

### Observed Frequencies
- 1: 12 times
- 2: 10 times
- 3: 8 times
- 4: 15 times
- 5: 9 times
- 6: 6 times

### Expected Frequencies (for a fair die)
- 1: 10 times
- 2: 10 times
- 3: 10 times
- 4: 10 times
- 5: 10 times
- 6: 10 times
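
Plugging these counts into the formula gives the test statistic for this example. With six categories, the test has 6 - 1 = 5 degrees of freedom.

\[ \chi^2 = \frac{(12-10)^2}{10} + \frac{(10-10)^2}{10} + \frac{(8-10)^2}{10} + \frac{(15-10)^2}{10} + \frac{(9-10)^2}{10} + \frac{(6-10)^2}{10} = \frac{4 + 0 + 4 + 25 + 1 + 16}{10} = 5.0 \]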

## Code
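```python
import scipy.stats as stats

# Observed counts for faces 1-6 over 60 rolls
observed = [12, 10, 8, 15, 9, 6]
# Expected counts for a fair die: 60 rolls / 6 faces = 10 each
expected = [10, 10, 10, 10, 10, 10]

# chisquare uses len(observed) - 1 degrees of freedom by default
chi2_stat, p_value = stats.chisquare(f_obs=observed, f_exp=expected)
print(f"Chi-Square Statistic: {chi2_stat:.3f}\nP-value: {p_value:.4f}")
```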

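To turn the statistic into a decision, one common approach is to compare the p-value with a chosen significance level, or equivalently the statistic with a critical value. The sketch below assumes a significance level of 0.05, which the example does not specify; `alpha`, `dof`, `critical_value`, and `reject_h0` are illustrative names.

```python
from scipy import stats

observed = [12, 10, 8, 15, 9, 6]
expected = [10] * 6
alpha = 0.05  # assumed significance level, not given in the example

chi2_stat, p_value = stats.chisquare(f_obs=observed, f_exp=expected)

# Degrees of freedom: number of categories minus one
dof = len(observed) - 1
# Critical value: the (1 - alpha) quantile of the chi-square distribution
critical_value = stats.chi2.ppf(1 - alpha, df=dof)

# Reject H0 when the p-value falls below alpha
reject_h0 = p_value < alpha
print(f"Statistic: {chi2_stat:.3f}, critical value: {critical_value:.3f}")
print(f"P-value: {p_value:.4f}, reject H0: {reject_h0}")
```

For these counts the statistic is well below the critical value, so under the assumed 0.05 level the data give no evidence that the die is unfair.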


# Chi-Square Test of Homogeneity

## Introduction
The Chi-Square Test of Homogeneity checks whether a single categorical variable has the same distribution across two or more groups (populations).

## Hypotheses
- Null Hypothesis (H0): The proportions are the same across all groups.
- Alternative Hypothesis (H1): At least one group has proportions that differ from the others.

## Formula
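\[ \chi^2 = \sum_{i} \sum_{j} \frac{(O_{ij} - E_{ij})^2}{E_{ij}} \]

Here \(O_{ij}\) is the observed count in row \(i\) and column \(j\), and the expected count is \(E_{ij} = (\text{row } i \text{ total} \times \text{column } j \text{ total}) / \text{grand total}\). For a table with \(r\) rows and \(c\) columns, the test has \((r - 1)(c - 1)\) degrees of freedom.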

## Example
Suppose you want to test if the distribution of favorite colors is the same among three different age groups.

### Observed Frequencies
| Color  | Young (18-25) | Middle-aged (26-40) | Older (41+) |
|--------|---------------|---------------------|-------------|
| Red    | 20            | 15                  | 10          |
| Blue   | 25            | 30                  | 15          |
| Green  | 15            | 10                  | 5           |
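
## Code
A minimal sketch of how this table could be tested, assuming SciPy's `chi2_contingency` is an acceptable tool here (the example itself does not name one). The function derives the expected counts from the row and column totals, so only the observed table is passed in.

```python
import numpy as np
from scipy.stats import chi2_contingency

# Rows: Red, Blue, Green; columns: Young, Middle-aged, Older
observed = np.array([
    [20, 15, 10],
    [25, 30, 15],
    [15, 10, 5],
])

# Returns the statistic, p-value, degrees of freedom, and expected counts
chi2_stat, p_value, dof, expected = chi2_contingency(observed)

print(f"Chi-Square Statistic: {chi2_stat:.3f}")
print(f"P-value: {p_value:.4f}")
print(f"Degrees of freedom: {dof}")
print("Expected frequencies:")
print(expected.round(2))
```

A small p-value would suggest that color preferences differ between at least two of the age groups.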



# Chi-Square Test of Independence

## Introduction
The Chi-Square Test of Independence is used to determine whether there is a significant association between two categorical variables.

## Hypotheses
- Null Hypothesis (H0): There is no association between the two variables.
- Alternative Hypothesis (H1): There is an association between the two variables.

## Formula
\[ \chi^2 = \sum_{i} \sum_{j} \frac{(O_{ij} - E_{ij})^2}{E_{ij}} \]

## Example
Suppose you want to test if there is an association between gender and smoking status.

### Observed Frequencies
|           | Non-Smoker | Smoker |
|-----------|------------|--------|
| Male      | 200        | 150    |
| Female    | 250        | 100    |
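
## Code
A minimal sketch of the test on this table, again assuming SciPy's `chi2_contingency`. For 2x2 tables the function applies Yates' continuity correction by default; `correction=False` is passed here so the result matches the uncorrected formula in this section. Whether to apply the correction is a judgment call the example does not address.

```python
import numpy as np
from scipy.stats import chi2_contingency

# Rows: Male, Female; columns: Non-Smoker, Smoker
observed = np.array([
    [200, 150],
    [250, 100],
])

# correction=False disables Yates' continuity correction (default is True)
chi2_stat, p_value, dof, expected = chi2_contingency(observed, correction=False)

print(f"Chi-Square Statistic: {chi2_stat:.3f}")
print(f"P-value: {p_value:.6f}")
print(f"Degrees of freedom: {dof}")
print("Expected frequencies:")
print(expected)
```

Each expected count is the row total times the column total divided by the grand total, for example 350 × 450 / 700 = 225 for male non-smokers.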


