### Take Home Task

### Chi-Square (χ²) Test

A chi-square (χ²) test is a statistical method used to compare observed data with expected data under a specific model. It measures how well the observed results fit the expected distribution by evaluating the discrepancies between them. This test is mainly applied to categorical data, where the order of categories does not matter.  

In general, chi-square tests are used to determine whether:
- The observed results align with the expected results.
- Two categorical variables are related (dependent) or independent.



#### Types of Chi-Square Tests

1. Test of Independence 
   - Examines whether two categorical variables are related.  
   - Example: Is there a relationship between car brand preference and gender?

2. Goodness-of-Fit Test
   - Assesses how well a sample distribution matches a known or assumed population distribution.  
   - Example: *Does a sample reflect the expected proportions in the population?*



#### Mathematical Representation

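The chi-square statistic is calculated as:

$$
\chi^2_c = \sum_i \frac{(O_i - E_i)^2}{E_i}
$$

Where:  
- \( O_i \) = Observed value (count) in category \( i \)  
- \( E_i \) = Expected value (count) in category \( i \)  
- \( c \) = Degrees of freedom (the subscript on \( \chi^2 \); it selects the reference distribution and is not a term in the sum)  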
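
As a minimal sketch of the formula above, the following Python snippet computes the statistic for a goodness-of-fit check. The die-roll counts are hypothetical data invented for illustration, and `chi_square_statistic` is an illustrative helper name, not part of any library. The decision uses the standard 5% critical value for 5 degrees of freedom (11.070) instead of an exact p-value, so that only the standard library is needed.

```python
def chi_square_statistic(observed, expected):
    """Return the sum of (O_i - E_i)^2 / E_i over all categories."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical data: 120 rolls of a die, 20 expected per face if the die is fair.
observed = [25, 17, 15, 23, 24, 16]
expected = [20] * 6

statistic = chi_square_statistic(observed, expected)
degrees_of_freedom = len(observed) - 1  # number of categories minus one

# Standard 5% critical value of the chi-square distribution with 5 degrees of freedom.
CRITICAL_VALUE_5PCT_DF5 = 11.070
reject_null = statistic > CRITICAL_VALUE_5PCT_DF5

print(f"chi2 = {statistic:.2f}, df = {degrees_of_freedom}, reject H0: {reject_null}")
```

With these made-up counts the statistic stays well below the critical value, so the sample gives no evidence against a fair die at the 5% level.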


The chi-square test helps determine whether the differences between observed and expected results are plausibly due to chance or are large enough to be statistically significant.
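
For the test of independence, the expected counts are not given in advance. They are derived from the table margins as (row total × column total) / grand total. The sketch below illustrates this under stated assumptions: the 2×2 survey table is hypothetical, the function names are illustrative, and no continuity correction is applied.

```python
def expected_counts(table):
    """Expected cell counts under independence: row total * column total / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]


def chi_square_independence(table):
    """Return (statistic, degrees of freedom) for a contingency table of counts."""
    expected = expected_counts(table)
    statistic = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof


# Hypothetical survey: rows are gender, columns are preferred car brand (A, B).
table = [
    [30, 20],
    [20, 30],
]

statistic, dof = chi_square_independence(table)

# Standard 5% critical value of the chi-square distribution with 1 degree of freedom.
CRITICAL_VALUE_5PCT_DF1 = 3.841
print(f"chi2 = {statistic:.2f}, df = {dof}, reject H0: {statistic > CRITICAL_VALUE_5PCT_DF1}")
```

Here every expected count is 25, and the statistic is compared against the 1-degree-of-freedom critical value because a 2×2 table has (2 − 1)(2 − 1) = 1 degree of freedom.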


### Analysis of Variance (ANOVA)

ANOVA (Analysis of Variance) is a statistical test used to evaluate differences between the means of three or more groups. Instead of comparing groups one by one, ANOVA allows for the simultaneous comparison of multiple group means.  

The main goal is to determine whether the differences among group means are likely due to random chance or represent true, meaningful differences.  

- One-Way ANOVA: Compares means across groups with a single independent variable.  
- Two-Way ANOVA: Examines the effect of two independent variables and their possible interaction.  
- General ANOVA: Can handle multiple factors and their interactions.  

Although ANOVA is typically performed with statistical software, it can also be computed by hand.  



#### Mathematical Representation

$$
F = \frac{MST}{MSE}
$$

Where:  
- \( F \) = ANOVA test statistic  
- \( MST \) = Mean square due to treatment (between-group variability)  
- \( MSE \) = Mean square due to error (within-group variability)  


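A large \( F \) value means the variation between group means is large relative to the variation within groups, which suggests that the observed differences in group means are statistically significant rather than simply due to chance.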
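
The ratio above can be computed by hand for a one-way design. The sketch below assumes the usual one-way definitions: the treatment sum of squares divided by (number of groups − 1) gives MST, and the error sum of squares divided by (total observations − number of groups) gives MSE. The three groups are hypothetical data, and `one_way_anova_f` is an illustrative name.

```python
def one_way_anova_f(groups):
    """Return (F, df_treatment, df_error) for a list of samples, one list per group."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    group_means = [sum(g) / len(g) for g in groups]

    # Between-group (treatment) sum of squares.
    ss_treatment = sum(
        len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means)
    )
    # Within-group (error) sum of squares.
    ss_error = sum((x - m) ** 2 for g, m in zip(groups, group_means) for x in g)

    df_treatment = k - 1
    df_error = n_total - k
    mst = ss_treatment / df_treatment
    mse = ss_error / df_error
    return mst / mse, df_treatment, df_error


# Hypothetical measurements for three groups.
groups = [
    [4, 5, 6],
    [6, 7, 8],
    [8, 9, 10],
]

f_statistic, df_treatment, df_error = one_way_anova_f(groups)
print(f"F = {f_statistic:.2f} with ({df_treatment}, {df_error}) degrees of freedom")
```

The resulting F value would then be compared against the F distribution with (2, 6) degrees of freedom, for example through a table of critical values or statistical software.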


### Two-Tailed Test

A two-tailed test is not a standalone statistical test, but rather a feature of how a hypothesis test is designed. It is related to the directionality of the alternative hypothesis in tests such as the t-test or z-test.  

In a two-tailed test, we evaluate whether a sample mean is significantly different from the hypothesized population mean, considering both possibilities:  
- The sample mean may be greater than the population mean.  
- The sample mean may be less than the population mean.  

This type of test is used when we are interested in detecting any significant difference from the null hypothesis value, regardless of the direction of the difference.  

It is often regarded as the default approach, since it does not require committing to a direction of the effect in advance.
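
To make the idea of directionality concrete, the sketch below contrasts a two-tailed p-value with an upper one-tailed p-value for the same z statistic. It assumes the test statistic follows a standard normal distribution (as in a z-test) and uses `statistics.NormalDist` from the Python standard library. The function names are illustrative.

```python
from statistics import NormalDist

STANDARD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def two_tailed_p_value(z):
    """Probability of a value at least as extreme as z in either direction."""
    return 2 * (1 - STANDARD_NORMAL.cdf(abs(z)))


def upper_tailed_p_value(z):
    """Probability of a value at least as large as z (one direction only)."""
    return 1 - STANDARD_NORMAL.cdf(z)


z = 1.96
print(f"two-tailed p   = {two_tailed_p_value(z):.4f}")
print(f"upper-tailed p = {upper_tailed_p_value(z):.4f}")
```

For a positive z, the two-tailed p-value is twice the upper-tailed one, and the sign of z does not affect the two-tailed result. For z = 1.96 the two-tailed p-value is close to 0.05.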


### Proportion Tests

Proportion tests are used when the variable of interest is expressed as a proportion or percentage (e.g., the proportion of users who click a link, or the percentage of defective products in a batch). These tests help evaluate hypotheses about one or more population proportions.  



#### Types of Proportion Tests  

- One-Sample Proportion Test: Compares a sample proportion to a hypothesized population proportion.  
  Example: “Is the proportion of left-handed students in this GenAI class (12%) significantly different from the national average of 10%?”  



#### Difference from Chi-Square Test  

- Proportion Tests: Focus on testing proportions, that is, binary outcomes (such as click / no click) summarized as a fraction of the sample.  
- Chi-Square Tests: Focus on testing relationships or associations between categorical variables.  



#### Test Statistic  

Proportion tests are often based on the z-test statistic, which is calculated as:

$$
z = \frac{\hat{p} - p_0}{\sqrt{\frac{p_0 (1 - p_0)}{n}}}
$$

Where:  
- \( \hat{p} \) = Sample proportion  
- \( p_0 \) = Hypothesized (claimed) population proportion  
- \( n \) = Sample size  


Proportion tests are applied when we want to verify whether a sample proportion significantly differs from a known or assumed population proportion.
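
The z statistic above can be applied to the left-handed students example. The text gives the sample proportion (12%) and the hypothesized proportion (10%) but not the class size, so the value `n=200` below is purely an assumption for illustration. The two-sided p-value relies on the normal approximation, and `one_sample_proportion_z` is an illustrative name.

```python
from math import sqrt
from statistics import NormalDist


def one_sample_proportion_z(p_hat, p0, n):
    """z statistic for a one-sample proportion test under the null proportion p0."""
    standard_error = sqrt(p0 * (1 - p0) / n)
    return (p_hat - p0) / standard_error


# p_hat and p0 come from the example in the text; n = 200 is an assumed sample size.
z = one_sample_proportion_z(p_hat=0.12, p0=0.10, n=200)

# Two-sided p-value under the standard normal approximation.
p_value = 2 * (1 - NormalDist().cdf(abs(z)))

print(f"z = {z:.3f}, two-sided p-value = {p_value:.3f}")
```

Under this assumed sample size the difference between 12% and 10% is not significant at the 5% level. A larger sample with the same proportions would give a larger z.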
