# Summary of Statistical Tests

| Type of Data | Test Name                  | Parametric/Non-Parametric | When to Use                                                                                          | Python Function (statsmodels or SciPy) |
|--------------|----------------------------|---------------------------|------------------------------------------------------------------------------------------------------|----------------------------|
| Categorical  | Chi-Square Test            | Non-Parametric            | To test independence between categorical variables (e.g., click/no-click across control/experiment groups). | `scipy.stats.chi2_contingency` (or `statsmodels.stats.contingency_tables.Table(...).test_nominal_association()`) |
| Categorical  | Fisher's Exact Test        | Non-Parametric            | For small sample sizes, or when the expected frequency in any cell of a contingency table is < 5.     | `scipy.stats.fisher_exact` (2x2 tables; no statsmodels equivalent) |
| Ordinal      | Mann-Whitney U Test        | Non-Parametric            | To compare the ranks of two independent groups.                                                      | `scipy.stats.mannwhitneyu` |
| Ordinal      | Wilcoxon Signed-Rank Test  | Non-Parametric            | To compare the ranks of two related groups (paired data).                                            | `scipy.stats.wilcoxon` |
| Continuous   | t-Test                     | Parametric                | To compare the means of two independent groups (data should be approximately normally distributed).  | `statsmodels.stats.weightstats.ttest_ind` |
| Continuous   | Paired t-Test              | Parametric                | To compare the means of two related groups (paired data, differences approximately normally distributed). | `scipy.stats.ttest_rel` |
| Continuous   | Z-Test                     | Parametric                | To compare the means of two independent groups when the sample size is large and the population standard deviation is known. | `statsmodels.stats.weightstats.ztest` |
| Continuous   | ANOVA                      | Parametric                | To compare the means of three or more independent groups (normally distributed, equal variances).    | `statsmodels.stats.anova.anova_lm` on a fitted OLS model, or `scipy.stats.f_oneway` (`AnovaRM` is for repeated measures) |
| Continuous   | Kruskal-Wallis Test        | Non-Parametric            | To compare the ranks of three or more independent groups (non-normal distribution).                   | `scipy.stats.kruskal` |
| Continuous   | Pearson's Correlation      | Parametric                | To measure the linear relationship between two continuous variables (normally distributed).           | `scipy.stats.pearsonr` |
| Continuous   | Spearman's Rank Correlation| Non-Parametric            | To measure the strength and direction of the monotonic relationship between two ranked variables.     | `scipy.stats.spearmanr` |
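
As a rough illustration of the categorical case (click/no-click across control and experiment groups), the sketch below runs a chi-square test of independence with `scipy.stats.chi2_contingency` and cross-checks it against a two-proportion z-test from `statsmodels.stats.proportion.proportions_ztest`. The counts are invented for the example and are not taken from any real experiment.

```python
import numpy as np
from scipy import stats
from statsmodels.stats.proportion import proportions_ztest

# Hypothetical A/B counts (made up for illustration).
# Rows are groups, columns are [click, no-click].
observed = np.array([
    [120, 880],  # control
    [150, 850],  # experiment
])

# Chi-square test of independence between group and click outcome.
# correction=False turns off the Yates continuity correction so the
# result can be compared directly with the z-test below.
chi2, p_chi2, dof, expected = stats.chi2_contingency(observed, correction=False)

# Two-proportion z-test on the same data (pooled variance by default).
clicks = observed[:, 0]
n_obs = observed.sum(axis=1)
z_stat, p_z = proportions_ztest(clicks, n_obs)

print(f"chi2 = {chi2:.4f}, dof = {dof}, p = {p_chi2:.4f}")
print(f"z = {z_stat:.4f}, z^2 = {z_stat ** 2:.4f}, p = {p_z:.4f}")
```

For a 2x2 table without continuity correction, the squared z statistic equals the chi-square statistic, so both tests give the same two-sided p-value.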



**PARAMETRIC** tests assume the data follow a specific distribution (usually normal).
**NON-PARAMETRIC** tests make no such assumption; use them when the data are skewed or ordinal and normality cannot be assumed.
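
One common (though debated) heuristic for choosing between the two families is to check normality first and fall back to a rank-based test when the check fails. The sketch below assumes a Shapiro-Wilk check at a 0.05 threshold; `compare_two_groups` is a hypothetical helper name and the samples are simulated.

```python
import numpy as np
from scipy import stats


def compare_two_groups(a, b, alpha=0.05):
    """Choose Welch's t-test or Mann-Whitney U from a Shapiro-Wilk check.

    Returns the name of the test that was used and its two-sided p-value.
    """
    _, p_norm_a = stats.shapiro(a)
    _, p_norm_b = stats.shapiro(b)
    if p_norm_a > alpha and p_norm_b > alpha:
        # No evidence against normality in either group: parametric test.
        result = stats.ttest_ind(a, b, equal_var=False)
        return "welch_t", result.pvalue
    # At least one group looks non-normal: rank-based test.
    result = stats.mannwhitneyu(a, b, alternative="two-sided")
    return "mann_whitney_u", result.pvalue


# Simulated, strongly right-skewed samples (exponential distribution).
rng = np.random.default_rng(42)
skewed_a = rng.exponential(scale=1.0, size=200)
skewed_b = rng.exponential(scale=1.5, size=200)

test_name, p_value = compare_two_groups(skewed_a, skewed_b)
print(test_name, p_value)
```

With large samples the t-test is fairly robust to mild non-normality, so this switch is a convenience rather than a rule.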

----

*Hypothesis: the more one drinks, the worse they will do at school*

**DEPENDENT VARIABLE** is what the researcher measures: **the possible effect** (e.g., grades).

**INDEPENDENT VARIABLE** is what the researcher observes or manipulates: **the possible cause** (e.g., drinking level).

**COVARIATE** is a variable that may influence the relationship between the independent variable (X) and the dependent variable (Y) (e.g., gender, age).
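
To make the three roles concrete, here is a minimal sketch that fits an ordinary least squares model with `statsmodels.formula.api.ols`. The data are simulated for illustration only: the column names, the effect size of -1.5 grade points per drink, and the noise level are all assumptions, not results from a real study.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Simulated data (hypothetical values, not a real survey).
rng = np.random.default_rng(1)
n = 300
age = rng.integers(15, 19, size=n)          # covariate
gender = rng.choice(["F", "M"], size=n)     # covariate
drinks = rng.poisson(3, size=n)             # independent variable (X)

# Dependent variable (Y): grades are built to drop 1.5 points per drink.
grade = 80 - 1.5 * drinks + 0.5 * age + rng.normal(0, 5, size=n)

df = pd.DataFrame(
    {"grade": grade, "drinks": drinks, "age": age, "gender": gender}
)

# Regress Y on X while adjusting for the covariates.
model = smf.ols("grade ~ drinks + age + C(gender)", data=df).fit()
slope = model.params["drinks"]

print(model.params)
print(f"Estimated change in grade per extra drink: {slope:.2f}")
```

Including the covariates in the formula means the `drinks` coefficient is estimated while holding age and gender fixed.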


----


- <https://towardsdatascience.com/a-b-testing-a-complete-guide-to-statistical-testing-e3f1db140499>
- <https://github.com/FrancescoCasalegno/AB-Testing/blob/main/AB_Testing.ipynb>

<img src="../img/stats_tests.png" width="70%">

<img src="../img/time_series.png" width="70%">