# Parametric Tests:

Parametric tests are statistical tests that assume the data being analyzed follow a known distribution, most commonly the normal distribution. They are used to make inferences about population parameters (such as the mean or variance) based on a sample of data.

## One-Sample Test:

A one-sample test is used when there is only one population of interest and a single sample is taken from it. The test determines whether a sample statistic (for example, the sample mean) differs significantly from a hypothesized value of the population parameter.
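
As a minimal sketch of a one-sample test, the snippet below runs a one-sample t-test with `scipy.stats.ttest_1samp`. The sample values and the hypothesized mean `mu_0` are made up for illustration and do not come from any real dataset.

```python
import numpy as np
from scipy import stats

# Hypothetical sample of 10 measurements (illustrative values only).
sample = np.array([5.1, 4.9, 5.6, 5.8, 5.2, 5.4, 5.0, 5.7, 5.3, 5.5])

# Hypothesized population mean under the null hypothesis (an assumption).
mu_0 = 5.0

# H0: the population mean equals mu_0 (two-sided alternative by default).
t_stat, p_value = stats.ttest_1samp(sample, popmean=mu_0)

print(f"t = {t_stat:.3f}, p = {p_value:.4f}")
```

A small p-value here would suggest that the sample mean is unlikely to have come from a population whose mean is `mu_0`.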

## Two-Sample Test:

A two-sample test is used when samples are collected from two different populations. The test assesses whether the population parameters differ significantly, based on the statistics computed from the two samples.
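
A two-sample comparison might look like the sketch below, which uses `scipy.stats.ttest_ind` on two independent groups. The group values are hypothetical, and `equal_var=False` (Welch's t-test) is one reasonable choice here because it does not assume the two populations share the same variance.

```python
import numpy as np
from scipy import stats

# Hypothetical scores from two independent groups (illustrative values only).
group_a = np.array([82, 75, 90, 68, 77, 85, 80, 73])
group_b = np.array([70, 65, 78, 72, 60, 74, 69, 71])

# H0: the two population means are equal.
# equal_var=False selects Welch's t-test (no equal-variance assumption).
t_stat, p_value = stats.ttest_ind(group_a, group_b, equal_var=False)

print(f"t = {t_stat:.3f}, p = {p_value:.4f}")
```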

## Critical Test Statistic:

A critical test statistic (or critical value) is the threshold against which the sample test statistic is compared to decide whether to reject or fail to reject the null hypothesis.

## Region of Rejection:

The region of rejection is the range of test-statistic values for which the null hypothesis is rejected. The region of acceptance is the remaining range, where the null hypothesis is not rejected.
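
To make the critical value and the region of rejection concrete, here is a small sketch that computes a z-statistic from summary values and checks where it falls. It assumes a z-test with rejection on both tails at a significance level of 0.05; the sample mean, hypothesized mean, standard deviation, and sample size are all hypothetical.

```python
import math
from scipy.stats import norm

# Hypothetical summary values (assumptions for illustration).
sample_mean = 52.3   # observed sample mean
mu_0 = 50.0          # population mean under the null hypothesis
sigma = 6.0          # known population standard deviation
n = 36               # sample size
alpha = 0.05         # significance level

# Sample test statistic: distance from mu_0 in standard-error units.
z_stat = (sample_mean - mu_0) / (sigma / math.sqrt(n))

# Critical value: boundary between the acceptance and rejection regions.
z_crit = norm.ppf(1 - alpha / 2)

# The null hypothesis is rejected when z_stat lies beyond the critical value.
reject_null = abs(z_stat) > z_crit

print(f"z = {z_stat:.3f}, critical value = {z_crit:.3f}, reject H0: {reject_null}")
```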

## Types of Tests
The following four types of tests will be discussed:

+ **Z-tests**: Used when the population standard deviation is known, typically with large sample sizes (n >= 30).
+ **T-tests**: Used when the population standard deviation is unknown and must be estimated from the sample, typically with small sample sizes (n < 30).
+ **F-tests**: Used for comparing variances, or for comparing the means of more than two groups (as in ANOVA).
+ **Chi-square**: Used for comparing categorical data.
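
For the last two entries in the list, the sketch below shows one way they might be run in SciPy: `scipy.stats.f_oneway` for an F-test across three groups (one-way ANOVA) and `scipy.stats.chi2_contingency` for a chi-square test on a table of counts. All data values are hypothetical.

```python
from scipy import stats

# Hypothetical measurements for three groups (illustrative values only).
group_1 = [23, 25, 21, 22, 24]
group_2 = [30, 28, 31, 29, 32]
group_3 = [26, 27, 25, 28, 26]

# F-test (one-way ANOVA). H0: all group means are equal.
f_stat, f_p = stats.f_oneway(group_1, group_2, group_3)
print(f"F = {f_stat:.3f}, p = {f_p:.4g}")

# Hypothetical 2x2 table of counts (rows: groups, columns: categories).
observed = [[30, 10],
            [20, 40]]

# Chi-square test of independence. H0: row and column variables are independent.
chi2_result = stats.chi2_contingency(observed)
chi2_stat, chi2_p, dof = chi2_result[0], chi2_result[1], chi2_result[2]
print(f"chi2 = {chi2_stat:.3f}, p = {chi2_p:.4g}, dof = {dof}")
```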

## One-Tail Test (Directional Test):

A one-tail test is performed when testing for a change in the mean in a direction that is specified in advance.

There are two types of one-tail tests:  

+ **Upper one-tail**: The region of rejection is on the right tail. This test is performed when testing if the mean score has increased.
+ **Lower one-tail**: The region of rejection is on the left tail. This test is performed when testing if the mean score has decreased.

## Two-Tail Test (Non-Directional Test):

A two-tail test is performed when testing for a change in the mean, without knowledge of the direction of the change. The region of rejection is located on both tails of the distribution.
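
The difference between the directional and non-directional cases can be sketched by comparing their critical values on the standard normal distribution. The significance level of 0.05 and the z-statistic of 1.8 are assumptions chosen for illustration.

```python
from scipy.stats import norm

alpha = 0.05   # significance level (assumption)
z_stat = 1.8   # hypothetical sample test statistic

# One-tail tests place all of alpha in a single tail.
upper_crit = norm.ppf(1 - alpha)   # right-tail boundary
lower_crit = norm.ppf(alpha)       # left-tail boundary

# A two-tail test splits alpha evenly between both tails.
two_tail_crit = norm.ppf(1 - alpha / 2)

reject_upper = z_stat > upper_crit
reject_lower = z_stat < lower_crit
reject_two_tail = abs(z_stat) > two_tail_crit

print(f"upper: {upper_crit:.3f}, lower: {lower_crit:.3f}, two-tail: +/-{two_tail_crit:.3f}")
print(reject_upper, reject_lower, reject_two_tail)
```

With these values, the statistic falls in the rejection region of the upper one-tail test but not in that of the two-tail test, because splitting the significance level across both tails pushes the critical value further out.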

## The P-value:

The p-value is the probability, assuming the null hypothesis is true, of obtaining a test statistic at least as extreme as the one observed. It is calculated from the sample data and compared to a significance level, typically 0.05.
+ If p < 0.05, the observed data would be unlikely if the null hypothesis were true, so the null hypothesis is rejected.
+ If p >= 0.05, the data are consistent with the null hypothesis, and the null hypothesis is not rejected.
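
As a rough sketch of how a p-value can be obtained from a z-statistic, the snippet below uses the standard normal distribution from `scipy.stats`. The z-statistic of 2.1 is hypothetical; `norm.sf` is the survival function, equal to `1 - cdf`.

```python
from scipy.stats import norm

z_stat = 2.1   # hypothetical sample test statistic
alpha = 0.05   # significance level

# Upper one-tail p-value: P(Z >= z_stat).
p_upper = norm.sf(z_stat)

# Lower one-tail p-value: P(Z <= z_stat).
p_lower = norm.cdf(z_stat)

# Two-tail p-value: probability of a value at least this extreme in either tail.
p_two_tail = 2 * norm.sf(abs(z_stat))

print(f"upper: {p_upper:.4f}, lower: {p_lower:.4f}, two-tail: {p_two_tail:.4f}")
print("Reject H0 (two-tail):", p_two_tail < alpha)
```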

![](./data/hypothesis.png)

SciPy is a Python library for mathematics, science, and engineering. We will import several useful functions from its `scipy.stats` module.

## CDF (cumulative distribution function) and PPF (percent point function)


The `cdf()` function returns the cumulative probability for a given z-statistic, while `ppf()` returns the z-statistic for a given cumulative probability. For this reason, `ppf()` is sometimes referred to as the inverse of `cdf()`.

The sample code below illustrates this calculation.

![](data/cdf_ppf.png)

```python
from scipy.stats import norm

z = 1.6448536269514722
p = 0.95

print("Calculating p given z: p = ", norm.cdf(z))
print("Calculating z given p: z = ", norm.ppf(p))
```

```
Calculating p given z: p =  0.95
Calculating z given p: z =  1.6448536269514722
```


# Choosing a Statistical Test

The diagram below shows how to choose a test depending on the type of data.

![](data/a1.png)