# Correlation Tests

These tests check whether two samples are related.

## 1. Pearson’s Correlation Coefficient 
Tests whether two samples have a linear relationship.

**Assumptions**
- Observations in each sample are independent and identically distributed (iid).
- Observations in each sample are normally distributed.
- The relationship between the two samples is linear, with a roughly constant spread of one sample across the range of the other.

**Interpretation**
- H0: the two samples are independent (no linear correlation).
- H1: there is a linear dependency between the samples.

**More Information**
- [scipy.stats.pearsonr](https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.pearsonr.html)
- [Pearson’s correlation coefficient on Wikipedia](https://en.wikipedia.org/wiki/Pearson_correlation_coefficient)

```python
# Example of the Pearson's Correlation test
from scipy.stats import pearsonr

data1 = [0.873, 2.817, 0.121, -0.945, -0.055, -1.436, 0.360, -1.478, -1.637, -1.869]
data2 = [0.353, 3.517, 0.125, -7.545, -0.555, -1.536, 3.350, -1.578, -3.537, -1.579]

# pearsonr returns the correlation coefficient and the two-sided p-value
stat, p = pearsonr(data1, data2)
print('stat = %.3f, p = %.3f' % (stat, p))

# Compare the p-value against a 5% significance level
if p > 0.05:
    print('Probably independent')
else:
    print('Probably dependent')
```

```
stat = 0.688, p = 0.028
Probably dependent
```
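
As a rough illustration of what the statistic measures, the sketch below recomputes the coefficient for the same data directly from the textbook formula (the sum of products of the centred samples divided by the square root of the product of their sums of squares) and compares it with `pearsonr`. The names `r_manual` and `r_scipy` are illustrative only, and NumPy is assumed to be installed alongside SciPy.

```python
import numpy as np
from scipy.stats import pearsonr

data1 = [0.873, 2.817, 0.121, -0.945, -0.055, -1.436, 0.360, -1.478, -1.637, -1.869]
data2 = [0.353, 3.517, 0.125, -7.545, -0.555, -1.536, 3.350, -1.578, -3.537, -1.579]

x = np.asarray(data1)
y = np.asarray(data2)

# Centre each sample on its mean
xc = x - x.mean()
yc = y - y.mean()

# r = sum(xc * yc) / sqrt(sum(xc^2) * sum(yc^2))
r_manual = (xc @ yc) / np.sqrt((xc @ xc) * (yc @ yc))

# Reference value from SciPy (hypothetical variable names)
r_scipy, p_value = pearsonr(x, y)

print('manual r = %.3f, scipy r = %.3f' % (r_manual, r_scipy))
```

The two values should agree up to floating-point error. The coefficient always lies between -1 and 1, with values near 0 indicating little or no linear relationship.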


## 2. Chi-Squared Test
Tests whether two categorical variables are related or independent.

**Assumptions**
- Observations used in the calculation of the contingency table are independent.
- Enough observations in each cell of the contingency table; a commonly quoted guideline is an expected count of at least 5 per cell.

**Interpretation**
- H0: the two variables are independent.
- H1: there is a dependency between the variables.

**More Information**
- [scipy.stats.chi2_contingency](https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.chi2_contingency.html)
- [Chi-Squared test on Wikipedia](https://en.wikipedia.org/wiki/Chi-squared_test)

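```python
# Example of the Chi-Squared Test
from scipy.stats import chi2_contingency

# Contingency table of observed counts (rows and columns are categories)
table = [[10, 20, 30], [6, 9, 17]]

# Returns the test statistic, p-value, degrees of freedom, and expected counts
stat, p, dof, expected = chi2_contingency(table)
print('stat=%.3f, p=%.3f' % (stat, p))

# Compare the p-value against a 5% significance level
if p > 0.05:
    print('Probably independent')
else:
    print('Probably dependent')
```

```
stat=0.272, p=0.873
Probably independent
```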
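
The expected counts returned by `chi2_contingency` offer one way to check the cell-count assumption before trusting the p-value. The sketch below recomputes them from the row and column totals (row total times column total divided by the grand total, which is what independence implies) and compares the smallest against a threshold. The names `expected_manual` and `min_expected` are illustrative, and the value 5 is an assumed rule of thumb, so adjust it to whichever guideline you follow.

```python
import numpy as np
from scipy.stats import chi2_contingency

table = np.array([[10, 20, 30], [6, 9, 17]])
stat, p, dof, expected = chi2_contingency(table)

# Expected counts under independence: row total * column total / grand total
row_totals = table.sum(axis=1, keepdims=True)
col_totals = table.sum(axis=0, keepdims=True)
expected_manual = row_totals * col_totals / table.sum()

# Assumed rule-of-thumb threshold for the smallest expected count
min_expected = 5

print('dof = %d' % dof)
print(np.round(expected, 2))
print('all expected counts >= %d: %s' % (min_expected, (expected >= min_expected).all()))
```

For an r-by-c table the degrees of freedom are (r - 1) * (c - 1), so this 2-by-3 table has 2.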
