# T Test

## Independent Sample Test
The independent samples t-test (also called the two-sample t-test) compares the means of two independent groups to determine whether there is statistical evidence that the corresponding population means differ.
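
As a rough sketch of what this test computes, the pooled-variance t statistic can be written out by hand and compared with `scipy.stats.ttest_ind`, which assumes equal variances by default (`equal_var=True`). The two arrays below are made-up placeholder values, not the footfall data, and `pooled_t` is a hypothetical helper name used only for illustration.

```python
import numpy as np
from scipy.stats import ttest_ind

def pooled_t(a, b):
    """Two-sample t statistic assuming equal population variances."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    n1, n2 = len(a), len(b)
    # Pooled sample variance (ddof=1 gives the unbiased sample variance)
    sp2 = ((n1 - 1) * a.var(ddof=1) + (n2 - 1) * b.var(ddof=1)) / (n1 + n2 - 2)
    # Difference in means divided by its standard error
    return (a.mean() - b.mean()) / np.sqrt(sp2 * (1 / n1 + 1 / n2))

# Placeholder samples for illustration only (not from footfalls.csv)
group_a = [12.0, 15.0, 11.0, 14.0, 13.0]
group_b = [10.0, 9.0, 12.0, 11.0, 8.0]

t_manual = pooled_t(group_a, group_b)
t_scipy, p_scipy = ttest_ind(group_a, group_b)
print("manual t:", t_manual)
print("scipy t: ", t_scipy)
```

The two printed values should agree up to floating-point error. If the equal-variance assumption is doubtful, `ttest_ind(..., equal_var=False)` runs Welch's t-test instead.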

![image.png](attachment:image.png)

### Problem Statement

A global fast food chain wants to enter a new city by opening a store in a popular mall. It has shortlisted two popular locations, A and B, and wants to choose one of them based on footfall.

The company has obtained footfall data for both locations and wants to know which of the two to choose and whether there is a significant difference between them.

Here is the footfall per day (in thousands) for 10 randomly chosen days:<br>
![image.png](attachment:image.png)

In [21]:
from scipy.stats import ttest_ind
import pandas as pd
df = pd.read_csv("footfalls.csv")
tvalue, pvalue = ttest_ind(df.Loc_A, df.Loc_B)
print("t Statistic: ", tvalue)  
print("P Value: ", pvalue)

t Statistic:  2.1161259284967686
P Value:  0.048530501643401246


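From the t-distribution table, the two-tailed critical value at alpha = 0.05 with 18 degrees of freedom (10 + 10 - 2) is t critical = 2.101.
Since |tvalue| = 2.116 > tcritical, we reject the null hypothesis of equal means: the difference in footfall between the two locations is statistically significant.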
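
Instead of reading the critical value from a table, it can be computed with `scipy.stats.t.ppf`. This is a minimal sketch that assumes a two-tailed test at alpha = 0.05 and 10 observations per location, as described in the problem statement.

```python
from scipy.stats import t

alpha = 0.05
dof = 10 + 10 - 2  # n_A + n_B - 2 for the pooled two-sample test

# Two-tailed test: put alpha/2 in each tail
t_critical = t.ppf(1 - alpha / 2, dof)
print("t critical:", round(t_critical, 3))
```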

Alternatively

In [22]:
if pvalue < 0.05:    # alpha value is 0.05 or 5%
    print(" Footfall is different")
else:
    print("Footfall is same")

 Footfall is different


![image.png](attachment:image.png)

##  One-sample T-test

The one-sample t-test tells us whether the mean of a sample differs significantly from a known or hypothesized population mean.

![image.png](attachment:image.png)

### Problem Statement
We have the potato yield from 12 different farms. We know that the standard potato yield for the given variety is µ=20.

x = [21.5, 24.5, 18.5, 17.2, 14.5, 23.2, 22.1, 20.5, 19.4, 18.1, 24.1, 18.5]

Test whether the potato yield from these farms is the same as the standard yield.

In [23]:
x =  [21.5, 24.5, 18.5, 17.2, 14.5, 23.2, 22.1, 20.5, 19.4, 18.1, 24.1, 18.5]

In [24]:
from scipy.stats import ttest_1samp
import numpy as np

print(np.mean(x))

20.175


In [25]:
tvalue, pvalue = ttest_1samp(x, popmean=20)
print("t value: ", tvalue)  
print("P Value: ", pvalue)

t value:  0.2006562773994862
P Value:  0.8446291893053613


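From the t-distribution table, the two-tailed critical value at alpha = 0.05 with 11 degrees of freedom (12 - 1) is t critical = 2.201.
Since |tvalue| = 0.201 < tcritical, we fail to reject the null hypothesis: there is no evidence that the potato yield from these farms differs from the standard yield.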
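
The same comparison can be reproduced by hand. The sketch below computes the one-sample t statistic as (sample mean - µ) / (s / sqrt(n)) and obtains the critical value from `scipy.stats.t.ppf`, assuming a two-tailed test at alpha = 0.05. The names `t_manual` and `t_critical` are illustrative only.

```python
import numpy as np
from scipy.stats import t

x = [21.5, 24.5, 18.5, 17.2, 14.5, 23.2, 22.1, 20.5, 19.4, 18.1, 24.1, 18.5]
mu = 20  # standard yield from the problem statement
n = len(x)

# Sample standard deviation (ddof=1) and standard error of the mean
s = np.std(x, ddof=1)
t_manual = (np.mean(x) - mu) / (s / np.sqrt(n))

# Two-tailed critical value with n - 1 degrees of freedom
t_critical = t.ppf(1 - 0.05 / 2, n - 1)

print("t value:   ", t_manual)
print("t critical:", round(t_critical, 3))
print("reject H0: ", abs(t_manual) > t_critical)
```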

Alternatively

In [26]:
if pvalue < 0.05:    # alpha value is 0.05 or 5%
    print("  potato yield from these farms is different from the standard yield")
else:
    print(" potato yield from these farms is same as the standard yield")

 potato yield from these farms is same as the standard yield


## Paired Sample Test
A paired t-test checks whether the mean difference between two sets of measurements taken on the same subjects (for example, before and after a treatment) is significantly different from zero.

### Problem Statement

Suppose we are given the performance of athletes before and after a change in diet. Determine whether there is any significant change in their performance after the change in diet.

In [27]:
before=[25,30,28,35,28,34,26,29,30,26,28,32,31,30,45]
after=[26,31,28,33,30,34,30,29,28,28,29,31,35,31,45]

In [28]:
from scipy.stats import ttest_rel
tvalue,pvalue=ttest_rel(a=before,b=after)

In [29]:
print("t Statistic: ", tvalue)  
print("P Value: ", pvalue)

t Statistic:  -1.5853558325268822
P Value:  0.13520778142018045


In [30]:
if pvalue < 0.05:    # alpha value is 0.05 or 5%
    print("Change in performance")
else:
    print("No change in performance")

No change in performance
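
One way to see what the paired test does: it is equivalent to a one-sample t-test on the per-athlete differences against a mean of zero. The sketch below checks this with the same `before` and `after` lists; the variable names `diff`, `t_rel`, and `t_one` are illustrative only.

```python
import numpy as np
from scipy.stats import ttest_rel, ttest_1samp

before = [25, 30, 28, 35, 28, 34, 26, 29, 30, 26, 28, 32, 31, 30, 45]
after = [26, 31, 28, 33, 30, 34, 30, 29, 28, 28, 29, 31, 35, 31, 45]

# Per-athlete difference (before minus after, matching ttest_rel's a - b)
diff = np.array(before) - np.array(after)

t_rel, p_rel = ttest_rel(before, after)
t_one, p_one = ttest_1samp(diff, popmean=0)

print("paired:     ", t_rel, p_rel)
print("one-sample: ", t_one, p_one)
```

Both lines should print the same statistic and p-value, which is why the paired test has n - 1 degrees of freedom, where n is the number of pairs.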
