# Lab | Inferential statistics


### Instructions

1. It is assumed that the mean systolic blood pressure is `μ = 120 mm Hg`. In the Honolulu Heart Study, a sample of `n = 100` people had an average systolic blood pressure of 130.1 mm Hg with a standard deviation of 21.21 mm Hg. Is the group significantly different (with respect to systolic blood pressure!) from the regular population?

   - Set up the hypothesis test.
   - Write down all the steps followed for setting up the test.
   - Calculate the test statistic by hand and also code it in Python. It should be 4.76190. We will take a look at how to make decisions based on this calculated value.

2. If you finished the previous question, please go through the code for `principal_component_analysis_example` provided in the `files_for_lab` folder.

`Ho: μ = 120 mm Hg`

`H1: μ != 120 mm Hg`

x = 130.1 mm Hg

std = 21.21 mm Hg

n = 100 people
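As a worked check of the hand calculation, the values above can be plugged into the one-sample t-statistic. The symbols $\bar{x}$ (sample mean), $s$ (sample standard deviation) and $t$ are notation assumed here for illustration; the lab text itself only lists the numbers.

$$
t = \frac{\bar{x} - \mu}{s / \sqrt{n}} = \frac{130.1 - 120}{21.21 / \sqrt{100}} = \frac{10.1}{2.121} \approx 4.7619
$$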

In [8]:
import math

sample_mean = 130.1
pop_mean = 120
sample_std = 21.21
n = 100
statistic = (sample_mean - pop_mean)/(sample_std/math.sqrt(n))
print("Statistic is: ", statistic)

Statistic is:  4.761904761904759
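One possible way to turn this statistic into a decision is sketched below. It assumes a two-sided test at a significance level of `alpha = 0.05` and uses Student's t-distribution with `n - 1` degrees of freedom. The names `alpha`, `df`, `critical_value`, `p_value` and `reject_h0` are illustrative and not part of the lab.

```python
import math
from scipy import stats

sample_mean = 130.1
pop_mean = 120
sample_std = 21.21
n = 100
alpha = 0.05  # assumed significance level for this sketch

# One-sample t-statistic, same formula as in the cell above
t_statistic = (sample_mean - pop_mean) / (sample_std / math.sqrt(n))
df = n - 1  # degrees of freedom

# Two-sided critical value: reject Ho when |t| exceeds it
critical_value = stats.t.ppf(1 - alpha / 2, df)

# Two-sided p-value: probability of a |t| at least this large under Ho
p_value = 2 * stats.t.sf(abs(t_statistic), df)

reject_h0 = abs(t_statistic) > critical_value
print("t-statistic:", t_statistic)
print("critical value:", critical_value)
print("p-value:", p_value)
print("Reject Ho:", reject_h0)
```

Comparing `|t|` with the critical value and comparing the p-value with `alpha` are two views of the same rule, so both lead to the same decision.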


In [10]:
from scipy import stats
from numpy.random import normal
import numpy as np

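samples = {}

# Simulate 10 samples of 100 values from a normal distribution that uses the
# observed mean and standard deviation, then compute each t-statistic.
# pop_mean, n and math come from the previous cell.
for i in range(10):
    sample_name = "sample_" + str(i)
    samples[sample_name] = normal(loc = 130.1, scale = 21.21, size = 100)
    # Dictionary keys get their own names so they do not overwrite the numeric
    # sample_mean and sample_std defined in the previous cell.
    mean_key = sample_name + "_mean"
    samples[mean_key] = np.mean(samples[sample_name])
    std_key = sample_name + "_std"
    samples[std_key] = np.std(samples[sample_name], ddof=1)  # ddof=1: sample standard deviation
    statistic_key = sample_name + "_t-statistic"
    samples[statistic_key] = (samples[mean_key] - pop_mean)/(samples[std_key]/math.sqrt(n))
    print("The t-statistic for the sample {} is: {}".format(i,samples[statistic_key]))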


The t-statistic for the sample 0 is: 4.48773030331358
The t-statistic for the sample 1 is: 4.347635037587775
The t-statistic for the sample 2 is: 4.980071958792182
The t-statistic for the sample 3 is: 4.948689828228232
The t-statistic for the sample 4 is: 5.6946054288828565
The t-statistic for the sample 5 is: 3.201277415864884
The t-statistic for the sample 6 is: 5.558808353073399
The t-statistic for the sample 7 is: 5.204316926108675
The t-statistic for the sample 8 is: 3.043592042807715
The t-statistic for the sample 9 is: 5.798983024776275
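The manual formula used in the loop can be cross-checked against `scipy.stats.ttest_1samp`, which computes the same one-sample t-statistic together with a two-sided p-value. The sketch below is only an illustration: the seeded generator `np.random.default_rng(0)` and the variable names are assumptions, so its numbers will not match the output above.

```python
import math
import numpy as np
from scipy import stats

pop_mean = 120
n = 100

# Seeded generator so the sketch is reproducible (the seed is an arbitrary choice)
rng = np.random.default_rng(0)
sample = rng.normal(loc=130.1, scale=21.21, size=n)

# Manual t-statistic using the sample standard deviation (ddof=1)
manual_t = (np.mean(sample) - pop_mean) / (np.std(sample, ddof=1) / math.sqrt(n))

# The same test computed by SciPy
result = stats.ttest_1samp(sample, popmean=pop_mean)

print("manual t-statistic:", manual_t)
print("scipy t-statistic: ", result.statistic)
print("scipy p-value:     ", result.pvalue)
```

Both statistics should agree up to floating-point error, which is a quick way to confirm that `ddof=1` is the right choice for the standard deviation.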


In [11]:
print("Assuming a significance level of 0.05")
print()

for i in range(10):
    sample_name = "sample_" + str(i)
    # Note: 85 is the hypothesized population mean passed to the test here,
    # not the pop_mean = 120 defined earlier; the p-values below are relative to 85.
    p_value = stats.ttest_1samp(samples[sample_name],85)[1]
    print("The p-value of sample {} is: {:-5.3}".format(i,p_value))
    print("The values in the sample are: ")
    print(samples[sample_name])
    sample_mean = "sample_" + str(i) + "_mean"
    print(samples[sample_mean])
    print()
    if p_value < 0.05:
        print("Therefore we discard the null hypothesis Ho, as it's very unlikely to get sample {} given Ho.".format(i))
    else:
        # A large p-value never proves Ho; it only means Ho cannot be rejected.
        print("We cannot reject the null hypothesis Ho, as sample {} is consistent with Ho at this significance level.".format(i))
    print()

Assuming a significance level of 0.05

The p-value of sample 0 is: 4.41e-39
The values in the sample are: 
[109.11105249 177.76510899 136.09131462 101.27706505 108.36805435
 151.73981857 124.00573616 147.21840359 137.86758315 101.72999983
 100.06669447 126.86135185 112.08369421 128.28383337 127.89751457
 106.58212393 145.49576226 121.3327751  146.35815009 116.03522109
 140.8418746  161.64854456 104.88015556 138.68918179 155.8465356
  94.66705665 104.68077743 109.78539634 129.51965409 142.36501306
 135.50323355 121.08827388 139.80097066 139.21198381 133.15162814
 155.55240313 138.43534816 115.06004444 116.87704182 151.69963311
 166.12527419 125.37246482 136.59678857 161.44382399 129.02436288
 119.47775678 104.95326463 115.92874139 150.54082055 141.10629993
 127.09644069 107.5553452  146.64747644 163.08537446 151.56149006
 173.91757251  96.28425566 126.8156937  135.08407394 104.06125744
 102.06440947 155.58515955 146.37034554 137.81552628 126.78069715
 117.64604453 141.38139199 156.39479