# One-Sample z-test - Lab

## Introduction
In this lab you'll perform a few quick tests to help you better understand how hypothesis testing works.

## Objectives
You will be able to:
* Understand and explain use cases for a one-sample z-test
* Set up null and alternative hypotheses
* Calculate the z-statistic and find its probability using z-tables or CDF functions
* Calculate and interpret the p-value to judge the significance of results

## Exercise 1
A fast food chain claims that the mean time to order food at their restaurants is 60 seconds, with a standard deviation of 30 seconds. You decide to put this claim to the test and go to one of the restaurants to observe actual waiting times. You take a sample of 36 customers and find that the mean order time was 75 seconds. Does this finding provide enough evidence to contradict the fast food chain's claim of fast service?

Follow the 5 steps shown in the previous lesson and use $\alpha$ = 0.05.

# State your null and alternative hypotheses

Null hypothesis ($H_0$): the population mean order time is at most 60 seconds, $\mu \leq 60$.

Alternative hypothesis ($H_a$): the population mean order time is greater than 60 seconds, $\mu > 60$.

Significance level: $\alpha$ = 0.05
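
Because the test is right-tailed, the rejection region lies in the upper tail of the standard normal distribution. As a minimal sketch, the critical z-value for $\alpha$ = 0.05 can be found with an inverse CDF instead of a z-table. This uses the standard library's `statistics.NormalDist` (Python 3.8+) rather than `scipy.stats`, only so that it runs without extra packages; the variable names are illustrative.

```python
from statistics import NormalDist

alpha = 0.05

# Right-tailed test: reject H0 when z exceeds the (1 - alpha) quantile
# of the standard normal distribution.
z_critical = NormalDist().inv_cdf(1 - alpha)
print(f'critical z = {z_critical:.3f}')
```

Any z-statistic above roughly 1.645 falls in the rejection region.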

In [4]:
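import scipy.stats as stats
from math import sqrt

# Known quantities from the problem statement
sample_mean = 75  # observed mean order time (seconds)
n = 36            # sample size
sigma = 30        # population standard deviation (seconds)
mu = 60           # claimed population mean (seconds)

# z-statistic: distance of the sample mean from mu in standard errors
z = (sample_mean - mu)/(sigma/sqrt(n))
print(f'z = {z}')

# Area to the left of z under the standard normal curve
cum_prob = stats.norm.cdf(z)
print(f'cumulative probability = {cum_prob}')

# Right-tailed p-value
pvalue = 1 - cum_prob
print(f'pvalue = {pvalue}')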

# (p = 0.0013498980316301035, z = 3.0)

z = 3.0
cumulative probability = 0.9986501019683699
pvalue = 0.0013498980316301035
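
The z-statistic can also be checked by hand. Here $\bar{x}$ is the sample mean, $\mu$ the claimed population mean, $\sigma$ the population standard deviation, and $n$ the sample size (the symbols are notation chosen for this check; the values come from the exercise):

$$z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}} = \frac{75 - 60}{30 / \sqrt{36}} = \frac{15}{5} = 3.0$$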


# Interpret the results in terms of the p-value

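The p-value of about 0.0013 is well below $\alpha$ = 0.05, so you reject the null hypothesis: the result is statistically significant. If the true mean order time were 60 seconds, there would be only about a 0.13% chance of observing a sample mean of 75 seconds or more in a sample of 36 customers. The data therefore contradict the chain's claim.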
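
The calculation and decision rule can be wrapped in a small helper so the same steps are reusable. This is a sketch only: `one_sample_z_test` is a hypothetical name, it covers only the right-tailed case, and it uses the standard library's `statistics.NormalDist` in place of `scipy.stats.norm`.

```python
from math import sqrt
from statistics import NormalDist


def one_sample_z_test(sample_mean, mu, sigma, n):
    """Right-tailed one-sample z-test; returns (z, p_value)."""
    standard_error = sigma / sqrt(n)
    z = (sample_mean - mu) / standard_error
    # Probability of a z-statistic at least this large if H0 is true
    p_value = 1 - NormalDist().cdf(z)
    return z, p_value


alpha = 0.05
z, p_value = one_sample_z_test(sample_mean=75, mu=60, sigma=30, n=36)
reject_null = p_value < alpha
print(f'z = {z:.2f}, p-value = {p_value:.4f}, reject H0: {reject_null}')
```

Comparing the p-value to `alpha` in code makes the decision explicit: the null hypothesis is rejected only when `p_value < alpha`.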

## Exercise 2

Twenty-five students complete a preparation program for the SAT. Here are the SAT scores from the 25 students who completed the program:

```
434 694 457 534 720 400 484 478 610 641 425 636 454 
514 563 370 499 640 501 625 612 471 598 509 531
```

We know that the population average for SAT scores is 500 with a standard deviation of 100.

Are our 25 students' SAT scores significantly higher than the population mean?

*Note that the SAT preparation program claims that it will increase (and not decrease) the SAT score.  So, you can conduct a one-directional test. (alpha = .05).*

In [10]:
# State your hypotheses (the sample mean is computed first for reference)
SAT_scores = [434, 694, 457, 534, 720, 400, 484, 478, 610, 641, 425, 636, 454,
              514, 563, 370, 499, 640, 501, 625, 612, 471, 598, 509, 531]
sample_mean = sum(SAT_scores)/len(SAT_scores)
sample_mean

536.0

Alternative hypothesis ($H_a$): the mean SAT score of students who complete the prep program is greater than the population mean of 500, $\mu > 500$. Under this hypothesis, the sample mean of 536 reflects a real improvement.

Null hypothesis ($H_0$): the mean SAT score of students who complete the prep program is at most 500, $\mu \leq 500$, so the program has little to no effect on the students' scores.

In [11]:
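# sqrt and stats were imported in Exercise 1
sample_mean = 536  # mean SAT score of the 25 students
n = 25             # sample size
sigma = 100        # population standard deviation
mu = 500           # population mean

# z-statistic: distance of the sample mean from mu in standard errors
z = (sample_mean - mu)/(sigma/sqrt(n))
print(f'z = {z}')

# Area to the left of z under the standard normal curve
cum_prob = stats.norm.cdf(z)
print(f'cumulative probability = {cum_prob}')

# Right-tailed p-value, since the program claims to increase scores
pvalue = 1 - cum_prob
print(f'pvalue = {pvalue}')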



# p = 0.03593031911292577, z = 1.8

z = 1.8
cumulative probability = 0.9640696808870742
pvalue = 0.03593031911292577


# Interpret the results in terms of the p-value
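The p-value of about 0.036 is below $\alpha$ = 0.05, so you can reject the null hypothesis: the result is statistically significant. If the program had no effect, there would be only about a 3.6% chance of observing a sample mean of 536 or higher among 25 students. Note that the p-value is not the probability that the null hypothesis is true.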
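
The choice of a one-directional test matters for this result. The sketch below compares the one-tailed p-value with the two-tailed p-value for the same z-statistic. It uses the standard library's `statistics.NormalDist` instead of `scipy.stats`, and the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

alpha = 0.05
z = (536 - 500) / (100 / sqrt(25))  # z-statistic from Exercise 2

# One-tailed: H_a is "mean > 500", so only the upper tail counts
p_one_tailed = 1 - NormalDist().cdf(z)
# Two-tailed: H_a is "mean != 500", so both tails count
p_two_tailed = 2 * (1 - NormalDist().cdf(abs(z)))

print(f'one-tailed p = {p_one_tailed:.4f}, significant: {p_one_tailed < alpha}')
print(f'two-tailed p = {p_two_tailed:.4f}, significant: {p_two_tailed < alpha}')
```

The one-tailed p-value is below 0.05 while the two-tailed p-value (about 0.072) is not, which is why the direction of the test should be fixed before looking at the data.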

## Summary

In this lab, you conducted two one-sample z-tests comparing sample and population means, and in both cases the evidence was strong enough to reject the null hypothesis at $\alpha$ = 0.05. This provides you with a strong foundation to move ahead with more advanced tests and approaches later on.