# p-value method 

After formulating the null and alternate hypotheses, the steps to follow in order to make a decision using the p-value method are as follows:

1. Calculate the z-score for the sample mean point on the distribution.
2. Calculate the p-value from the cumulative probability for the given z-score using the z-table.
3. Make a decision on the basis of the p-value (multiply it by 2 for a two-tailed test) with respect to the given value of α (the significance level).


To find the correct p-value from the z-score, first find the cumulative probability by looking it up in the z-table, which gives you the area under the curve up to that point.

Situation 1: The sample mean is on the right side of the distribution mean (the z-score is positive)

 

Example: z-score for sample point = + 3.02

Cumulative probability of sample point = 0.9987

 

For one-tailed test  →    p = 1 - 0.9987 = 0.0013

For two-tailed test  →    p = 2 (1 - 0.9987) = 2 * 0.0013 = 0.0026

 

Situation 2: The sample mean is on the left side of the distribution mean (the z-score is negative)

 

Example: z-score for sample point = -3.02        

Cumulative probability of sample point = 0.0013

 

For one-tailed test  →    p = 0.0013

For two-tailed test  →    p = 2 * 0.0013 = 0.0026
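The two situations above can be sketched as one small function. This is a minimal illustration, not part of the original notes: it assumes Python's standard-library `statistics.NormalDist` as a stand-in for the z-table, and the name `p_value_from_z` is hypothetical.

```python
from statistics import NormalDist

def p_value_from_z(z, two_tailed=True):
    """Return the p-value for a z-score under the standard normal distribution."""
    # Area under the curve up to z, i.e. the value a z-table lists.
    cumulative = NormalDist().cdf(z)
    # Situation 1 (z positive): use the right tail, 1 - cumulative.
    # Situation 2 (z negative): use the left tail, the cumulative value itself.
    tail = 1 - cumulative if z > 0 else cumulative
    return 2 * tail if two_tailed else tail

print(round(p_value_from_z(3.02, two_tailed=False), 4))
print(round(p_value_from_z(-3.02, two_tailed=False), 4))
```

Both one-tailed calls agree with the table-based value of 0.0013. The two-tailed result can differ from 0.0026 in the fourth decimal place, because the table rounds the cumulative probability before it is doubled.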

 

You can download the z-table from the attachment below. It will be useful in the subsequent questions.

## Problem 1

You are working as a data analyst at an auditing firm. A manufacturer claims that the average life of its product is 36 months. An auditor selects a sample of 49 units of the product, and calculates the average life to be 34.5 months. The population standard deviation is 4 months. Test the manufacturer’s claim at 3% significance level using the p-value method.

 

First, formulate the hypotheses for this two-tailed test, which would be:

 

                                   H₀: μ = 36 months and H₁: μ ≠ 36 months

 

Now, you need to follow the three steps to find the p-value and make a decision.

 

Try out the three-step process by answering the following questions.

Step 1: Calculate the z-score for the sample mean point on the distribution. Calculate the z-score for the sample mean ($\bar{x}$) = 34.5 months.

$$z = \frac{\bar{x} - \mu}{\sigma_{\bar{x}}}, \quad \sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}$$

You can calculate the z-score for the sample mean of 34.5 months using the formula $(\bar{x} - \mu) / (\sigma / \sqrt{n})$, where $n$ is the sample size. This gives you $(34.5 - 36) / (4 / \sqrt{49}) = (-1.5) \times 7/4 = -2.625$, which is taken as $-2.62$ for the z-table lookup. Notice that, since the sample mean lies on the left side of the hypothesised mean of 36 months, the z-score comes out to be negative.

In [1]:
import numpy as np

ho = "the average lifespan of the manufacturer's product is 36 months."
populationN = 49  # sample size n (number of units in the sample)
standerdDeviationSigma = 4  # population standard deviation
sampleMeanXbar = 34.5
populationMeanMue = 36
significanceLevel = 3/100

In [2]:
# Keep the standard error unrounded so that rounding error does not leak into Z
sigmax = standerdDeviationSigma / np.sqrt(populationN)
print("sigma x = ", round(sigmax, 2))

Z = round((sampleMeanXbar - populationMeanMue) / sigmax, 2)
print("Z value =", Z)

sigma x =  0.57
Z value = -2.62


Step 2: Calculate the p-value from the cumulative probability for the given z-score using the z-table.
https://cdn.upgrad.com/UpGrad/temp/0ed10b23-e5f1-4299-9302-39658c3a4e56/z-table.pdf

Find out the p-value for the z-score of -2.62 (corresponding to the sample mean of 34.5 months). 

Hint: The sample mean is on the left side of the distribution and it is a two-tailed test.

From the z-table, the cumulative probability of the sample point is 0.0044.

For one-tailed test  →    p = 0.0044

For two-tailed test  →    p = 2 * 0.0044 = 0.0088

The value in the z-table corresponding to -2.6 on the vertical axis and 0.02 on the horizontal axis is 0.0044. Since the sample mean is on the left side of the distribution and this is a two-tailed test, the p-value would be 2 * 0.0044 = 0.0088.

In [3]:
CumulativeProbabOfSamplePoint = 0.0044 #Cumulative probability of sample point
p = 2 * CumulativeProbabOfSamplePoint #two-tailed test
print("p =", p)

p = 0.0088
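As a cross-check on the table lookup, the cumulative probability can also be computed numerically. The sketch below assumes the standard-library `statistics.NormalDist` in place of the z-table; it is an illustration, not the method used in the original notebook.

```python
from statistics import NormalDist

z = -2.62  # z-score from Step 1

# Numerical counterpart of the z-table lookup: area under the curve up to z.
cumulative = NormalDist().cdf(z)

# The sample mean is on the left side and the test is two-tailed,
# so the p-value is twice the cumulative probability.
p_two_tailed = 2 * cumulative

print("cumulative probability (4 d.p.):", round(cumulative, 4))
print("two-tailed p-value (4 d.p.):", round(p_two_tailed, 4))
```

Rounded to four decimal places, these match the table-based values of 0.0044 and 0.0088.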


Step 3: Make the decision on the basis of the p-value with respect to the given value of α (significance value).

What would be the result of this hypothesis test?

Here, the p-value comes out to be 2 * 0.0044 = 0.0088. Since the p-value is less than the significance level (0.0088 < 0.03), you reject the null hypothesis that the average lifespan of the manufacturer's product is 36 months.

In [4]:
print("Significance Level =", significanceLevel)

if p >= significanceLevel:
    print("Since the p-value is greater than or equal to the significance level (",
          p, ">=", significanceLevel, 
          ") , Fail to reject the null hypothesis", ho)
else:
    print("Since the p-value is less than the significance level (",
          p, "<", significanceLevel, 
          ") , Reject the null hypothesis that", ho)


Significance Level = 0.03
Since the p-value is less than the significance level ( 0.0088 < 0.03 ) , Reject the null hypothesis that the average lifespan of the manufacturer's product is 36 months.
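The three steps can also be collected into reusable helpers. This is a hedged sketch: the function names `z_test_p_value` and `decide` are hypothetical, and `statistics.NormalDist` is assumed as a replacement for the z-table.

```python
from math import sqrt
from statistics import NormalDist

def z_test_p_value(sample_mean, hypothesised_mean, sigma, n, two_tailed=True):
    """Steps 1 and 2: z-score of the sample mean, then its p-value."""
    z = (sample_mean - hypothesised_mean) / (sigma / sqrt(n))
    cumulative = NormalDist().cdf(z)
    # Tail on the side where the sample mean lies. For a one-tailed test this
    # assumes the alternative hypothesis points in that same direction.
    tail = min(cumulative, 1 - cumulative)
    p = 2 * tail if two_tailed else tail
    return z, p

def decide(p, alpha):
    """Step 3: compare the p-value with the significance level."""
    return "reject H0" if p < alpha else "fail to reject H0"

# Problem 1: mean 34.5 months from 49 units, claimed mean 36, sigma 4, alpha 3%.
z, p = z_test_p_value(34.5, 36, 4, 49)
print(round(z, 3), round(p, 4), decide(p, 0.03))
```

Because the z-score is not rounded before the p-value is computed, the p-value may differ slightly from the table-based 0.0088, but the decision is the same.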


## Problem 2

Let’s say you work at a pharmaceutical company that manufactures an antipyretic drug in tablet form, with paracetamol as the active ingredient. An antipyretic drug reduces fever. The amount of paracetamol deemed safe by the drug regulatory authorities is 500 mg. If the value of paracetamol is too low, it will make the drug ineffective and become a quality issue for your company. On the other hand, a value that is too high would become a serious regulatory issue.

 

There are 10 identical manufacturing lines in the pharma plant, each of which produces approximately 10,000 tablets per hour.

 

Your task is to take a few samples, measure the amount of paracetamol in them, and test the hypothesis that the manufacturing process is running successfully, i.e. the paracetamol content is within regulation. You have the time and resources to take about 900 sample tablets and measure the paracetamol content in each.

 

Upon sampling 900 tablets, you get an average content of 510 mg with a standard deviation of 110. What does the test suggest, if you set the significance level at 5%? Should you be happy with the manufacturing process or should you ask the production team to alter the process? Is it a regulatory alarm or a quality issue?

 

Solve the following questions in order to find out the answers to the questions stated above.

 

One thing you can notice here is that the standard deviation of the sample of 900 is given as 110, instead of the population's standard deviation. In such a case, you can use the sample standard deviation (110 in this case) as an approximation of the population standard deviation, which is reasonable for a sample this large.

In [5]:
ho = "the manufacturing process is running successfully, i.e. the paracetamol content is within regulation value of 500 mg."
populationN = 900  # sample size n (number of tablets sampled)
standerdDeviationSampleSigma = 110  # standard deviation of the sample of 900
sampleMeanXbar = 510
populationMeanMue = 500
significanceLevel = 5/100
standerdDeviationSigma = standerdDeviationSampleSigma  # sample SD used as an approximation of the population SD

Calculate the z-score for the sample mean ($\bar{x}$) = 510 mg.

You can calculate the z-score for the sample mean of 510 mg using the formula $(\bar{x} - \mu) / (\sigma / \sqrt{n})$. This gives you $(510 - 500) / (110 / \sqrt{900}) = 10 / (110 / 30) = 2.73$. Notice that, since the sample mean lies on the right side of the hypothesised mean of 500 mg, the z-score comes out to be positive.

In [6]:
# Keep the standard error unrounded so that rounding error does not leak into Z
sigmax = standerdDeviationSigma / np.sqrt(populationN)
print("sigma x = ", round(sigmax, 2))

Z = round((sampleMeanXbar - populationMeanMue) / sigmax, 2)
print("Z value =", Z)

sigma x =  3.67
Z value = 2.73


Find out the p-value for the z-score of 2.73 (corresponding to the sample mean of 510 mg).

The sample mean is on the right side of the distribution mean (the z-score is positive)

Look up the value in the z-table corresponding to 2.7 on the vertical axis and 0.03 on the horizontal axis. This gives you the cumulative area up to the sample mean point. As the sample mean is on the right-hand side and this is a two-tailed test (because you want to test whether the paracetamol content is too low or too high), subtract the cumulative area from 1 and multiply the result by 2 to get the p-value.

Cumulative probability of sample point = 0.9968

For one-tailed test → p = 1 - 0.9968 = 0.0032

For two-tailed test → p = 2 * (1 - 0.9968) = 0.0064

In [7]:
CumulativeProbabOfSamplePoint = 0.9968 #Cumulative probability of sample point
p = 2 * round(1 - CumulativeProbabOfSamplePoint, 4) #two-tailed test
print("P =", p)

P = 0.0064


The value in the z-table corresponding to 2.7 on the vertical axis and 0.03 on the horizontal axis is 0.9968. Since the sample mean is on the right side of the distribution and this is a two-tailed test (because we want to test if the value of the paracetamol is too low or too high), the p-value would be 2 * (1 - 0.9968) = 2 * 0.0032 = 0.0064.

What decision would you make about the manufacturing process from this hypothesis test?

In [8]:
print("Significance Level =", significanceLevel)

if p >= significanceLevel:
    print("Since the p-value is greater than or equal to the significance level (",
          p, ">=", significanceLevel, 
          ") , Fail to reject the null hypothesis", ho)
    print("The manufacturing process is completely fine and need not be changed")
else:
    print("Since the p-value is less than the significance level (",
          p, "<", significanceLevel, 
          ") , Reject the null hypothesis that", ho)
    print("The manufacturing process is not fine and changes need to be made")

Significance Level = 0.05
Since the p-value is less than the significance level ( 0.0064 < 0.05 ) , Reject the null hypothesis that the manufacturing process is running successfully, i.e. the paracetamol content is within regulation value of 500 mg.
The manufacturing process is not fine and changes need to be made


Here, the p-value comes out to be 0.0064. Since the p-value is less than the significance level (0.0064 < 0.05), and a smaller p-value gives you stronger evidence against the null hypothesis, you reject the null hypothesis that the average amount of paracetamol in the tablets is 500 mg. Because the sample mean (510 mg) is higher than 500 mg, this is a regulatory alarm for the company, and the manufacturing process needs to change.
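The last part of the question (regulatory alarm or quality issue?) depends on which side of 500 mg the sample mean falls once the null hypothesis is rejected. A minimal sketch of that reasoning follows; the variable names and verdict strings are hypothetical, and `statistics.NormalDist` is assumed in place of the z-table.

```python
from math import sqrt
from statistics import NormalDist

target_mg = 500    # regulatory value from the problem statement
sample_mean = 510
sample_sd = 110    # sample SD, used as an approximation of the population SD
n = 900
alpha = 0.05

z = (sample_mean - target_mg) / (sample_sd / sqrt(n))
p = 2 * (1 - NormalDist().cdf(abs(z)))  # two-tailed p-value

if p >= alpha:
    verdict = "no evidence of a problem"
elif sample_mean > target_mg:
    verdict = "regulatory alarm (content too high)"
else:
    verdict = "quality issue (content too low)"

print(round(z, 2), round(p, 4), verdict)
```

With the numbers from this problem, the rejection happens on the high side, which is why the conclusion is a regulatory alarm and not a quality issue.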

# Types Of Errors 

There are two types of errors that can result during the hypothesis testing process: type-I error and type-II error.

A type-I error, whose probability is represented by α, occurs when you reject a true null hypothesis.

A type-II error, whose probability is represented by β, occurs when you fail to reject a false null hypothesis.

The power of any hypothesis test is defined as 1 - β. The power of a test and the calculation of β are beyond the scope of this course. You can study more about the power of a test from this link.

Reducing the probability of a type-I error may increase the probability of making a type-II error.
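Although calculating β is outside the scope of these notes, a small numeric sketch can make the trade-off concrete. It assumes a two-tailed z-test with the Problem 1 setup (μ₀ = 36, σ = 4, n = 49) and a purely hypothetical true mean of 35 months; the function name `beta_two_tailed` is also an assumption.

```python
from math import sqrt
from statistics import NormalDist

def beta_two_tailed(alpha, mu0, mu_true, sigma, n):
    """P(fail to reject H0) for a two-tailed z-test when the true mean is mu_true."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)    # critical z-value for this alpha
    shift = (mu_true - mu0) / (sigma / sqrt(n))   # true mean in standard-error units
    # Probability that the z-score lands between -z_crit and +z_crit.
    return std_normal.cdf(z_crit - shift) - std_normal.cdf(-z_crit - shift)

# Hypothetical true mean of 35 months; the other values are from Problem 1.
for alpha in (0.10, 0.05, 0.01):
    beta = beta_two_tailed(alpha, mu0=36, mu_true=35, sigma=4, n=49)
    print("alpha =", alpha, "beta =", round(beta, 3))
```

As α is made smaller, the acceptance region widens, so the printed β values grow.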

Type-I error occurs when the null hypothesis is rejected when it is in fact correct
Feedback : A type-I error occurs when the null hypothesis is true but the sample mean happens to fall in the critical region, so you incorrectly reject it.

Type-II error occurs when the null hypothesis is not rejected when it is in fact incorrect
Feedback : A type-II error occurs when the null hypothesis is not true but the sample mean falls in the acceptance region, so you incorrectly fail to reject the null hypothesis.
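A short simulation can illustrate what a type-I error rate of α means in practice. The sketch below is illustrative only: it reuses the Problem 1 numbers as a hypothetical setup in which the null hypothesis is true by construction, then counts how often a two-tailed z-test at α = 0.05 rejects it anyway.

```python
import random
from math import sqrt
from statistics import NormalDist

random.seed(0)  # fixed seed so the run is repeatable

mu0, sigma, n, alpha = 36, 4, 49, 0.05  # hypothetical setup; H0 is true here
standard_error = sigma / sqrt(n)
std_normal = NormalDist()
trials = 20_000
rejections = 0

for _ in range(trials):
    # Draw one sample mean from its sampling distribution under H0.
    sample_mean = random.gauss(mu0, standard_error)
    z = (sample_mean - mu0) / standard_error
    p = 2 * (1 - std_normal.cdf(abs(z)))
    if p < alpha:
        rejections += 1  # type-I error: H0 is true but was rejected

type_one_rate = rejections / trials
print("observed type-I error rate:", type_one_rate)
```

The observed rate should land close to α, since α is by definition the probability of rejecting a true null hypothesis.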

Suppose the null hypothesis is that a particular new process is as good as or better than the old one. A type-I error is to conclude that:

The old process is better than the new one, when it is not
Feedback : Type-I error means incorrectly rejecting a true null hypothesis. So, a type-I error means that the null hypothesis is true, i.e. the new process is as good as or better than the old one, but you reject it, i.e. you conclude that the old process is better.

## Graded Q 

Suppose you conduct a hypothesis test and observe that the values of the sample mean and sample standard deviation when n = 25 do not lead to the rejection of the null hypothesis. You calculate the p-value as 0.0667. What would happen to the p-value if you observe the same sample mean and sample standard deviation for a larger sample size, say greater than 50?

Decrease

Feedback : With an increase in the sample size, the denominator of the z-score decreases, and thus the absolute value of the z-score increases, which means that the sample mean moves away from the centre of the distribution towards the tails. This means that the p-value decreases. Conceptually, increasing the sample size makes the distribution of sample means narrower, so the same observed sample mean lies further out in the tail. So the p-value will decrease.
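The effect described in the feedback can be checked numerically. This sketch rests on assumptions that the question does not state: it treats the test as one-tailed and uses the normal (z) distribution, as the feedback does. Under those assumptions, keeping the sample mean and standard deviation fixed scales the z-score by √(n / 25).

```python
from math import sqrt
from statistics import NormalDist

std_normal = NormalDist()

p_at_25 = 0.0667
# z-score implied by p = 0.0667 at n = 25 (one-tailed assumption).
z_at_25 = std_normal.inv_cdf(1 - p_at_25)

def p_value_for_n(n):
    """One-tailed p-value for the same sample mean and SD at sample size n."""
    z = z_at_25 * sqrt(n / 25)  # the standard error shrinks by sqrt(25 / n)
    return 1 - std_normal.cdf(z)

for n in (25, 50, 100):
    print(n, round(p_value_for_n(n), 4))
```

The printed p-values fall as n grows, in line with the answer "Decrease".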

Consider the null hypothesis that a process produces no more than the maximum permissible rate of defective items. In this situation, a type-II error would be:

To conclude that the process does not produce more than the maximum permissible rate of defective items, when it actually does

Feedback : Type-II error means not rejecting the incorrect null hypothesis. So, a type-II error would signify that the null hypothesis is actually incorrect, i.e. the process actually produces more than the maximum permissible rate of defective items, but you fail to reject it, i.e. you think it does not produce more than the maximum permissible rate of defective items.