# Chapter 7 - Estimation

Solutions for selected exercises.

In [2]:
import numpy as np
import scipy.stats as stats

## Section 7.1 Large Sample Estimation of a Population Mean

- $\mu$ = Population mean. Assumed unknown in this chapter.
- $\sigma$ = Population standard deviation.
- $s$ = Sample standard deviation.
- $\bar{x}$ = Point estimate of $\mu$
- $E$ = Margin of error
- $[\bar{x} - E, \bar{x} + E]$ = Interval estimate of $\mu$ based on $\bar{x}$
- A 95% confidence interval for $\mu$ means that 95% of all intervals constructed from sample data using the interval formula contain the unknown parameter $\mu$.
- For a 95% confidence interval, $E = 1.96 \frac{\sigma}{\sqrt{n}}$. In general, for confidence level $1 - \alpha$, $E = z_{\alpha/2}\frac{\sigma}{\sqrt{n}}$; when $\sigma$ is unknown and the sample is large (conventionally $n \ge 30$), $s$ replaces $\sigma$, giving $E = z_{\alpha/2}\frac{s}{\sqrt{n}}$.
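
The coverage interpretation above can be checked by simulation. The sketch below is illustrative only: the population parameters (`mu_true`, `sigma_true`), the sample size, the number of trials, and the seed are arbitrary assumptions, not values taken from the exercises.

```python
import numpy as np
import scipy.stats as stats

# Hypothetical population and simulation settings (assumptions for illustration)
mu_true, sigma_true = 50.0, 10.0
n, trials = 40, 10_000
rng = np.random.default_rng(0)

z = stats.norm.ppf(0.975)          # z_{alpha/2} for 95% confidence, about 1.96
E = z * sigma_true / np.sqrt(n)    # margin of error with sigma known

# Draw `trials` samples of size n and compute each sample mean
xbar = rng.normal(mu_true, sigma_true, size=(trials, n)).mean(axis=1)

# Fraction of intervals [xbar - E, xbar + E] that contain the true mean
coverage = np.mean((xbar - E <= mu_true) & (mu_true <= xbar + E))
print(round(z, 2), coverage)
```

The printed coverage should land close to 0.95, though it varies slightly with the seed and the number of trials.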

### Basic Exercises

**Q1.** A random sample is drawn from a population of known standard deviation 11.3. Construct a 90% confidence interval for the population mean based on the information given (not all of the information given need be used).
1. $n = 36$, $\bar{x} = 105.2$, $s = 11.2$
2. $n = 100$, $\bar{x} = 105.2$, $s = 11.2$

In [3]:
def confidence_interval(confidence, mean, s, n):
    """Large-sample z-interval for a mean; returns (interval, mean, E)."""
    alpha = 1 - confidence
    z = stats.norm.ppf(1 - alpha / 2)   # critical value z_{alpha/2}
    E = z * s / np.sqrt(n)              # margin of error
    interval = (mean - E, mean + E)
    return (interval, mean, E)
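
As a sanity check, the same interval can be obtained from `stats.norm.interval` by treating the sample mean as the location and the standard error as the scale. This is a minimal sketch; the input values are hypothetical and chosen only for illustration.

```python
import numpy as np
import scipy.stats as stats

# Hypothetical inputs (not taken from any exercise)
confidence, mean, s, n = 0.95, 100.0, 15.0, 64

# Manual computation, mirroring confidence_interval above
z = stats.norm.ppf(1 - (1 - confidence) / 2)
E = z * s / np.sqrt(n)
manual = (mean - E, mean + E)

# Same interval from SciPy: a normal distribution centred at the sample mean
# with scale equal to the standard error s / sqrt(n)
builtin = stats.norm.interval(confidence, loc=mean, scale=s / np.sqrt(n))

print(manual)
print(builtin)
```

The two tuples should agree up to floating-point rounding.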

In [4]:
# The population standard deviation sigma = 11.3 is known, so it is used instead of s = 11.2
for n in (36, 100):
    (lo, hi), mean, E = confidence_interval(0.90, 105.2, 11.3, n)
    print(f"({lo:.4f}, {hi:.4f}), {mean:.4f}, {E:.4f}")

(102.1022, 108.2978), 105.2000, 3.0978
(103.3413, 107.0587), 105.2000, 1.8587


**Q3.** A random sample is drawn from a population of unknown standard deviation. Construct a 99% confidence interval for the population mean based on the information given.
1. $n = 49$, $\bar{x} = 17.1$, $s = 2.1$
2. $n = 169$, $\bar{x} = 17.1$, $s = 2.1$

In [5]:
print(confidence_interval(0.99, 17.1, 2.1, 49))
print(confidence_interval(0.99, 17.1, 2.1, 169))

((16.32725120893533, 17.872748791064673), 17.1, 0.7727487910646701)
((16.683904497119027, 17.516095502880976), 17.1, 0.41609550288097624)


**Q5.** A random sample of size 144 is drawn from a population whose distribution, mean, and standard deviation are all unknown. The summary statistics are $\bar{x} = 58.2$ and $s = 2.6$.

1. Construct an 80% confidence interval for the population mean μ.
2. Construct a 90% confidence interval for the population mean μ.
3. Comment on why one interval is longer than the other.

In [6]:
print(confidence_interval(0.80, 58.2, 2.6, 144))
print(confidence_interval(0.90, 58.2, 2.6, 144))

((57.922330494132005, 58.477669505868), 58.2, 0.27766950586799677)
((57.84361504749385, 58.55638495250616), 58.2, 0.3563849525061524)
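
For part 3: with $\bar{x}$, $s$, and $n$ fixed, the only quantity that changes is the critical value $z_{\alpha/2}$, which grows with the confidence level, so the 90% interval is wider than the 80% interval. A short sketch of that relationship, using the Q5 summary statistics and a list of confidence levels chosen for illustration:

```python
import numpy as np
import scipy.stats as stats

s, n = 2.6, 144                      # summary statistics from Q5
widths = []
for confidence in (0.80, 0.90, 0.95, 0.99):
    z = stats.norm.ppf(1 - (1 - confidence) / 2)   # critical value z_{alpha/2}
    E = z * s / np.sqrt(n)                          # margin of error
    widths.append(2 * E)                            # interval width is 2E
    print(f"{confidence:.2f}: z = {z:.4f}, width = {2 * E:.4f}")
```

Higher confidence is bought with a wider, less precise interval.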


**Q6.** A random sample of size 256 is drawn from a population whose distribution, mean, and standard deviation are all unknown. The summary statistics are $\bar{x} = 1011$ and $s = 34$.

1. Construct a 90% confidence interval for the population mean μ.
2. Construct a 99% confidence interval for the population mean μ.
3. Comment on why one interval is longer than the other.

In [7]:
xbar, s, n = 1011, 34, 256
# Part 1 asks for 90% confidence (not 80%); part 2 asks for 99%
for confidence in (0.90, 0.99):
    (lo, hi), mean, E = confidence_interval(confidence, xbar, s, n)
    print(f"({lo:.4f}, {hi:.4f}), {mean:.4f}, {E:.4f}")

(1007.5047, 1014.4953), 1011.0000, 3.4953
(1005.5264, 1016.4736), 1011.0000, 5.4736


### Applications

**Q7.** A government agency was charged by the legislature with estimating the length of time it takes citizens to fill out various forms. Two hundred randomly selected adults were timed as they filled out a particular form. The times required had mean 12.8 minutes with standard deviation 1.7 minutes. Construct a 90% confidence interval for the mean time taken for all adults to fill out this form.

In [8]:
xbar, s, n = 12.8, 1.7, 200
print(confidence_interval(0.9, xbar, s, n))

((12.602275183874966, 12.997724816125036), 12.8, 0.1977248161250345)


**Q9.** On every passenger vehicle that it tests an automotive magazine measures, at true speed 55 mph, the difference between the true speed of the vehicle and the speed indicated by the speedometer. For 36 vehicles tested the mean difference was −1.2 mph with standard deviation 0.2 mph. Construct a 90% confidence interval for the mean difference between true speed and indicated speed for all vehicles.

In [9]:
xbar, s, n = -1.2, 0.2, 36
print(confidence_interval(0.9, xbar, s, n))

((-1.2548284542317156, -1.1451715457682843), -1.2, 0.05482845423171575)


**Q11.** A sample of 250 workers aged 16 and older produced an average length of time with the current employer (“job tenure”) of 4.4 years with standard deviation 3.8 years. Construct a 99.9% confidence interval for the mean job tenure of all workers aged 16 or older.

In [10]:
xbar, s, n = 4.4, 3.8, 250
print(confidence_interval(0.999, xbar, s, n))

((3.6091775028380324, 5.190822497161968), 4.4, 0.7908224971619681)


**Q13.** A corporation that owns apartment complexes wishes to estimate the average length of time residents remain in the same apartment before moving out. A sample of 150 rental contracts gave a mean length of occupancy of 3.7 years with standard deviation 1.2 years. Construct a 95% confidence interval for the mean length of occupancy of apartments owned by this corporation.

In [11]:
xbar, s, n = 3.7, 1.2, 150
print(confidence_interval(0.95, xbar, s, n))

((3.5079635329457877, 3.8920364670542127), 3.7, 0.1920364670542124)


**Q15.** In order to estimate the mean amount of damage sustained by vehicles when a deer is struck, an insurance company examined the records of 50 such occurrences, and obtained a sample mean of \$2,785 with sample standard deviation \$221. Construct a 95% confidence interval for the mean amount of damage in all such accidents.

In [12]:
xbar, s, n = 2785, 221, 50
print(confidence_interval(0.95, xbar, s, n))

((2723.7430509637443, 2846.2569490362557), 2785, 61.256949036255754)


### Additional Exercises

**Q17.** For all settings a packing machine delivers a precise amount of liquid; the amount dispensed always has standard deviation 0.07 ounce. To calibrate the machine its setting is fixed and it is operated 50 times. The mean amount delivered is 6.02 ounces with sample standard deviation 0.04 ounce. Construct a 99.5% confidence interval for the mean amount delivered at this setting. Hint: Not all the information provided is needed.

In [13]:
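# The known population standard deviation 0.07 is used; the sample value 0.04 is not needed
xbar, sigma, n = 6.02, 0.07, 50
print(confidence_interval(0.995, xbar, sigma, n))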

((5.992211783423382, 6.047788216576617), 6.02, 0.027788216576617522)


**Q19.** The number of trips to a grocery store per week was recorded for a randomly selected collection of households, with the results shown in the table.

2 2 2 1 4 2 3 2 5 4
2 3 5 0 3 2 3 1 4 3
3 2 1 6 2 3 3 2 4 4

Construct a 95% confidence interval for the average number of trips to a grocery store per week of all households.

In [15]:
x = np.array([2,2,3,2,3,2,2,5,1,1,0,6,4,3,2,2,2,3,3,3,3,2,1,2,5,4,4,4,3,4])
xbar = np.mean(x)
s = np.std(x, ddof=1)  # sample standard deviation (divides by n - 1)
n = len(x)

(lo, hi), mean, E = confidence_interval(0.95, xbar, s, n)
print(f"({lo:.4f}, {hi:.4f}), {mean:.4f}, {E:.4f}")

(2.2904, 3.2429), 2.7667, 0.4762
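
A note on computing $s$ from raw data: `np.std` divides by $n$ by default (`ddof=0`), whereas the sample standard deviation $s$ defined in this section divides by $n - 1$ (`ddof=1`). The two differ slightly for samples of this size, and the difference carries through to $E$. A minimal sketch on made-up data (the array below is hypothetical, not from the exercise):

```python
import numpy as np

x = np.array([2, 4, 4, 4, 5, 5, 7, 9])   # hypothetical data for illustration

pop_sd = np.std(x)             # divides by n      (ddof=0, NumPy default)
sample_sd = np.std(x, ddof=1)  # divides by n - 1  (sample standard deviation s)

print(pop_sd, sample_sd)
```

The `ddof=1` value is always the larger of the two, so it gives a slightly wider interval.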


**Q21.** A town council commissioned a random sample of 85 households to estimate the number of four-wheel vehicles per household in the town. The results are shown in the following frequency table.

|x|0|1|2|3|4|5|
|-|-|-|-|-|-|-|
|f|1|16|28|22|12|6|

Construct a 98% confidence interval for the average number of four-wheel vehicles per household in the town.

In [26]:
x = np.array(sum([[0] * 1, [1] * 16, [2] * 28, [3] * 22, [4] * 12, [5] * 6], []))

xbar = np.mean(x)
s = np.std(x, ddof=1)  # sample standard deviation (divides by n - 1)
n = len(x)

(lo, hi), mean, E = confidence_interval(0.98, xbar, s, n)
print(f"({lo:.4f}, {hi:.4f}), {mean:.4f}, {E:.4f}")

(2.2407, 2.8416), 2.5412, 0.3005
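
The summary statistics can also be computed directly from the frequency table, without expanding it into individual observations. The sketch below is one possible approach: it uses a weighted mean and the $n - 1$ form of the sample standard deviation, then confirms the result against the expanded data built with `np.repeat`. The names `values` and `freqs` are introduced here for illustration.

```python
import numpy as np

values = np.array([0, 1, 2, 3, 4, 5])       # x row of the table
freqs = np.array([1, 16, 28, 22, 12, 6])    # f row of the table

n = freqs.sum()
mean = np.average(values, weights=freqs)                      # weighted mean
s = np.sqrt(np.sum(freqs * (values - mean) ** 2) / (n - 1))   # sample sd (n - 1)

# Expanding the table into raw observations gives the same numbers
x = np.repeat(values, freqs)
print(n, mean, s)
print(len(x), x.mean(), x.std(ddof=1))
```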


## Section 7.2 Small Sample Estimation of a Population Mean