## Chapter 06
# Confidence Intervals

Adapted from ["Elementary Statistics - Picturing the World" 6th edition](https://www.amazon.com/Elementary-Statistics-Picturing-World-6th/dp/0321911210/)

In [1]:
from notebook.services.config import ConfigManager
cm = ConfigManager()
cm.update('livereveal', {
        'scroll': True,
        'width': "100%",
        'height': "100%",
})

{'width': '100%', 'height': '100%', 'scroll': True}


## 6.1 <br/>Confidence Intervals for the Mean ($\sigma$ Known)

### Estimating Population Parameters

- In this chapter, we will learn an important technique of statistical inference: using sample statistics to estimate the value of an unknown population parameter.
- In this section and the next, we will learn how to use sample statistics to estimate the population parameter $\mu$ when the population standard deviation $\sigma$ is known (this section) or when $\sigma$ is unknown (the next section).
- To make such an inference, begin by finding a **point estimate**.


### Point Estimate

- A **point estimate** is a single value estimate for a population parameter. 
- The most unbiased point estimate of the population mean $\mu$ is the sample mean $\bar{x}$.
- The validity of an estimation method is increased when you use a sample statistic that is unbiased and has low variability. 
- A statistic is unbiased if it does not overestimate or underestimate the population parameter.

### Finding a Point Estimate [example 1]

- An economics researcher is collecting data about grocery store employees in a county. 
- The data listed below represents a random sample of the number of hours worked by $40$ employees from several grocery stores in the county. 
- Find a point estimate of the population mean $\mu$.

![](./image/6_1_ex_1_point_estimate.png)

### Finding a Point Estimate [solution]


- The sample mean of the data is:

$\bar{x} = \frac{\sum{x}}{n} = \frac{1184}{40} = 29.6$

- The point estimate for the mean number of hours worked by grocery store employees in this county is $29.6$ hours.
- The probability that the population mean is exactly $29.6$ is virtually zero. 
- Instead of estimating $\mu$ to be exactly $29.6$ using a point estimate, we can estimate that $\mu$ lies in an interval. 
- This is called making an **interval estimate**.
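The calculation in this solution can be reproduced in a few lines of Python. The 40 data values appear only in the image, so this minimal sketch works from the total ($\sum x = 1184$) and the sample size ($n = 40$) quoted above. The helper name `point_estimate` and the short `hours_subset` list are hypothetical and serve only as an illustration.

```python
from statistics import fmean


def point_estimate(sample):
    """Return the sample mean, used as the point estimate of mu."""
    return fmean(sample)


# Totals quoted in the solution (the raw data is only available in the image).
total_hours = 1184
n = 40
x_bar = total_hours / n
print(f"x_bar = {x_bar:.1f} hours")

# Hypothetical mini-sample, NOT the textbook data, to show the helper in use.
hours_subset = [30, 26, 33, 26, 26]
subset_mean = point_estimate(hours_subset)
print(f"mean of hypothetical subset = {subset_mean:.1f} hours")
```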


### Interval Estimate

- An **interval estimate** is an interval, or range of values, used to estimate a population parameter.
- To form an interval estimate, use the point estimate as the center of the interval, and then add and subtract a margin of error. 
- For instance, if the margin of error is $2.1$, then an interval estimate would be given by 
 - $29.6 \pm 2.1$ or 
 - $27.5 < \mu < 31.7$. 
- The point estimate and interval estimate are shown in the figure.

![](./image/6_1_interval_estimate.png)
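As a small sketch of the arithmetic behind the interval $29.6 \pm 2.1$, the endpoints can be computed by subtracting and adding the margin of error. The function name `interval_estimate` is an assumption made for this example, and the margin of error is simply taken as given.

```python
def interval_estimate(point_estimate, margin_of_error):
    """Return (lower, upper) endpoints centered on the point estimate."""
    lower = point_estimate - margin_of_error
    upper = point_estimate + margin_of_error
    return lower, upper


# Values from the example: point estimate 29.6, margin of error 2.1.
lower, upper = interval_estimate(29.6, 2.1)
print(f"{lower:.1f} < mu < {upper:.1f}")  # prints "27.5 < mu < 31.7"
```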

- Before finding a margin of error for an interval estimate, we should first determine how confident we need to be that our interval estimate contains the population mean $\mu$.

### Level of Confidence

- The **level of confidence** $c$ is the probability that the interval estimate contains the population parameter, assuming that the estimation process is repeated a large number of times.
- We know from the Central Limit Theorem that when $n \geq 30$, the sampling distribution of sample means is approximately a normal distribution.
- The level of confidence $c$ is the area under the standard normal curve between the critical values, $-z_{c}$ and $z_{c}$.
- **Critical values** are values that separate sample statistics that are probable from sample statistics that are improbable, or unusual.
- We can see from the figure shown below that $c$ is the percent of the area under the normal curve between $-z_{c}$ and $z_{c}$. 

![](./image/6_1_level_of_confidence_graph.png)
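The "repeated a large number of times" reading of $c$ can be illustrated with a rough simulation. This is only a sketch under stated assumptions: the population is taken to be normal with hypothetical values $\mu = 30$ and $\sigma = 7$, each sample has $n = 40$ values, and the margin of error is fixed at $2.1$. None of these population values come from the text. The fraction of simulated intervals that contain $\mu$ then approximates the level of confidence for that margin.

```python
import random
from statistics import fmean


def coverage(mu, sigma, n, margin, trials=2000, seed=0):
    """Estimate how often the interval x_bar +/- margin contains mu."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    hits = 0
    for _ in range(trials):
        # Draw one sample of size n from the assumed normal population.
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        x_bar = fmean(sample)
        # Count the trial when the interval captures the true mean.
        if x_bar - margin < mu < x_bar + margin:
            hits += 1
    return hits / trials


# Hypothetical population parameters, chosen for illustration only.
estimated_c = coverage(mu=30.0, sigma=7.0, n=40, margin=2.1)
print(f"fraction of intervals containing mu: {estimated_c:.3f}")
```

A wider margin of error raises this fraction and a narrower one lowers it, which is why the level of confidence is chosen before the margin of error is computed.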

- The area remaining is $1 - c$, so the area in each tail is $\frac{1}{2}(1 - c)$. 
- For instance, if $c = 90\%$, then $5\%$ of the area lies to the left of $-z_{c} = -1.645$ and $5\%$ lies to the right of $z_{c} = 1.645$, as shown in the table.
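The tail-area reasoning above can be checked numerically. The sketch below uses `NormalDist` from Python's standard library (an assumption of this example; a printed standard normal table gives the same values) to find the $z_{c}$ that leaves $\frac{1}{2}(1 - c)$ in each tail. The function name `critical_value` is hypothetical.

```python
from statistics import NormalDist

standard_normal = NormalDist()  # mean 0, standard deviation 1


def critical_value(c):
    """Return z_c such that the area between -z_c and z_c equals c."""
    tail_area = (1 - c) / 2
    # The area to the left of z_c is everything except the right tail.
    return standard_normal.inv_cdf(1 - tail_area)


for c in (0.90, 0.95, 0.99):
    z_c = critical_value(c)
    # Recover c as the area between the two critical values.
    area = standard_normal.cdf(z_c) - standard_normal.cdf(-z_c)
    print(f"c = {c:.2f}: z_c = {z_c:.3f}, area between = {area:.2f}")
```

For $c = 0.90$ this gives $z_{c} \approx 1.645$, matching the value quoted above.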
