# Tutorial 9: Confidence Intervals for One Proportion #

## Objectives: ##
To become familiar with building and interpreting confidence intervals for one proportion and practice using the formula provided below.

## Instructions: ##
* Do NOT round any of the values unless you are explicitly told to do so in the question.
* You can compute the required values using R as your calculator.

## Formulae: ##
A confidence interval is calculated by finding
$$\text{point estimate} \pm z^* \times SE$$

Thus, the confidence interval for one proportion is calculated by finding
$$ \hat p \pm z^* \times\sqrt{\frac{\hat p \times (1-\hat p)}{n}}$$
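
As a rough illustration of the formula above, the sketch below computes a 95% confidence interval in R. The counts (200 successes in a sample of 500) are made up for this example and are not taken from the tutorial's data; the variable names are likewise only illustrative.

```r
# Hypothetical sample (NOT from the tutorial data): 200 successes out of 500
n <- 500
p_hat <- 200 / n                     # point estimate of the proportion

z_star <- 1.96                       # critical value for 95% confidence
se <- sqrt(p_hat * (1 - p_hat) / n)  # standard error of p_hat

lower <- p_hat - z_star * se         # lower limit of the interval
upper <- p_hat + z_star * se         # upper limit of the interval

c(lower = lower, upper = upper)      # roughly 0.357 to 0.443
```

The same four steps (point estimate, standard error, lower limit, upper limit) apply to any one-proportion interval; only `p_hat`, `n`, and `z_star` change.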

## Tools: ##
If a graph would help with any of these questions, you can use the `normalplot` function defined below. Remember to run its code block first.
* Recall that to draw a normal curve with mean `m` and standard deviation `sd`, shaded from `min` to `max`, enter the command:
  * `normalplot(m, sd, c(min, max))`
* NOTE: You are not required to graph for any of this week's questions.

In [6]:
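normalplot <- function(m, sd, region = NULL) {
  # Plot the normal density over mean +/- 3.5 standard deviations
  x <- seq(m - 3.5 * sd, m + 3.5 * sd, length.out = 1000)
  y <- dnorm(x, m, sd)
  plot(x, y, type = "l", xlab = "", ylab = "", bty = "n", yaxt = "n")
  # Shade the area under the curve between region[1] and region[2]
  if (length(region) == 2) {
    z <- x[x > region[1] & x < region[2]]
    polygon(c(region[1], z, region[2]),
            c(0, dnorm(z, m, sd), 0), col = "gray")
  }
  abline(v = m)  # vertical line at the mean
  abline(h = 0)  # horizontal baseline
}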
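
As a usage sketch, the shaded region of a standard normal curve between -1.96 and 1.96 corresponds to the middle 95% of the distribution, which is where the value $z^* = 1.96$ comes from. The snippet below checks that area with `pnorm` and draws the plot only if `normalplot` has already been defined in the session.

```r
# Area under the standard normal curve between -1.96 and 1.96
middle_area <- pnorm(1.96) - pnorm(-1.96)
middle_area  # approximately 0.95

# Draw the matching picture, assuming the normalplot cell has been run
if (exists("normalplot")) {
  normalplot(0, 1, c(-1.96, 1.96))
}
```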

## Data Information: ##
The data for this tutorial was inspired by two sources:
* "Canadian postsecondary enrolments and graduates, 2016/2017." Statistics Canada. 2018-11-28. https://www150.statcan.gc.ca/n1/daily-quotidien/181128/dq181128c-eng.htm.
* "The evolution of language populations in Canada, by mother tongue, from 1901 to 2016." Statistics Canada. 2018-02-21. https://www150.statcan.gc.ca/n1/pub/11-630-x/11-630-x2018001-eng.htm.

## Question 1. Construct a Confidence Interval ##

Suppose a sample of students enrolled in Canadian universities yielded the following data:

<table>				
<tr><th>	Field of study	</th><th>	Number of students (2016/2017)	</th></tr>
<tr><td>	Personal improvement and leisure	</td><td>	3	</td></tr>
<tr><td>	Education	</td><td>	77	</td></tr>
<tr><td>	Visual and performing arts, and communications technologies	</td><td>	41	</td></tr>
<tr><td>	Humanities	</td><td>	159	</td></tr>
<tr><td>	Social and behavioural sciences, and law	</td><td>	225	</td></tr>
<tr><td>	Business, management and public administration	</td><td>	262	</td></tr>
<tr><td>	Physical and life sciences, and technologies	</td><td>	136	</td></tr>
<tr><td>	Mathematics, computer and information sciences	</td><td>	55	</td></tr>
<tr><td>	Architecture, engineering and related technologies	</td><td>	136	</td></tr>
<tr><td>	Agriculture, natural resources and conservation	</td><td>	23	</td></tr>
<tr><td>	Health and related fields	</td><td>	170	</td></tr>
<tr><td>	Personal, protective and transportation services	</td><td>	7	</td></tr>
<tr><td>	Other fields of study	</td><td>	42	</td></tr>
<tr><th>	Total	</th><th>	1336	</th></tr>
</table>				


* a. What proportion of all the students in this survey reported their field of study to be mathematics, computer and information sciences?
* b. For a 95% confidence interval, use $z^* = 1.96$. Find the 95% confidence interval for the proportion of students studying mathematics, computer and information sciences.
* c. Write a sentence, including your 95% confidence interval, summarising the results of the study.
* d. Explain why the 93% confidence interval will be narrower than the 95% confidence interval.
* e. What value do you use for $z^*$ to create a 93% confidence interval (rounded to 2 decimal places)? [You can use `qnorm` to find this value.]
* f. Find the 93% confidence interval for the proportion of students studying mathematics, computer and information sciences.
* g. Write a sentence, including your 93% confidence interval, summarising the results of the study.
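
The link between a confidence level and $z^*$ can be sketched with `qnorm`: a confidence level $C$ leaves $(1 - C)/2$ in each tail, so $z^*$ is the quantile with area $1 - (1 - C)/2$ to its left. The helper name `z_star_for` is hypothetical, and the levels shown are chosen only for illustration.

```r
# Hypothetical helper: critical value z* for a given confidence level
z_star_for <- function(conf_level) {
  tail_area <- (1 - conf_level) / 2  # area in each tail
  qnorm(1 - tail_area)               # quantile leaving tail_area to the right
}

z_95 <- z_star_for(0.95)  # about 1.96, matching the value given in part b
z_90 <- z_star_for(0.90)  # about 1.64
z_99 <- z_star_for(0.99)  # about 2.58

round(c(z_90 = z_90, z_95 = z_95, z_99 = z_99), 2)
```

Lower confidence levels give smaller values of $z^*$, and therefore narrower intervals for the same sample.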

### Answer 1.a. ##

In [None]:
Calculate the proportion.

Answer with a sentence.

### Answer 1.b. ##

In [None]:
Calculate the lower limit.

In [None]:
Calculate the upper limit.

### Answer 1.c. ##

Answer with a sentence.

### Answer 1.d. ##

Answer with a sentence.

### Answer 1.e. ##

In [None]:
Calculate z*

Answer with a sentence.

### Answer 1.f. ##

In [None]:
Calculate the lower limit.

In [None]:
Calculate the upper limit.

### Answer 1.g. ##

Answer with a sentence.

## Question 2. Thinking critically about our results ##

<blockquote>Although their enumeration was certainly not complete, the 1901 Census counted close to 77,000 people whose mother tongue is an Aboriginal language, representing 1.4% of the population.<br>
<cite>"The evolution of language populations in Canada, by mother tongue, from 1901 to 2016." Statistics Canada. 2018-02-21. https://www150.statcan.gc.ca/n1/pub/11-630-x/11-630-x2018001-eng.htm.</cite></blockquote>

Suppose that in 2015 a sample of 5,000 Canadians found that 0.6% identified their mother tongue as an Aboriginal language.

* a. Create a 95% confidence interval for the true proportion of Canadians whose mother tongue was an Aboriginal language in 1901.
* b. Why should we be cautious/skeptical of the interval we just computed?
* c. Create a 95% confidence interval for the true proportion of Canadians whose mother tongue was an Aboriginal language in 2015.
* d. What do the two confidence intervals tell you about the change in the proportion of Canadians whose mother tongue was an Aboriginal language? What information would make you more confident in your assessment?
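
When comparing two intervals, one informal check is whether they overlap. The sketch below uses made-up proportions and sample sizes (not the values in this question), and the helper `prop_ci` is hypothetical. Non-overlapping intervals suggest a real difference, but this is only a rough guide, not a formal test, and it assumes both samples were collected in a trustworthy way.

```r
# Hypothetical helper: confidence interval for one proportion
prop_ci <- function(p_hat, n, z_star = 1.96) {
  se <- sqrt(p_hat * (1 - p_hat) / n)
  c(lower = p_hat - z_star * se, upper = p_hat + z_star * se)
}

# Made-up values for illustration only
ci_a <- prop_ci(0.10, 400)  # roughly 0.071 to 0.129
ci_b <- prop_ci(0.20, 400)  # roughly 0.161 to 0.239

# Two intervals overlap when each one starts before the other ends
overlap <- ci_a[["lower"]] <= ci_b[["upper"]] &&
  ci_b[["lower"]] <= ci_a[["upper"]]
overlap  # FALSE: the intervals do not overlap
```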

### Answer 2.a. ##

In [None]:
Calculate the lower limit.

In [None]:
Calculate the upper limit.

Interpret the confidence interval. Answer with a sentence.

### Answer 2.b. ##

Answer with a sentence.

### Answer 2.c. ##

In [None]:
Calculate the lower limit.

In [None]:
Calculate the upper limit.

Interpret the confidence interval. Answer with a sentence.

### Answer 2.d. ##

Answer with a sentence.

---
---
#### This tutorial is released under a Creative Commons Attribution-ShareAlike 3.0 Unported license.

This tutorial has been adapted from a lab that was adapted for OpenIntro by Andrew Bray and Mine Çetinkaya-Rundel from a lab written by Mark Hansen of UCLA Statistics.

---
---