# Chapter 25 - Paired Samples and Blocks

## Paired Data

* **paired** data: data captured for:
  * subjects compared with themselves before and after treatment,
  * two related (non-independent) measurements
* we typically care about the _difference_ in measurements within a pair; we treat the _difference_ as the data (effectively ignoring the original values)
* because the differences form a single sample, we can apply a simple one-sample $t$-test to them
* **paired $t$-test**: just a one-sample $t$-test for the means of these pairwise differences
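
A minimal sketch of the idea above, reducing paired measurements to a single sample of differences. The `before`/`after` values are made-up illustrative numbers, not data from the chapter, and the variable names are assumptions.

```python
from statistics import mean, stdev

# Hypothetical before/after measurements for five subjects (made-up data).
before = [10, 12, 9, 14, 11]
after = [12, 15, 10, 18, 12]

# Pair by position: element i of each list belongs to the same subject.
differences = [a - b for a, b in zip(after, before)]

d_bar = mean(differences)  # mean of the pairwise differences
s_d = stdev(differences)   # sample standard deviation of the differences

print(differences)
print(d_bar, s_d)
```

From here on, only `differences` is analyzed; the original values are no longer needed.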

## Assumptions and Conditions

### Paired Data Assumption

* the data must be paired
* must justify claim that data are paired and not independent

### Independence Assumption

* **independence assumption**: the _differences_ must be independent of one another across pairs; the two original measurements within a pair are not independent (that is why they are paired)
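* **randomization condition**: the pairs should be a random sample, or the treatments (or their order) should be randomly assigned within each pair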

### Normal Population Assumption

* we need to assume that the population of _differences_ follows a Normal model
* **nearly normal condition**: use a histogram or Normal probability plot of the _differences_ to check this

### The Paired $t$-Test

When the conditions are met, we are ready to test whether the mean of paired differences is significantly different from zero.  We test the hypothesis

\begin{equation}
H_0: \mu_d = \Delta_0
\end{equation}

where the $d$'s are the pairwise differences and $\Delta_0$ is almost always 0.

We use the statistic

\begin{equation}
t_{n-1} = \frac
{\bar{d} - \Delta_0}
{SE(\bar{d})}
\end{equation}

where $\bar{d}$ is the mean of the pairwise differences, $n$ is the number of _pairs_, and 

\begin{equation}
SE(\bar{d}) = \frac
{s_d}
{\sqrt{n}}
\end{equation}

$SE(\bar{d})$ is the ordinary standard error for the mean, applied to the differences.

When the conditions are met and the null hypothesis is true, we can model the sampling distribution of this statistic with a Student's $t$-model with $n - 1$ degrees of freedom, and we use that model to obtain a P-value.
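
The mechanics can be sketched in plain Python. This is an illustration under stated assumptions, not a reference implementation: the data are made up, the helper names (`t_pdf`, `t_upper_tail`, `paired_t_test`) are hypothetical, and the tail area of the $t$-model is approximated with Simpson's rule rather than taken from a table or a statistics library.

```python
import math
from statistics import mean, stdev

def t_pdf(x, df):
    """Density of a Student's t-model with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def t_upper_tail(t, df, steps=2000):
    """Approximate P(T > t) for t >= 0 as 0.5 minus the area on [0, t].

    The area is computed with Simpson's rule (steps must be even).
    """
    h = t / steps
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

def paired_t_test(first, second, delta0=0.0):
    """Two-sided paired t-test of H0: mu_d = delta0, with d = second - first."""
    d = [b - a for a, b in zip(first, second)]
    n = len(d)                        # n is the number of pairs
    se = stdev(d) / math.sqrt(n)      # SE(d-bar) = s_d / sqrt(n)
    t = (mean(d) - delta0) / se
    df = n - 1
    p_value = 2 * t_upper_tail(abs(t), df)
    return t, df, p_value

# Hypothetical before/after measurements (made-up data).
before = [10, 12, 9, 14, 11]
after = [12, 15, 10, 18, 12]
t_stat, df, p_value = paired_t_test(before, after)
print(t_stat, df, p_value)
```

For these made-up numbers the statistic is about 3.77 on 4 degrees of freedom, and the two-sided P-value is about 0.02.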

## Step-by-Step Example: A Paired $t$-Test

* Plan:
  * state what you want to know
  * identify the parameter we wish to estimate
  * identify the variables and check the W's
* Hypotheses:
  * state the null and alternative hypotheses
  * the difference should be in the same units as the original values
* Model:
  * think about the assumptions and check the conditions
  * state why you think data are paired
  * identify source of randomization
  * make a picture: plot distribution of _differences_
  * specify the sampling distribution model
  * choose the method
* Mechanics:
  * $n$ is the number of pairs
  * find mean and standard deviation of the differences
  * find the standard error and $t$-score
  * make a picture: sketch a $t$-model
  * find the P-value
* Conclusion:
  * link the P-value to your decision about $H_0$, and state your conclusion in context

## Confidence Intervals for Matched Pairs

### Paired $t$-Interval

When the conditions are met, we are ready to find the confidence interval for the mean of the paired differences.  The confidence interval is

\begin{equation}
\bar{d} \pm t^*_{n-1} \times SE(\bar{d})
\end{equation}

where the standard error of the mean difference is $SE(\bar{d}) = \frac{s_d}{\sqrt{n}}$.

The critical value $t^*$ from the Student's $t$-model depends on the particular confidence level, $C$, that you specify and on the degrees of freedom, $n - 1$, which is based on the number of pairs, $n$.
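
A small sketch of the interval calculation, assuming made-up data with $n = 5$ pairs and a 95% confidence level. The critical value $t^*_4 \approx 2.776$ is hard-coded from a standard $t$-table; for another confidence level or number of pairs it would have to be looked up again.

```python
import math
from statistics import mean, stdev

# Hypothetical before/after measurements (made-up data).
before = [10, 12, 9, 14, 11]
after = [12, 15, 10, 18, 12]

differences = [a - b for a, b in zip(after, before)]
n = len(differences)                      # number of pairs

d_bar = mean(differences)
se = stdev(differences) / math.sqrt(n)    # SE(d-bar) = s_d / sqrt(n)

# Assumption: 95% confidence with n - 1 = 4 degrees of freedom,
# so t* is about 2.776 (value taken from a t-table).
t_star = 2.776

margin = t_star * se
lower, upper = d_bar - margin, d_bar + margin
print(lower, upper)
```

The interval is centered on $\bar{d}$, and its endpoints are in the same units as the original measurements.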

## Step-by-Step Example: A Paired $t$-Interval

* Plan:
  * state what we want to know
  * identify the variables and check the W's
  * identify the parameter you wish to estimate
* Model:
  * think about the assumptions and check the conditions
  * make a picture: histogram or Normal probability plot of differences
  * state the sampling distribution of the model
  * choose your method
* Mechanics:
  * $n$ is the number of pairs
  * $\bar{d}$ is the mean difference
  * $s_d$ is the standard deviation of the differences
  * include units with the statistics
  * find the critical value for the chosen confidence level and $n - 1$ degrees of freedom
* Conclusion:
  * interpret the confidence interval in context

## Effect Size

## Blocking

### *The Sign Test Again

## What Can Go Wrong?

* Don't use a two-sample $t$-test when you have paired data.
* Don't use a paired-$t$ method when the samples aren't paired.
* Don't forget outliers; with paired data, the outliers that matter are in the _differences_.
* Don't look for the difference between the means of paired groups with side-by-side boxplots.
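
To illustrate the first warning, the sketch below computes both standard errors on the same made-up paired data. The two-sample formula $\sqrt{s_1^2/n_1 + s_2^2/n_2}$ is the usual unpooled standard error for independent groups; it is applied here only to show what goes wrong, and all names and numbers are illustrative assumptions.

```python
import math
from statistics import mean, stdev, variance

# Hypothetical before/after measurements (made-up data).
before = [10, 12, 9, 14, 11]
after = [12, 15, 10, 18, 12]
n = len(before)

# Wrong for paired data: treats the two columns as independent groups.
se_two_sample = math.sqrt(variance(before) / n + variance(after) / n)
t_two_sample = (mean(after) - mean(before)) / se_two_sample

# Right for paired data: work with the within-pair differences.
differences = [a - b for a, b in zip(after, before)]
se_paired = stdev(differences) / math.sqrt(n)
t_paired = mean(differences) / se_paired

print(se_two_sample, t_two_sample)
print(se_paired, t_paired)
```

The numerator is the same either way, since the mean of the differences equals the difference of the means. Only the standard error changes: in this example the subject-to-subject variation inflates the two-sample standard error, so the two-sample statistic is much smaller than the paired one.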

## What Have We Learned?

* [p. 623]