#### Recall: In a comparative experiment, treatments are assigned to experimental units, and responses are observed.

#### A major analytical limitation is experimental error: 
   ##### the degree to which the responses differ among different experimental units that are treated in the same way.

   ##### Blocking is a method for increasing precision when comparing treatments: it isolates and analytically removes some of the contributions to experimental error (Marden, 1995).
   
#### In general, blocks are groups of experimental units, designated by the experimenter, that provide a structure for the way treatments are randomized (Marden, 1995).

#### In a randomized complete block design (RCBD) with T treatments,
    1. each block has exactly T experimental units
    2. the blocks are disjoint
    3. every treatment is assigned to exactly one experimental unit in each block
    4. treatments are completely randomized within each block, and independently in different blocks
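The four conditions above can be illustrated with a short randomization sketch in base R. The sizes (4 treatments, 5 blocks), the seed, and the names `treatments`, `n_blocks`, and `layout` are assumptions chosen for illustration; this is one simple way to draw such a layout, not a prescribed procedure.

```r
set.seed(1)  # arbitrary seed, only so the sketch is reproducible

treatments <- c("A", "B", "C", "D")  # hypothetical treatment labels
n_blocks <- 5                        # hypothetical number of blocks

# One independent random permutation of the treatments per block:
# replicate() returns a 4 x 5 matrix, so transpose to get blocks in rows
layout <- t(replicate(n_blocks, sample(treatments)))
dimnames(layout) <- list(
  paste0("Block", seq_len(n_blocks)),
  paste0("Unit", seq_along(treatments))
)

layout
```

Each row is a block, each column an experimental unit within the block, and every treatment appears exactly once per row.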


## Example 1
    An experiment was performed to compare methods for the production of penicillin. A total of T = 4 variants of the basic manufacturing process (treatments A, B, C, and D) were studied.
    It was known that an important raw material, corn steep liquor, was quite variable. Blends of corn steep liquor sufficient for 4 runs could be made, allowing the experimenters to run all T = 4 treatments within each block (blend of corn steep liquor). A total of 5 such blocks were used, for a total of 5 × 4 = 20 runs. The experimental units were the successive runs. As dictated by the design, the order in which the treatments were run was randomized within each block.

In [1]:
install.packages("faraway", repos='http://cran.us.r-project.org')




In [2]:
library(faraway)
data(penicillin)

In [3]:
penicillin

Unnamed: 0,treat,blend,yield
1,A,Blend1,89
2,B,Blend1,88
3,C,Blend1,97
4,D,Blend1,94
5,A,Blend2,84
6,B,Blend2,77
7,C,Blend2,92
8,D,Blend2,79
9,A,Blend3,81
10,B,Blend3,87


#### You can verify that variables are factors in R

In [4]:
class(penicillin$treat)
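To check every column at once rather than one at a time, a sketch along the following lines may help. The data frame `toy` and its values are hypothetical stand-ins (not the penicillin data), used only so the snippet runs on its own. Since R 4.0.0, `data.frame()` no longer converts strings to factors by default, so an explicit conversion is sometimes needed.

```r
# Hypothetical toy data; the values are made up for illustration only
toy <- data.frame(
  treat = c("A", "B", "A", "B"),
  blend = c("Blend1", "Blend1", "Blend2", "Blend2"),
  yield = c(90, 85, 88, 80)
)

# Convert the design variables to factors explicitly
toy$treat <- factor(toy$treat)
toy$blend <- factor(toy$blend)

# Report, column by column, whether each variable is a factor
is_fac <- sapply(toy, is.factor)
is_fac
```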


The usual model equation for data from an experiment with a randomized complete block design has a term for the treatment factor and a term for the blocking factor:

$$
Y_{ij} = \mu + \tau_i + \beta_j + \epsilon_{ij}
$$

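Here $\mu$ is the overall mean, $\tau_i$ the effect of treatment $i$, $\beta_j$ the effect of block $j$, and $\epsilon_{ij}$ the random error. Blocks and treatments are assumed not to interact, meaning the treatment effects are taken to be the same in every block, and therefore the model has no interaction term.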
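As a worked complement to the model equation, the total variation can be split into treatment, block, and error pieces. The notation here is an assumption added for illustration: $t$ treatments ($i = 1, \dots, t$), $b$ blocks ($j = 1, \dots, b$), and dots in subscripts denoting averages over that index.

$$
\sum_{i=1}^{t}\sum_{j=1}^{b} (Y_{ij} - \bar{Y}_{..})^2
= b\sum_{i=1}^{t} (\bar{Y}_{i.} - \bar{Y}_{..})^2
+ t\sum_{j=1}^{b} (\bar{Y}_{.j} - \bar{Y}_{..})^2
+ \sum_{i=1}^{t}\sum_{j=1}^{b} (Y_{ij} - \bar{Y}_{i.} - \bar{Y}_{.j} + \bar{Y}_{..})^2
$$

The degrees of freedom split the same way:

$$
bt - 1 = (t - 1) + (b - 1) + (t - 1)(b - 1)
$$

With $t = 4$ and $b = 5$, as in Example 1, this gives $19 = 3 + 4 + 12$.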

In [5]:
penimod <- lm(yield ~ blend + treat, data=penicillin)

In [6]:
anova(penimod)

Unnamed: 0,Df,Sum Sq,Mean Sq,F value,Pr(>F)
blend,4,264,66.0,3.504425,0.04074617
treat,3,70,23.33333,1.238938,0.3386581
Residuals,12,226,18.83333,,


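What if we instead analyzed these data as a two-factor factorial design with a blend-by-treatment interaction, as we might for a completely randomized design (CRD)?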

In [7]:
crd_mod <- lm(yield ~ blend*treat, data=penicillin)

In [8]:
anova(crd_mod)

In anova.lm(crd_mod): ANOVA F-tests on an essentially perfect fit are unreliable

Unnamed: 0,Df,Sum Sq,Mean Sq,F value,Pr(>F)
blend,4,264,66.0,,
treat,3,70,23.33333,,
blend:treat,12,226,18.83333,,
Residuals,0,0,,,


#### Q1. Why is there a perfect fit?

#### Relative efficiency is a general concept in estimation theory: two statistical procedures are compared by the amount of data each needs to achieve a given level of precision. It is typically expressed as a ratio of two sample sizes

$$
\frac{n_1}{n_2}
$$

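where $n_1$ and $n_2$ are the sample sizes that procedures 1 and 2 need to reach the same precision, so the ratio tells you how many times more data procedure 1 requires than procedure 2. The definition of relative efficiency in Longnecker can be read loosely in the same way, with $b$ blocks, $t$ treatments, and MSB and MSE the block and error mean squares from the RCBD analysis: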

$$
RE(RCBD, CR) = \frac{(b-1)MSB + b(t-1)MSE}{(bt-1)MSE}
$$
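The formula above is easy to wrap in a small helper. The function name `relative_efficiency` and the input values below are hypothetical, chosen only to illustrate the arithmetic; they are not taken from the penicillin ANOVA table.

```r
# Relative efficiency of an RCBD to a completely randomized design,
# following the formula above.
#   msb: block mean square     mse: error mean square
#   b:   number of blocks      t:   number of treatments
relative_efficiency <- function(msb, mse, b, t) {
  ((b - 1) * msb + b * (t - 1) * mse) / ((b * t - 1) * mse)
}

# Hypothetical inputs, for illustration only
re <- relative_efficiency(msb = 50, mse = 10, b = 6, t = 3)
re

# Sanity check: if blocks explain nothing extra (MSB equal to MSE),
# the two designs are equally efficient and the ratio is 1
re_null <- relative_efficiency(msb = 7, mse = 7, b = 5, t = 4)
re_null
```

A value above 1 suggests that blocking helped: roughly, a completely randomized design would need about RE times as many observations per treatment to compare treatments with the same precision.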

#### Q2. Use the ANOVA table above to calculate the Relative Efficiency of the randomized complete block design to the completely randomized design.

#### Two devices have been proposed to reduce the air pollution resulting from the emission of carbon monoxide (CO) from the exhaust of automobiles. To evaluate the effectiveness of the devices, 48 cars of varying age and mechanical condition were selected for the study. The amount of carbon monoxide in the exhaust (in ppm) was measured prior to installing the device on each of the cars. Because there were considerable differences in the mechanical condition of the cars, the cars were paired based on the level of CO in their exhaust. The two devices were then randomly assigned to the cars within each pair of cars. Five months after installation, the amount of CO in the exhaust was again measured on each of the cars. The reduction in carbon monoxide from the initial measurements is given here.

In [12]:
# Read the paired CO data and give the columns short names
data <- read.csv('../ASCII-comma/CH15/ex15-4.TXT', header = TRUE)
colnames(data) <- c("pair", "before", "after")

In [13]:
library(reshape2)

In [15]:
# Reshape from wide (one row per pair) to long (one row per measurement)
data <- melt(data, id.vars = "pair")
colnames(data) <- c("pair", "treat", "CO")

In [16]:
head(data)

Unnamed: 0,pair,treat,CO
1,1,before,2.37
2,2,before,3.17
3,3,before,3.07
4,4,before,2.73
5,5,before,3.49
6,6,before,4.35


In [17]:
tail(data)

Unnamed: 0,pair,treat,CO
43,19,after,3.11
44,20,after,1.9
45,21,after,2.5
46,22,after,3.18
47,23,after,3.24
48,24,after,2.16


#### a. Does the device appear to reduce the average amount of CO in the exhaust of the cars? Use $\alpha=0.05$.
#### b. Compute the relative efficiency of the randomized complete block design (blocking on car) relative to a completely randomized design in which the 48 cars would have been randomly assigned to the two devices without regard to any pairing. Interpret the value of the relative efficiency.
#### c. Based on the relative efficiency computed in part (b), would you recommend pairing the cars in future studies?
#### d. In Chapter 6, we introduced the paired t-test. Analyze the above data using this test statistic.
#### e. Show that the paired t-test is equivalent to the F-test from the randomized block AOV by showing that your computed values for the t-test and F-test satisfy $t^2=F$
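The identity in part (e) can be checked numerically on simulated data before working through the exercise. Everything below is an illustrative assumption: the data are randomly generated (not the CO measurements), and the names `x1`, `x2`, `long`, `t_stat`, and `f_stat` are hypothetical.

```r
set.seed(42)  # arbitrary seed, only so the sketch is reproducible

# Simulated paired data: a shared pair effect plus independent noise
n_pairs <- 8
pair_effect <- rnorm(n_pairs, sd = 2)
x1 <- 10 + pair_effect + rnorm(n_pairs)
x2 <- 11 + pair_effect + rnorm(n_pairs)

# Paired t-test statistic
t_stat <- unname(t.test(x1, x2, paired = TRUE)$statistic)

# Same data in long format, analyzed as an RCBD with pair as the block
long <- data.frame(
  pair  = factor(rep(seq_len(n_pairs), times = 2)),
  treat = factor(rep(c("one", "two"), each = n_pairs)),
  y     = c(x1, x2)
)
f_stat <- anova(lm(y ~ pair + treat, data = long))["treat", "F value"]

# The squared t statistic should match the treatment F statistic
all.equal(t_stat^2, f_stat)
```

With two treatments and pairs as blocks, both procedures test the same hypothesis with the same error degrees of freedom, which is why the two statistics agree.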

http://www.stat.ucla.edu/history/latin_square.gif

## Latin Squares

Consider a study in which four cars and four drivers are employed to study the differences between four gasoline additives on automobile emissions. Even if the cars are identical models, there are likely slight differences in their performance. Also, even if drivers are given a strict protocol for how to drive, there are likely to be driver-to-driver differences.
The Latin square arrangement shows how these car and driver effects can be separated from the gasoline additive effects ...

For a Latin square design, the Latin square should be chosen at random. Methods for doing this are described in other textbooks (and implemented in some software).
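One simple way to pick a square at random is sketched below in base R: start from a cyclic square, then shuffle its rows, its columns, and the treatment labels. This is an assumed approach for illustration; it does not sample uniformly from all possible Latin squares, and the names `cyclic`, `shuffled`, and `square` are hypothetical.

```r
set.seed(2)  # arbitrary seed, only so the sketch is reproducible

n <- 4  # number of treatments (and of rows and columns)

# Cyclic starting square: entry (i, j) is ((i + j - 2) mod n) + 1
cyclic <- outer(seq_len(n), seq_len(n), function(i, j) (i + j - 2) %% n + 1)

# Randomly permute rows and columns; this preserves the Latin property
shuffled <- cyclic[sample(n), sample(n)]

# Randomly assign treatment letters to the numbers 1..n
labels <- sample(LETTERS[seq_len(n)])
square <- matrix(labels[shuffled], n, n)

square
```

Each letter appears exactly once in every row and every column, so row and column effects can be separated from treatment effects.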

In [18]:
library(faraway)
data(abrasion)

matrix(abrasion$material,4,4)

0,1,2,3
C,A,D,B
D,B,C,A
B,D,A,C
A,C,B,D


$$
Y_{ijk} = \mu + \tau_i + \beta_j + \gamma_k + \epsilon_{ijk}
$$
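Reading the model above with $\tau_i$ as the treatment effect and $\beta_j$, $\gamma_k$ as the effects of the two blocking factors (an assumption about the labeling, which is not spelled out here), the degrees of freedom for a $t \times t$ Latin square split as follows:

$$
\underbrace{t^2 - 1}_{\text{total}}
= \underbrace{(t - 1)}_{\text{treatments}}
+ \underbrace{(t - 1)}_{\text{rows}}
+ \underbrace{(t - 1)}_{\text{columns}}
+ \underbrace{(t - 1)(t - 2)}_{\text{error}}
$$

For $t = 4$ this gives $15 = 3 + 3 + 3 + 6$, so only 6 degrees of freedom remain for estimating the error variance.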


In [19]:
linmod <- lm(wear ~ position + run + material, data=abrasion)
anova(linmod)

Unnamed: 0,Df,Sum Sq,Mean Sq,F value,Pr(>F)
position,3,1468.5,489.5,7.991837,0.01616848
run,3,986.5,328.8333,5.368707,0.03901297
material,3,4621.5,1540.5,25.15102,0.0008498192
Residuals,6,367.5,61.25,,


In [20]:
TukeyHSD(aov(wear ~ position + run + material, data=abrasion))$material

Unnamed: 0,diff,lwr,upr,p adj
B-A,-45.75,-64.90706,-26.59294,0.0007030032
C-A,-24.0,-43.15706199,-4.84293801,0.01903555
D-A,-35.25,-54.407061991,-16.092938009,0.002866207
C-B,21.75,2.59293801,40.90706199,0.02947738
D-B,10.5,-8.657062,29.657062,0.3206306
D-C,-11.25,-30.407062,7.907062,0.2742765
