## The Data

In [111]:
head(mtcars, 2)

,mpg,cyl,disp,hp,drat,wt,qsec,vs,am,gear,carb
Mazda RX4,21,6,160,110,3.9,2.62,16.46,0,1,4,4
Mazda RX4 Wag,21,6,160,110,3.9,2.875,17.02,0,1,4,4


## Before Centering

In [113]:
y = mtcars$mpg
x1 = mtcars$wt
x2 = mtcars$hp

In [114]:
lm(y ~ x1 + x2)


Call:
lm(formula = y ~ x1 + x2)

Coefficients:
(Intercept)           x1           x2  
   37.22727     -3.87783     -0.03177  


## Regression Through the Origin on Centered Data

In [115]:
yc = y - mean(y)
x1c = x1 - mean(x1)
x2c = x2 - mean(x2)
lm(yc ~ x1c + x2c - 1)


Call:
lm(formula = yc ~ x1c + x2c - 1)

Coefficients:
     x1c       x2c  
-3.87783  -0.03177  
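
The slopes are identical to those of the uncentered fit; only the intercept has been removed. Because a least-squares fit with an intercept passes through the point of means, the dropped intercept can be recovered from the sample means. The following is a minimal sketch of that check on `mtcars`; the names `fit_c`, `fit`, `b`, and `b0` are introduced here for illustration only and are not part of the original notebook.

```r
# Rebuild the variables and their centered versions
y  <- mtcars$mpg
x1 <- mtcars$wt
x2 <- mtcars$hp
yc  <- y  - mean(y)
x1c <- x1 - mean(x1)
x2c <- x2 - mean(x2)

# Centered fit through the origin (no intercept)
fit_c <- lm(yc ~ x1c + x2c - 1)
b <- unname(coef(fit_c))

# Recover the intercept: b0 = mean(y) - b1 * mean(x1) - b2 * mean(x2)
b0 <- mean(y) - b[1] * mean(x1) - b[2] * mean(x2)

# Compare with the intercept of the uncentered fit
fit <- lm(y ~ x1 + x2)
all.equal(b0, unname(coef(fit)[1]))
```

If the comparison returns `TRUE`, the two fits describe the same regression plane, just parameterized differently.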


## Some Useful Equalities
Let $\mathbf{1}_{n}$ be the all-ones $n \times 1$ vector $\begin{pmatrix}
  1 \\
  \vdots \\
  1
\end{pmatrix}$ and let $\mathbf H$ be $1/n$ times the all-ones $n \times n$ matrix, $\begin{pmatrix}
  1/n & 1/n & \dots & 1/n \\
  \vdots & \vdots & \ddots & \vdots \\
  1/n & 1/n & \dots & 1/n 
\end{pmatrix}$. Then:

\begin{align}
\mathbf H = \mathbf{1}_{n} (\mathbf{1}_{n}^{\mathit t} \mathbf{1}_{n})^{-1} \mathbf{1}_{n}^{\mathit t}
\end{align}

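Also, the mean-centered vector $\mathbf{yc} = \mathbf y - \bar{y}\,\mathbf{1}_{n}$, where $\bar{y}$ is the sample mean of $\mathbf y$, can be computed with the centering matrix $\mathbf I - \mathbf H$: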

\begin{align}
\mathbf{yc} = (\mathbf I - \mathbf H) \mathbf y
\end{align}
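
Both equalities follow from the fact that $\mathbf{1}_{n}^{\mathit t} \mathbf{1}_{n} = n$. A short derivation, writing $\bar{y}$ for the scalar sample mean of $\mathbf y$ (a symbol assumed here for the sketch):

\begin{align}
\mathbf{1}_{n} (\mathbf{1}_{n}^{\mathit t} \mathbf{1}_{n})^{-1} \mathbf{1}_{n}^{\mathit t}
  &= \frac{1}{n} \mathbf{1}_{n} \mathbf{1}_{n}^{\mathit t} = \mathbf H \\
\mathbf H \mathbf y
  &= \frac{1}{n} \mathbf{1}_{n} (\mathbf{1}_{n}^{\mathit t} \mathbf y)
   = \frac{1}{n} \mathbf{1}_{n} \sum_{i=1}^{n} y_i
   = \bar{y} \, \mathbf{1}_{n} \\
(\mathbf I - \mathbf H) \mathbf y
  &= \mathbf y - \bar{y} \, \mathbf{1}_{n}
\end{align}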

### Examples

In [77]:
n = 5
# No seed is set, so the values drawn here differ from run to run
y = matrix(round(runif(n, 1, 100)), ncol=1)
t(y)

0,1,2,3,4
27,82,77,37,4


In [78]:
# The identity matrix
I = diag(n)

In [79]:
H = matrix(1, n, n)/n
H

0,1,2,3,4
0.2,0.2,0.2,0.2,0.2
0.2,0.2,0.2,0.2,0.2
0.2,0.2,0.2,0.2,0.2
0.2,0.2,0.2,0.2,0.2
0.2,0.2,0.2,0.2,0.2


In [80]:
# Jn is the all-ones n x 1 vector, written 1_n in the text
Jn = matrix(1, n, 1)
t(Jn)

0,1,2,3,4
1,1,1,1,1


In [81]:
Jn %*% solve(t(Jn) %*% Jn)  %*% t(Jn)

0,1,2,3,4
0.2,0.2,0.2,0.2,0.2
0.2,0.2,0.2,0.2,0.2
0.2,0.2,0.2,0.2,0.2
0.2,0.2,0.2,0.2,0.2
0.2,0.2,0.2,0.2,0.2


In [82]:
# Element-wise comparison; for floating-point matrices in general, all.equal() is the safer check
H == Jn %*% solve(t(Jn) %*% Jn)  %*% t(Jn)

0,1,2,3,4
TRUE,TRUE,TRUE,TRUE,TRUE
TRUE,TRUE,TRUE,TRUE,TRUE
TRUE,TRUE,TRUE,TRUE,TRUE
TRUE,TRUE,TRUE,TRUE,TRUE
TRUE,TRUE,TRUE,TRUE,TRUE


In [84]:
yc = (I - H) %*% y
t(yc)

0,1,2,3,4
-18.4,36.6,31.6,-8.4,-41.4


In [86]:
yc1 = y - mean(y)
t(yc1)

0,1,2,3,4
-18.4,36.6,31.6,-8.4,-41.4


In [99]:
t(round(yc-yc1, 10))

0,1,2,3,4
0,0,0,0,0
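
The formula for $\mathbf H$ has the same form as the hat matrix of a regression whose only predictor is the constant column, so centering can also be read as taking residuals from an intercept-only fit. The sketch below checks this on the five values printed in the example above; the names `one`, `yv`, and `res` are introduced here for illustration and do not appear in the original cells.

```r
# Data copied from the example output above
yv <- c(27, 82, 77, 37, 4)
y  <- matrix(yv, ncol = 1)
n  <- nrow(y)

# Build H from the all-ones vector
one <- matrix(1, n, 1)
H <- one %*% solve(t(one) %*% one) %*% t(one)

# H is symmetric and idempotent, as expected of a projection matrix
isTRUE(all.equal(H, t(H)))
isTRUE(all.equal(H %*% H, H))

# Centering with (I - H) matches the residuals of an intercept-only fit
yc  <- as.vector((diag(n) - H) %*% y)
res <- unname(resid(lm(yv ~ 1)))
all.equal(yc, res)
```

Using `all.equal()` rather than `==` keeps the comparison robust to floating-point rounding.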
