# Model Selection - Centering
$$Y=\beta_0 + \beta_1X  + \epsilon$$

In [17]:
set.seed(1) #set random seed so that the simulation is reproducible

n <- 100 #number of observations per sample

b0 <- 2
b1 <- 3

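x <- 2 + rnorm(n) # draw the predictor values from a normal distribution with mean 2 and sd 1

e <- 7*rnorm(n) # draw the errors from a normal distribution with mean 0 and sd 7
y <- b0 + b1*x + e # generate the responses

lm.fit <- lm(y ~ x) # fit the linear model
summary(lm.fit) # display the fitted coefficients and fit statistics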


Call:
lm(formula = y ~ x)

Residuals:
     Min       1Q   Median       3Q      Max 
-13.1379  -4.2964  -0.9765   3.7758  16.4232 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   1.7510     1.7272   1.014 0.313183    
x             2.9926     0.7541   3.968 0.000138 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 6.739 on 98 degrees of freedom
Multiple R-squared:  0.1385,	Adjusted R-squared:  0.1297 
F-statistic: 15.75 on 1 and 98 DF,  p-value: 0.0001379


We now consider the "centered" regression, obtained by subtracting $\beta_1\bar X$ from both sides of the model:

$$Y-\beta_1\bar X=\beta_0 + \beta_1 X- \beta_1\bar X  + \epsilon$$

Note that this is equivalent to:
\begin{align}
Y&=\beta_0 +\beta_1\bar X + \beta_1 (X-\bar X)  + \epsilon\\
Y&=\beta_c + \beta_1 X_c  + \epsilon
\end{align}

where $\beta_c = \beta_0 +\beta_1\bar X$ and $X_c = X-\bar X$ is the centered predictor.

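Both regressions yield **the same** estimate of $\beta_1$, but the interpretation of the intercept changes: $\beta_0$ is the expected value of $Y$ at $X=0$, whereas $\beta_c$ is the expected value of $Y$ at $X=\bar X$.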
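
A short sketch of why the slope estimate cannot change, using the standard least-squares formulas. The hats (fitted values of the coefficients) and the sample mean $\bar Y$ are notation introduced here for illustration; they do not appear above.

\begin{align}
\hat\beta_1 &= \frac{\sum_i (X_i-\bar X)(Y_i-\bar Y)}{\sum_i (X_i-\bar X)^2}, \qquad \hat\beta_0 = \bar Y - \hat\beta_1\bar X\\
\hat\beta_c &= \bar Y - \hat\beta_1\bar X_c = \bar Y = \hat\beta_0 + \hat\beta_1\bar X
\end{align}

The slope depends on $X$ only through the deviations $X_i-\bar X$, which are unaffected by subtracting a constant from every $X_i$. Because the centered predictor has sample mean $\bar X_c = 0$, the fitted intercept of the centered regression is simply $\bar Y$.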

In [13]:
x.centered <- x - mean(x) # subtract the sample mean so that x.centered has mean 0

In [14]:
lm.fit <- lm(y ~ x.centered) # fit the linear model on the centered predictor
summary(lm.fit) # display the fitted coefficients and fit statistics


Call:
lm(formula = y ~ x.centered)

Residuals:
     Min       1Q   Median       3Q      Max 
-13.1379  -4.2964  -0.9765   3.7758  16.4232 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   8.0620     0.6739  11.963  < 2e-16 ***
x.centered    2.9926     0.7541   3.968 0.000138 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 6.739 on 98 degrees of freedom
Multiple R-squared:  0.1385,	Adjusted R-squared:  0.1297 
F-statistic: 15.75 on 1 and 98 DF,  p-value: 0.0001379
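
The relationships between the two fits can be checked numerically. The following is a minimal sketch that regenerates the same simulated data and compares the coefficients; the object names `fit.raw`, `fit.centered`, and the four logical flags are hypothetical names chosen for this illustration, not part of the original notebook.

```r
set.seed(1)                          # same seed as the simulation above
n <- 100
x <- 2 + rnorm(n)                    # predictor with mean 2
y <- 2 + 3 * x + 7 * rnorm(n)        # b0 = 2, b1 = 3, error sd = 7

fit.raw <- lm(y ~ x)                 # regression on the original predictor
x.centered <- x - mean(x)
fit.centered <- lm(y ~ x.centered)   # regression on the centered predictor

b.raw <- unname(coef(fit.raw))       # c(intercept, slope)
b.cen <- unname(coef(fit.centered))  # c(intercept, slope)

# 1. The slope estimate is identical in both fits.
same.slope <- isTRUE(all.equal(b.raw[2], b.cen[2]))

# 2. The centered intercept equals b0.hat + b1.hat * mean(x).
intercept.shift <- isTRUE(all.equal(b.cen[1], b.raw[1] + b.raw[2] * mean(x)))

# 3. The centered intercept equals the sample mean of y.
intercept.is.ybar <- isTRUE(all.equal(b.cen[1], mean(y)))

# 4. The standard error of the centered intercept equals the
#    residual standard error divided by sqrt(n).
se.cen <- coef(summary(fit.centered))["(Intercept)", "Std. Error"]
se.matches <- isTRUE(all.equal(se.cen, summary(fit.centered)$sigma / sqrt(n)))

print(c(same.slope, intercept.shift, intercept.is.ybar, se.matches))
```

The fourth check accounts for the smaller intercept standard error in the centered output: 6.739 / sqrt(100) = 0.6739, compared with 1.7272 in the uncentered fit. The residuals, residual standard error, R-squared, and F-statistic are unchanged because both models produce the same fitted values.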
