# Chapter 4 - Linear Model Interpretation

Joshua French

To open this information in an interactive Colab notebook, click the Open in Colab graphic below.

<a href="https://colab.research.google.com/github/jfrench/LinearRegression/blob/master/notebooks/04-linear-model-interpretation-notebook.ipynb"> <img src="https://colab.research.google.com/assets/colab-badge.svg"> </a>

------------------------------------------------------------------------

In [1]:
if(!require(palmerpenguins, quietly = TRUE)) {
  install.packages("palmerpenguins", repos = "https://cran.rstudio.com/")
  library(palmerpenguins)
}

In [2]:
if(!require(car, quietly = TRUE)) {
  install.packages("car", repos = "https://cran.rstudio.com/")
  library(car)
}

In [3]:
if(!require(effects, quietly = TRUE)) {
  install.packages("effects", repos = "https://cran.rstudio.com/")
  library(effects)
}

# Interpretation of coefficients

The standard approach to interpreting the coefficients of a fitted linear model is to consider the expected change in the response in relation to changes in the regressors in the model.

Consider the typical multiple linear regression model of the response

$$
Y=\beta_0+\beta_1 X_1 +\ldots + \beta_{p-1}X_{p-1}+\epsilon.
$$

-   We treat the values of our regressor variables as being fixed, known values.
-   The error term is treated as a random variable.
-   Consequently, the response variable is also a random variable.

**Random Error Assumption**

------------------------------------------------------------------------

We assume that the errors all have mean 0, conditional on the values of the regressor variables.

$$
E(\epsilon \mid X_1, X_2, \ldots, X_{p-1})=0.
$$

Or, using alternative notation:

$$
E(\epsilon \mid \mathbb{X})=0.
$$

**Expected Value of Response**

------------------------------------------------------------------------

Using the assumption of mean zero errors, we have:

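$$
\begin{aligned}
E(Y\mid \mathbb{X}) &= E(\beta_0+\beta_1 X_1 +\ldots + \beta_{p-1}X_{p-1}+\epsilon \mid \mathbb{X})\\
&= \beta_0+\beta_1 X_1 +\ldots + \beta_{p-1}X_{p-1} + E(\epsilon \mid \mathbb{X})\\
&= \beta_0+\beta_1 X_1 +\ldots + \beta_{p-1}X_{p-1}.
\end{aligned}
$$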

Note: All the coefficient $\beta_i$ terms are fixed, non-random.
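A small simulation can make the role of the mean-zero error assumption concrete. The sketch below uses hypothetical values $\beta_0 = 2$, $\beta_1 = 0.5$, a fixed regressor value $x = 3$, and normally distributed errors; the normal distribution and every numeric value are assumptions made only for illustration. Averaging many simulated responses at the fixed regressor value should give a number close to $\beta_0 + \beta_1 x$.

```r
# Hypothetical coefficient values chosen only for illustration
beta0 <- 2
beta1 <- 0.5
x <- 3  # the regressor is treated as a fixed, known value

set.seed(1)  # make the simulation reproducible
# Errors with mean 0; normality is an assumption of this sketch
eps <- rnorm(100000, mean = 0, sd = 1)
# The response is random only because the error is random
y <- beta0 + beta1 * x + eps

# The sample mean of the responses approximates E(Y | X = 3)
mean_y <- mean(y)
expected_y <- beta0 + beta1 * x
c(simulated = mean_y, theoretical = expected_y)
```

The two numbers should agree closely, and the agreement improves as the number of simulated errors grows.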

**Interpretation for simple linear regression**

------------------------------------------------------------------------

Suppose we have a simple linear regression model, so that

$$
E(Y\mid X)=\beta_0 + \beta_1 X.
$$

The interpretations of the coefficients are:

-   $\beta_0$ is the expected response when the regressor is 0, i.e., $\beta_0=E(Y\mid X=0)$.
-   $\beta_1$ is the expected change in the response when the regressor increases 1 unit, i.e., $\beta_1=E(Y\mid X=x^*+1)-E(Y\mid X=x^*)$, where $x^*$ is a fixed, real number.

**Interpretation of Intercept**

------------------------------------------------------------------------

The intercept term $\beta_0$ is the expected value of the response when $X= 0$.

From the regression model,

$$
E(Y\mid X=0)=\beta_0 + \beta_1 \cdot 0 = \beta_0.
$$

**Interpretation of Slope**

------------------------------------------------------------------------

Similarly, for $\beta_1$, we notice that

$$
\begin{aligned}
E(Y\mid X=x^*+1)-E(Y\mid X=x^*) &= [\beta_0+\beta_1 (x^*+1)] - [\beta_0+\beta_1 x^*]\\
&= \beta_1.
\end{aligned}
$$

Thus, $\beta_1$ literally equals the change in the expected response when the regressor increases by 1 unit.
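The same identity holds for the estimated coefficients of a fitted model, and it can be checked numerically. The sketch below uses simulated data (the sample size, coefficient values, and the choice $x^* = 4$ are all hypothetical) and compares the difference between two fitted means, one unit apart, with the estimated slope.

```r
set.seed(42)  # make the simulated data reproducible
# Hypothetical data: one numeric regressor and a linear mean function
x <- runif(50, min = 0, max = 10)
y <- 1 + 2 * x + rnorm(50)

fit <- lm(y ~ x)

# Fitted means at x* and x* + 1 (x* = 4 is an arbitrary choice)
x_star <- 4
pred <- predict(fit, newdata = data.frame(x = c(x_star, x_star + 1)))

# The difference in fitted means equals the estimated slope
slope_from_predictions <- unname(pred[2] - pred[1])
slope_from_fit <- unname(coef(fit)["x"])
all.equal(slope_from_predictions, slope_from_fit)
```

Changing `x_star` to any other value leaves the difference unchanged, which is what makes a single-number interpretation of the slope possible.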

It may not make sense to say “we increase $X$ by 1 unit” or “when $X$ increases by 1 unit”. Can you think of an example?

To illustrate the interpretations given above, we interpret the simple linear regression model fit to the `penguins` data. The fitted simple linear regression model of `bill_length_mm` regressed on `body_mass_g` is

$$
\hat{E}(\mathtt{bill\_length\_mm}\mid \mathtt{body\_mass\_g})=26.9+0.004 \,\mathtt{body\_mass\_g}.
$$

Some basic interpretations of the coefficients are:

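-   The expected bill length of a penguin with a body mass of 0 grams is 26.9 mm. No penguin has a body mass of 0 grams, so this interpretation has little practical meaning.
-   A penguin that is 1 gram heavier than another penguin is expected to have a bill that is 0.004 mm longer.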

**Rescaling Predictors to aid Interpretation**

------------------------------------------------------------------------

-   A weight difference of 1 gram is negligible in the context of penguin weights.
-   A bill length change of 0.004 mm is unlikely to be noticed.

In the code below, we divide the `body_mass_g` variable by 1000 to convert the variable from grams to kilograms.

We then fit the model regressing `bill_length_mm` on `body_mass_kg` and extract the estimated coefficients.

In [4]:
# load penguins data
data(penguins, package = "palmerpenguins")
# transform body mass variable from g to kg
penguins <- penguins |> transform(body_mass_kg = body_mass_g/1000)
# fit model with body_mass_kg
slmod_scaled <- lm(bill_length_mm ~ body_mass_kg, data = penguins)
# extract coefficients
coefficients(slmod_scaled)

**Question**

-   How do we reinterpret the model in the context of kilograms?

Dividing `body_mass_g` by 1000 resulted in the estimated coefficient changing by a factor of 1000.

More generally, if $\hat{\beta}_j$ is the estimated coefficient for $X_j$, then the regressor $(X_j + a)/c$ will have an estimated coefficient of $c\hat{\beta}_j$, where $a$ and $c$ are fixed, real numbers and assuming nothing else in the fitted model changes.
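This rule can be verified numerically. The sketch below uses simulated data rather than the `penguins` data, and the shift `a`, the scale `c_scale`, and all data-generating values are hypothetical. It fits the model with the original regressor and with the transformed regressor $(X + a)/c$, then compares the estimated slopes.

```r
set.seed(7)  # make the simulated data reproducible
# Hypothetical regressor on a "grams"-like scale and a linear response
x <- runif(100, min = 2500, max = 6500)
y <- 27 + 0.004 * x + rnorm(100, sd = 4)

a <- -4000        # hypothetical shift
c_scale <- 1000   # hypothetical scale (named c_scale to avoid masking c())
x_new <- (x + a) / c_scale

fit_orig <- lm(y ~ x)
fit_new <- lm(y ~ x_new)

b_orig <- unname(coef(fit_orig)["x"])
b_new <- unname(coef(fit_new)["x_new"])

# The slope for the transformed regressor is c times the original slope
all.equal(b_new, c_scale * b_orig)

# The shift a changes only the intercept: new intercept = old intercept - a * old slope
int_orig <- unname(coef(fit_orig)[1])
int_new <- unname(coef(fit_new)[1])
all.equal(int_new, int_orig - a * b_orig)
```

The shift $a$ has no effect on the slope, while the scale $c$ has no effect on the fitted values; only the units in which the slope is expressed change.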

**Interpretation for first-order multiple linear regression models**

------------------------------------------------------------------------

Suppose we have a multiple linear regression model with $p-1$ *numeric* regressors, so that

$$
E(Y\mid X_1,\ldots,X_{p-1})=\beta_0 + \beta_1 X_1 + \cdots + \beta_{p-1} X_{p-1}.
$$

Relying on the definition of $\mathbb{X}$, we denote the set of regressors without $X_j$ as $\mathbb{X}_{-j} = \mathbb{X}\setminus\{X_j\}$.

The interpretations of the coefficients from the model above are:

-   $\beta_0$ is the expected response when all regressors are 0, i.e., $\beta_0=E(Y\mid X_1=0,\ldots,X_{p-1}=0)$.
-   $\beta_j$, $j = 1,\ldots,p-1$, represents the expected change in the response when regressor $j$ increases 1 unit and the other regressors stay the same, i.e., $\beta_j=E(Y\mid \mathbb{X}_{-j} = \mathbf{x}^*_{-j}, X_{j} = x_{j}^*+1)-E(Y\mid \mathbb{X}_{-j} = \mathbf{x}^*_{-j}, X_{j} = x_{j}^*)$, where $\mathbf{x}_{-j}^*=[x^*_1,\ldots,x_{j-1}^*,x_{j+1}^*,\ldots,x_{p-1}^*]\in \mathbb{R}^{p-2}$ is a vector with $p-2$ fixed values (the number of regressors excluding $X_j$) and $x_j^*$ is a fixed real number. The non-intercept coefficients of a multiple linear regression model are known as *partial slopes*.

Regarding the interpretation of $\beta_0$, from the regression model:

$$
\begin{aligned}
E(Y\mid X_1=0,\ldots,X_{p-1}=0) &= \beta_0 + \beta_1 \cdot 0 + \cdots + \beta_{p-1} \cdot 0\\
&= \beta_0.
\end{aligned}
$$
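A similar calculation supports the interpretation of a partial slope $\beta_j$. Writing both conditional expectations using the regression model and subtracting, every term that does not involve $X_j$ cancels:

$$
\begin{aligned}
&E(Y\mid \mathbb{X}_{-j} = \mathbf{x}^*_{-j}, X_j = x_j^*+1) - E(Y\mid \mathbb{X}_{-j} = \mathbf{x}^*_{-j}, X_j = x_j^*)\\
&\quad= \Bigl[\beta_0 + \sum_{k\neq j}\beta_k x_k^* + \beta_j (x_j^*+1)\Bigr] - \Bigl[\beta_0 + \sum_{k\neq j}\beta_k x_k^* + \beta_j x_j^*\Bigr]\\
&\quad= \beta_j.
\end{aligned}
$$

The cancellation relies on the other regressors keeping the same values $\mathbf{x}^*_{-j}$ in both expectations.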

**Question:**

-   Does the interpretation of the intercept always make sense?
-   What if $X_1$ was heart rate?

**Issues Interpreting Polynomial Models**

------------------------------------------------------------------------

In multiple regression models, a single predictor can be used more than once in the model.

E.g., in the 2nd-degree polynomial regression model:

$$E(Y\mid X) = \beta_0 + \beta_1 X + \beta_2 X^2,$$

$X$ is used in both the second and third terms.

**Questions**

-   How does this affect the interpretation of $\beta_1$?
-   Is it possible to increase $X$ while keeping $X^2$ fixed?

The standard interpretation we discussed is applicable to first-order linear regression models.

*First-order linear regression model:* no regressor is a function of any other regressor.
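To see what goes wrong for the quadratic model, note that $E(Y\mid X=x^*+1)-E(Y\mid X=x^*)=\beta_1+\beta_2(2x^*+1)$, which depends on the starting value $x^*$. The sketch below illustrates this with simulated data; the data-generating values and the starting points 1 and 3 are hypothetical choices made only for illustration.

```r
set.seed(3)  # make the simulated data reproducible
# Hypothetical data with a quadratic mean function
x <- runif(200, min = 0, max = 5)
y <- 1 + 2 * x + 0.5 * x^2 + rnorm(200)

# Second-degree polynomial regression model
fit <- lm(y ~ x + I(x^2))
b <- unname(coef(fit))  # b[2] estimates beta_1, b[3] estimates beta_2

# Change in the fitted mean when x increases by 1, starting at x_star
unit_change <- function(x_star) {
  pred <- predict(fit, newdata = data.frame(x = c(x_star, x_star + 1)))
  unname(pred[2] - pred[1])
}

change_at_1 <- unit_change(1)
change_at_3 <- unit_change(3)

# The two changes differ from each other and from the estimate of beta_1
c(change_at_1 = change_at_1, change_at_3 = change_at_3, beta1_hat = b[2])
```

Because the effect of a 1-unit increase in $X$ changes with the starting value, no single coefficient summarizes it, which is why the standard interpretation is restricted to first-order models.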