# Chapter 3 - Conceptual exercises

##### Q: Describe the null hypotheses to which the p-values given in Table 3.4 correspond. Explain what conclusions you can draw based on these p-values. Your explanation should be phrased in terms of sales, TV, radio, and newspaper, rather than in terms of the coefficients of the linear model.

In [2]:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

import warnings
warnings.filterwarnings('ignore')

%matplotlib inline

![table34](./figures/table34.png)

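A: Each p-value in Table 3.4 tests the null hypothesis that the corresponding advertising medium has no association with sales once the budgets for the other two media are held fixed (for the intercept, that sales are zero when all three budgets are zero). The p-values for TV and radio are very small, so we reject the null hypothesis for both: spending more on TV or radio advertising is associated with higher sales. The p-value for newspaper is large, so there is no evidence that newspaper advertising is associated with sales given the TV and radio budgets.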

##### Q: Carefully explain the differences between the KNN classifier and KNN regression methods.

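A: Both methods first identify the K training observations nearest to the query point under some distance function. KNN regression predicts a quantitative response by averaging the responses of those neighbours. The KNN classifier predicts a qualitative response: it estimates each class's probability as the fraction of neighbours in that class and assigns the class with the highest fraction (a majority vote).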
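The contrast can be sketched in a few lines of NumPy. This is a minimal illustration, not a library implementation: the helper names, the Euclidean distance, and the toy data are all assumptions made for the example.

```python
import numpy as np
from collections import Counter

def k_nearest_indices(X_train, x0, k):
    """Indices of the k training points closest to x0 (Euclidean distance)."""
    distances = np.linalg.norm(X_train - x0, axis=1)
    return np.argsort(distances)[:k]

def knn_classify(X_train, y_train, x0, k):
    """KNN classifier: majority vote over the neighbours' class labels."""
    idx = k_nearest_indices(X_train, x0, k)
    return Counter(y_train[idx].tolist()).most_common(1)[0][0]

def knn_regress(X_train, y_train, x0, k):
    """KNN regression: mean of the neighbours' quantitative responses."""
    idx = k_nearest_indices(X_train, x0, k)
    return float(np.mean(y_train[idx]))

# Toy data (made up for illustration): one predictor, five observations.
X = np.array([[1.0], [2.0], [3.0], [8.0], [9.0]])
labels = np.array(['a', 'a', 'b', 'b', 'b'])   # qualitative response
values = np.array([1.0, 2.0, 3.0, 8.0, 9.0])   # quantitative response

x0 = np.array([2.2])
predicted_class = knn_classify(X, labels, x0, k=3)   # vote among the 3 neighbours
predicted_value = knn_regress(X, values, x0, k=3)    # mean of the 3 neighbours
```

The neighbour search is shared by both functions; only the final aggregation step (vote versus mean) differs.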

##### Q: Suppose we have a data set with five predictors, X1 = GPA, X2 = IQ, X3 = Gender (1 for Female and 0 for Male), X4 = Interaction between GPA and IQ, and X5 = Interaction between GPA and Gender. The response is starting salary after graduation (in thousands of dollars).

##### Suppose we use least squares to fit the model, and get: 

```
β̂0 = 50, β̂1 = 20, β̂2 = 0.07, β̂3 = 35, β̂4 = 0.01, β̂5 = −10.
```

##### (a) Which answer is correct, and why?

##### i. For a fixed value of IQ and GPA, males earn more on average than females.
##### ii. For a fixed value of IQ and GPA, females earn more on average than males.
##### iii. For a fixed value of IQ and GPA, males earn more on average than females provided that the GPA is high enough.
##### iv. For a fixed value of IQ and GPA, females earn more on average than males provided that the GPA is high enough.

##### (b) Predict the salary of a female with IQ of 110 and a GPA of 4.0.

##### (c) True or false: Since the coefficient for the GPA/IQ interaction term is very small, there is very little evidence of an interaction effect. Justify your answer.

__(a)__

In [5]:
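def f_hat(gpa, iq, gender):
    # Fitted model from the question; the prediction is in thousands of dollars.
    # gender is 1 for female and 0 for male.
    return 50 + (20 * gpa) + (0.07 * iq) + (35 * gender) + 0.01 * (gpa * iq) - 10 * (gpa * gender)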

In [7]:
def _removing_all_non_gender_terms(gender, gpa):
    return (35 * gender) - 10 * (gpa * gender)

In [8]:
# given that gender can be either 1 (female) or 0 (male)

def _female(gpa):
    return 35 - 10 * gpa

def _male(gpa):
    return 0

The female and male predictions coincide when `_female(gpa) == _male(gpa)`, that is, when 35 − 10 · gpa = 0, or __gpa = 3.5__. Below a GPA of 3.5, females are predicted to earn more than males with the same IQ and GPA; above 3.5, they are predicted to earn less.

The right answer is __iii: For a fixed value of IQ and GPA, males earn more on average than females provided that the GPA is high enough__ (here, above 3.5).
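As a quick numerical check of the crossover, the sketch below evaluates the predicted female-minus-male gap at a few GPA values. The helper `gender_gap` and the chosen GPA grid are illustrative assumptions; `f_hat` is restated so the cell runs on its own.

```python
def f_hat(gpa, iq, gender):
    # Fitted model from the exercise; salary is in thousands of dollars.
    return 50 + 20 * gpa + 0.07 * iq + 35 * gender + 0.01 * gpa * iq - 10 * gpa * gender

def gender_gap(gpa, iq=110):
    """Predicted female salary minus predicted male salary at the same GPA and IQ."""
    return f_hat(gpa, iq, gender=1) - f_hat(gpa, iq, gender=0)

# Rounded to suppress floating-point noise; the IQ terms cancel in the difference.
gaps = {gpa: round(gender_gap(gpa), 6) for gpa in (2.0, 3.0, 3.5, 4.0)}
print(gaps)
```

The gap is positive below a GPA of 3.5, zero at 3.5, and negative above it, and it does not depend on the IQ value chosen.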

__(b)__

In [14]:
female = 1
salary = f_hat(gpa=4, iq=110, gender=female)
print('USD {:.0f}'.format(salary * 1000))

USD 137100


__(c)__

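False. The size of a coefficient depends on the units of its predictor: the GPA × IQ product takes large values, so even a small coefficient can correspond to a sizeable effect. Evidence for an interaction is judged by the coefficient's p-value (from its t-statistic, the estimate divided by its standard error), not by its magnitude.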
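The point about units can be illustrated on synthetic data: rescaling a predictor changes its coefficient but leaves its t-statistic untouched. This is only a sketch; the data, the helper `ols_with_t_stats`, and the factor of 1000 are assumptions made for the example.

```python
import numpy as np

def ols_with_t_stats(X, y):
    """Least squares fit; returns coefficients and their t-statistics.
    X must already contain a column of ones for the intercept."""
    n, p = X.shape
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    residuals = y - X @ beta
    sigma2 = residuals @ residuals / (n - p)     # residual variance estimate
    cov = sigma2 * np.linalg.inv(X.T @ X)        # estimated covariance of the coefficients
    return beta, beta / np.sqrt(np.diag(cov))

rng = np.random.default_rng(0)
n = 200
x = rng.normal(size=n)
y = 1.0 + 2.0 * x + rng.normal(size=n)           # synthetic data, arbitrary true model

X_original = np.column_stack([np.ones(n), x])
X_rescaled = np.column_stack([np.ones(n), 1000 * x])   # same predictor in different units

beta_a, t_a = ols_with_t_stats(X_original, y)
beta_b, t_b = ols_with_t_stats(X_rescaled, y)

print('slope:', beta_a[1], 'vs', beta_b[1])      # differ by a factor of 1000
print('t-stat:', t_a[1], 'vs', t_b[1])           # identical up to rounding
```

The rescaled slope is a thousand times smaller, yet the evidence against a zero coefficient is the same in both fits.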

##### Q: I collect a set of data (n = 100 observations) containing a single predictor and a quantitative response. I then fit a linear regression model to the data, as well as a separate cubic regression, i.e. Y = β0 + β1X + β2X² + β3X³ + ε.

##### (a) Suppose that the true relationship between X and Y is linear, i.e. Y = β0 + β1X + ε. Consider the training residual sum of squares (RSS) for the linear regression, and also the training RSS for the cubic regression. Would we expect one to be lower than the other, would we expect them to be the same, or is there not enough information to tell? Justify your answer.

##### (b) Answer (a) using test rather than training RSS.
##### (c) Suppose that the true relationship between X and Y is not linear, but we don’t know how far it is from linear. Consider the training RSS for the linear regression, and also the training RSS for the cubic regression. Would we expect one to be lower than the other, would we expect them to be the same, or is there not enough information to tell? Justify your answer.
##### (d) Answer (c) using test rather than training RSS.

__(a)__

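The cubic regression should have a lower training RSS (or, at worst, an equal one). The linear model is a special case of the cubic with β2 = β3 = 0, so least squares on the cubic can never fit the training data worse. The extra flexibility lets the cubic follow part of the noise `ε`, which is overfitting.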

__(b)__

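We would expect the cubic regression to have a higher test RSS. The true relationship is linear, so the cubic's extra terms add variance without reducing bias, and the noise it fitted in the training set does not carry over to the test set.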
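Answers (a) and (b) can be checked with a small simulation. The sketch below assumes an arbitrary linear truth (intercept 1, slope 2, unit-variance noise) and uses `np.polyfit` for both fits; none of these choices come from the exercise.

```python
import numpy as np

def rss(coeffs, x, y):
    """Residual sum of squares of a polynomial fit evaluated on (x, y)."""
    return float(np.sum((y - np.polyval(coeffs, x)) ** 2))

rng = np.random.default_rng(1)
n, n_reps = 100, 500
train_diff, test_diff = [], []   # cubic RSS minus linear RSS, per simulated data set

for _ in range(n_reps):
    # True relationship is linear (coefficients and noise level are arbitrary).
    x_train, x_test = rng.uniform(-2, 2, n), rng.uniform(-2, 2, n)
    y_train = 1 + 2 * x_train + rng.normal(size=n)
    y_test = 1 + 2 * x_test + rng.normal(size=n)

    linear = np.polyfit(x_train, y_train, deg=1)
    cubic = np.polyfit(x_train, y_train, deg=3)

    train_diff.append(rss(cubic, x_train, y_train) - rss(linear, x_train, y_train))
    test_diff.append(rss(cubic, x_test, y_test) - rss(linear, x_test, y_test))

max_train_diff = max(train_diff)             # never above zero, up to rounding error
mean_test_diff = float(np.mean(test_diff))   # expected to be positive on average
print(max_train_diff, mean_test_diff)
```

On any single data set the test RSS of the cubic can still come out lower by chance; the statement in (b) is about what to expect on average.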

__(c)__

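The cubic regression would have a lower training RSS. As in (a), the more flexible model always fits the training data at least as well, whatever the true relationship is.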

__(d)__

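There is not enough information to tell. It depends on `Var(ε)` and on how non-linear the true `f` is: if the relationship is close to linear and the noise is large, the linear model may have the lower test RSS; if it is strongly non-linear, the cubic model's reduction in bias should outweigh its extra variance.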
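A rough simulation shows both outcomes. The true function used here (a line plus a quadratic term of adjustable size), the noise levels, and the helper `mean_test_rss_gap` are hypothetical choices made only to illustrate the trade-off.

```python
import numpy as np

def mean_test_rss_gap(curvature, noise_sd, n=100, n_reps=500, seed=2):
    """Average of (cubic test RSS - linear test RSS) over simulated data sets.
    True f(x) = 1 + 2x + curvature * x^2 (an arbitrary, hypothetical choice)."""
    rng = np.random.default_rng(seed)

    def f(x):
        return 1 + 2 * x + curvature * x ** 2

    gaps = []
    for _ in range(n_reps):
        x_tr, x_te = rng.uniform(-2, 2, n), rng.uniform(-2, 2, n)
        y_tr = f(x_tr) + rng.normal(scale=noise_sd, size=n)
        y_te = f(x_te) + rng.normal(scale=noise_sd, size=n)
        lin = np.polyfit(x_tr, y_tr, deg=1)
        cub = np.polyfit(x_tr, y_tr, deg=3)
        gaps.append(np.sum((y_te - np.polyval(cub, x_te)) ** 2)
                    - np.sum((y_te - np.polyval(lin, x_te)) ** 2))
    return float(np.mean(gaps))

strong = mean_test_rss_gap(curvature=1.0, noise_sd=1.0)   # clearly non-linear truth
weak = mean_test_rss_gap(curvature=0.01, noise_sd=3.0)    # almost linear, noisy truth
print(strong, weak)
```

A negative gap means the cubic model predicts better on test data, a positive gap means the linear model does. Under these assumptions the strongly curved case favours the cubic and the nearly linear, noisy case favours the linear fit.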

__Q5 Q6 Q7: TODO__