# Chapter 1: Why Linear Regression?

Q-1: Given ($\theta_0$, $\theta_1$) = (3, 2) and $y_{true}$ = 200, what is $y - y_{true}$ for x = 100, where y is the value predicted by the line?

A) 3 [Correct]

B) -3

C) 102

D) -102

**Explanation:** 

Equation of the given line will be

y=$\theta_0$ + $\theta_1$x

y=3 + 2x

For x= 100,

y= 3+200= 203

Therefore $y - y_{true}$=203-200=3
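The arithmetic above can be checked with a few lines of Python. This is only a sketch of the calculation in the question; the variable names are chosen here for illustration.

```python
# Parameters and data point taken from the question above.
theta_0, theta_1 = 3, 2
x, y_true = 100, 200

# Prediction from the line y = theta_0 + theta_1 * x.
y_pred = theta_0 + theta_1 * x

# Difference between the predicted and the true value.
difference = y_pred - y_true

print(y_pred, difference)
```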

Q-2: A regression analysis is inappropriate when the pattern of data points forms a spiral.

A) True [Correct]

B) False

**Explanation:**
Linear regression assumes that the relationship between the response (dependent variable) and the feature variables (independent variables) is linear.
It only captures linear relationships, as it's trying to fit a linear model to the data, so it cannot describe a spiral pattern.



Q-3: Which of the following is not an assumption of Linear Regression modeling?

A) Little or no Multicollinearity

B) Heteroscedasticity [Correct]

C) Homoscedasticity

D) Normal distribution of Error Terms

**Explanation:**
Linear regression assumes homoscedasticity (constant variance of the error terms), not heteroscedasticity. Non-constant variance can arise in the presence of outliers or extreme leverage values, and it results in linear regression modeling the data poorly.


# Chapter 2: Ordinary Least Squares

Q-1: The coefficients of the least squares regression line are determined by minimizing the sum of the squares of the

A) X-coordinates
    
B) Y-coordinates
    
C) residuals [Correct]
    

**Explanation:** 
Our goal is to estimate the parameters in the $\beta$ vector given the equation $\varepsilon = \mathbf{y} - X\beta$.

If we summed the raw residuals, positive and negative values would cancel each other out. To avoid that, we take the sum of the squared residuals, which is given by $\varepsilon^T\varepsilon$, and choose $\beta$ to minimise it.

If you didn't get this right, head back to the OLS section.
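The minimisation described above can be sketched for the single-feature case, where the least squares solution has a well-known closed form. The data values and the helper names `fit_simple_ols` and `sum_squared_residuals` are made up for illustration; this is not taken from the OLS section itself.

```python
def fit_simple_ols(xs, ys):
    """Return (intercept, slope) minimising the sum of squared residuals."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return intercept, slope


def sum_squared_residuals(xs, ys, intercept, slope):
    """Sum of (true value - predicted value)^2 over all points."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))


# Illustrative data, not from the course material.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

b0, b1 = fit_simple_ols(xs, ys)
best = sum_squared_residuals(xs, ys, b0, b1)

# Nudging either coefficient away from the OLS solution never lowers the sum.
nudged = [
    sum_squared_residuals(xs, ys, b0 + d0, b1 + d1)
    for d0 in (-0.1, 0.0, 0.1)
    for d1 in (-0.1, 0.0, 0.1)
]
print(b0, b1, best, min(nudged))
```

Comparing `best` against the nudged values is a quick sanity check that the fitted coefficients sit at the minimum of the squared-residual sum.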


Q-2: What is the formula for residuals?

A) True value + Predicted Value

B) True value - Predicted Value [Correct]

C) True value * Predicted Value

D) True value / Predicted Value

**Explanation**

Residuals in a statistical or machine learning model are the differences between observed and predicted values of data. 

# Chapter 3: Implementing Linear Regression with Sklearn

Q-1: Look at the following:

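```python
from sklearn.linear_model import LinearRegression

# X_train, y_train and X_test are assumed to come from an earlier train/test split

# instantiate linear model
model = LinearRegression()

# fit model on training data
model.fit(X_train, y_train)

# apply the fitted model to the test features
var = model.predict(X_test)
```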

What is `var` in the above code snippet?

A) Array of residual errors

B) Array of predicted values [Correct]

C) Array of 0s

D) Array of 1s 

**Explanation**

The `predict()` method of a `LinearRegression()` model (or of any machine learning model) returns the predicted values for the test data, based on the fit to the training data.



   
Q-2: RMSE severely punishes large differences in prediction.

A) True [Correct]

B) False

**Explanation**
Formula for RMSE is as follows:

$$RMSE = \sqrt{\frac{1}{N}\sum\limits_{i = 1}^{N} (y_{i} - \hat{y}_{i})^{2}}$$

Since each difference is squared before averaging, a large error contributes disproportionately more than a small one: doubling an error quadruples its squared contribution.
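To see the effect of squaring, the sketch below compares RMSE with MAE on two made-up sets of predictions that have the same total absolute error. The data and function names are illustrative assumptions.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error, following the formula above."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def mae(y_true, y_pred):
    """Mean absolute error, for comparison."""
    n = len(y_true)
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / n


y_true = [10.0, 10.0, 10.0, 10.0]
even_errors = [12.0, 12.0, 12.0, 12.0]    # four errors of size 2
one_big_error = [10.0, 10.0, 10.0, 18.0]  # a single error of size 8

print(mae(y_true, even_errors), rmse(y_true, even_errors))
print(mae(y_true, one_big_error), rmse(y_true, one_big_error))
```

Both prediction sets have the same MAE, but the set with one large error has the higher RMSE.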

# Concept Quiz: Linear Regression

Q-1: Which of the following is NOT a possible value of the correlation coefficient?

A) -0.9

B) 0

C) +0.15

D) +1.02 [Correct]

**Explanation:**

The correlation coefficient always lies in the range [-1, 1], so +1.02 is not a possible value.
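As a quick illustration, here is a minimal sketch that computes the correlation coefficient by hand, assuming the Pearson correlation coefficient is the one meant. The function name `pearson_r` and the data are made up for this example.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    syy = sum((y - y_mean) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


xs = [1, 2, 3, 4, 5]
perfect_pos = pearson_r(xs, [2, 4, 6, 8, 10])  # perfectly increasing line
perfect_neg = pearson_r(xs, [10, 8, 6, 4, 2])  # perfectly decreasing line
noisy = pearson_r(xs, [3, 1, 4, 1, 5])         # weak, noisy relationship

print(perfect_pos, perfect_neg, noisy)
```

The two perfectly linear cases hit the ends of the range, and any other data set falls strictly between them.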


Q-2: Which of these problems would be more suitable to apply linear regression?

A) Find the status of loan repayment

B) Find the price of car [Correct]

**Explanation:**

Finding the price of a car is a regression problem (the target is a continuous value), whereas finding the status of loan repayment is a classification problem (the target is a category).

Q-3: What is the slope of a line parallel to the X-axis?

A) 1
    
B) 0 [Correct]
  
**Explanation:** 
A line parallel to the X-axis has the equation y = $\theta_0$ (a constant).

Since the slope is $\theta_1$ = (y - $\theta_0$)/x (from y = $\theta_0$ + $\theta_1$x),

substituting y = $\theta_0$ gives $\theta_1$ = 0.

Q-4: The regression line is drawn such that:

A) The line goes through more points than any other possible line, straight or curved

B) The same number of points are below and above the regression line.

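C) The sum of the squared errors is as small as possible. [Correct Answer]

**Explanation:**

Option A describes a line chosen to pass through as many points as possible, which would overfit the data. In other words, on a new, unseen data point such a model is likely to produce a large error.

Option B only balances the number of points on each side of the line and ignores how far they are from it, so it is no better than predicting a point randomly.

Option C is the least squares criterion: the regression line minimises the sum of the squared residuals.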

Q-5: Which of the following metrics can be used for evaluating regression models?

A) Accuracy

B) R Squared [Correct]

C) Adjusted R Squared [Correct]

D) MSE / MAE [Correct]

**Explanation:**

Accuracy is not a suitable metric for regression models. Predictions of a continuous value rarely equal the expected values exactly (except when the model overfits). If every prediction differs from its expected value by only 1%, the accuracy will be zero, even though these predictions are very good.
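The 1% scenario above can be sketched as follows. The data is made up, and R squared is computed here as 1 - RSS/TSS, which is the common definition for arbitrary predictions; treat it as an illustrative assumption rather than the course's own code.

```python
y_true = [100.0, 200.0, 300.0, 400.0]
# Every prediction is off by 1% of the true value.
y_pred = [t * 1.01 for t in y_true]
n = len(y_true)

# Classification-style accuracy: fraction of exact matches.
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / n

# Mean absolute error.
mae = sum(abs(t - p) for t, p in zip(y_true, y_pred)) / n

# R squared as 1 - (residual sum of squares / total sum of squares).
y_mean = sum(y_true) / n
ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
ss_tot = sum((t - y_mean) ** 2 for t in y_true)
r_squared = 1 - ss_res / ss_tot

print(accuracy, mae, r_squared)
```

Accuracy reports total failure, while MAE and R squared both reflect that the predictions are close to the true values.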

Q-6: What will happen to the value of R-squared if we increase the number of features?

A) Increase [Correct]

B) Remain the same [Correct]

C) Decrease

**Explanation:**
R squared is the proportion of the outcome variance that can be explained by the explanatory variables.
R squared can never decrease when features are added to a least squares model evaluated on the same training data.


Let’s consider the following regression model:

$$y_i = \beta_0 + \beta_1 x_{1i} + \cdots + \beta_k x_{ki} + \varepsilon_i,$$

where $i = 1, \dots, n$, or in its matrix form:

$$Y = X\beta + \varepsilon.$$

R squared is defined as

$$R^2 = \frac{ESS}{TSS} = \frac{\sum_{i=1}^{n} (\hat{y}_i - \bar{\hat{y}})^2}{\sum_{i=1}^{n} (y_i - \bar{y})^2}$$

where $\hat{y}_i = y_i - \hat{\varepsilon}_i$. (ESS and TSS stand respectively for Explained Sum of Squares and Total Sum of Squares.)

Now, to understand why $R^2$ inflates when you increase the value of $k$, you have to recall that a regression minimizes the sum of squared errors, by solving

$$\min_{\beta} \sum_{i=1}^{n} \varepsilon_i^2 = \min_{\beta} \sum_{i=1}^{n} (y_i - X_i\beta)^2.$$

The above equation solves for the values of the coefficients such that the squared errors are minimized, or equivalently, for the values of the coefficients such that what you are able to explain, i.e. the $R^2$, is maximized.

Therefore, whenever you add a variable to your model, the value of its estimated coefficient can either be zero, in which case the proportion of explained variance ($R^2$) stays unchanged, or take a nonzero value because it improves the quality of the fit. By construction, your $R^2$ cannot be smaller after adding a variable.
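The argument above can be checked numerically. The sketch below fits least squares with one feature and then with two, solving the normal equations in plain Python. The data, the second feature `x2` and the helper names are made up for illustration.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix, so row operations update both sides together.
    m = [row[:] + [b_i] for row, b_i in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def r_squared(features, ys):
    """Fit OLS with an intercept on the feature columns and return R^2."""
    n = len(ys)
    rows = [[1.0] + [col[i] for col in features] for i in range(n)]
    k = len(rows[0])
    # Normal equations: (X^T X) beta = X^T y.
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(k)]
    beta = solve_linear_system(xtx, xty)
    preds = [sum(b * v for b, v in zip(beta, r)) for r in rows]
    y_mean = sum(ys) / n
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, preds))
    ss_tot = sum((y - y_mean) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x2 = [2.0, 1.0, 4.0, 3.0, 6.0, 5.0]  # hypothetical extra feature
ys = [1.2, 1.9, 3.2, 3.8, 5.1, 5.9]

r2_one = r_squared([x1], ys)
r2_two = r_squared([x1, x2], ys)
print(r2_one, r2_two)
```

For a least squares fit with an intercept, 1 - RSS/TSS equals ESS/TSS, so the value returned here matches the definition given above.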


Q-7: In a simple linear regression model the slope coefficient measures

A) the elasticity of Y with respect to X

B) the change in Y which the model predicts for a unit change in X [Correct Answer]

C) the change in X which the model predicts for a unit change in Y 

D) the ratio Y/X

**Explanation:**

The slope of a line is the rate of change of y with respect to x, calculated as $\frac{y_2-y_1}{x_2-x_1}$. For a unit change in x ($x_2 - x_1 = 1$), the predicted change in y equals the slope.


Q-8: A “Linear regression” model perfectly fits the training data (train error is zero). Which of the following statements is true?

A) Test error is never going to be 0

B) Test error is always going to be 0

C) Neither of the above [Correct Answer]


**Explanation:**

Test error can be zero if the test data follows exactly the same linear relationship as the training data; otherwise it will be non-zero.



Q-9: For the following image which one of the statements is true?

![](sor.png)


A) X has higher sum of residuals than Y

B) X has lower sum of residuals than Y

C) Both have the same sum of residuals [Correct]

**Explanation:**

Both have the same sum of residuals because, for a least squares fit that includes an intercept, the residuals always sum to zero.
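The zero-sum property can be verified on a small example. This sketch assumes a one-feature least squares fit with an intercept; the data and the helper name `fit_simple_ols` are made up for illustration.

```python
def fit_simple_ols(xs, ys):
    """Return (intercept, slope) of the least squares line."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    slope = sxy / sxx
    return y_mean - slope * x_mean, slope


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.5, 5.0, 8.5, 9.0]

intercept, slope = fit_simple_ols(xs, ys)

# Residual = true value - predicted value.
residuals = [y - (intercept + slope * x) for x, y in zip(xs, ys)]
residual_sum = sum(residuals)
squared_sum = sum(r ** 2 for r in residuals)

print(residual_sum, squared_sum)
```

The individual residuals are not zero (the sum of their squares is positive), but the positive and negative ones cancel, up to floating-point rounding.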

Q-10: Changing the units of measurement of the Y variable will affect all but which one of the following?

A) The estimated intercept parameter

B) The Total Sum of Squares for the regression

C) R squared for the regression [Correct Answer]

D) The estimated slope parameter

**Explanation:**

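The R squared value does not change with the scale of Y because it is a ratio whose numerator and denominator both depend on Y in the same way: multiplying Y by a constant $c$ multiplies both sums of squares by $c^2$, leaving the ratio unchanged.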
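A small sketch of this, assuming a one-feature least squares fit and a made-up rescaling of Y by a factor of 100 (for instance a change of units). The helper name `fit_and_score` and the data are illustrative.

```python
def fit_and_score(xs, ys):
    """Return (intercept, slope, tss, r_squared) for a one-feature OLS fit."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    tss = sum((y - y_mean) ** 2 for y in ys)
    rss = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    return intercept, slope, tss, 1 - rss / tss


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.5, 5.0, 8.5, 9.0]
ys_rescaled = [100 * y for y in ys]  # same Y, different unit of measurement

original = fit_and_score(xs, ys)
rescaled = fit_and_score(xs, ys_rescaled)

print(original)
print(rescaled)
```

The intercept and slope scale by 100 and the Total Sum of Squares by 100 squared, while R squared is the same for both fits.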