**Linear and Polynomial Regression Q&A**

---

### 🟦 **Simple Linear Regression**

**Q1. What is Simple Linear Regression?**  
A statistical method that models the relationship between a dependent variable (Y) and a single independent variable (X) by fitting the straight line `Y = mX + c`, where m is the slope and c is the intercept.

**Q2. What are the key assumptions of Simple Linear Regression?**  
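1. Linearity: the mean of Y is a linear function of X  
2. Independence: the errors are independent of one another  
3. Homoscedasticity: the errors have constant variance across all values of X  
4. Normality of residuals (needed mainly for confidence intervals and hypothesis tests)  
5. No multicollinearity (only relevant with multiple predictors, so it holds trivially here)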

**Q3. What does the coefficient m represent in the equation Y = mX + c?**  
It represents the expected change in Y for a one-unit increase in X.

**Q4. What does the intercept c represent in the equation Y = mX + c?**  
The predicted value of Y when X = 0.

**Q5. How do we calculate the slope m in Simple Linear Regression?**  
\[
m = \frac{n\sum XY - \left(\sum X\right)\left(\sum Y\right)}{n\sum X^2 - \left(\sum X\right)^2}
\]
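
The formula can be checked by hand on a small data set. The sketch below is a minimal pure-Python version; the function name `slope_intercept` and the sample data are illustrative assumptions, and the intercept is obtained from the standard identity c = mean(Y) - m * mean(X).

```python
def slope_intercept(xs, ys):
    """Closed-form least-squares slope m and intercept c for Y = mX + c."""
    n = len(xs)
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)

    denom = n * sum_x2 - sum_x ** 2
    if denom == 0:
        raise ValueError("all X values are identical; the slope is undefined")

    m = (n * sum_xy - sum_x * sum_y) / denom
    c = (sum_y - m * sum_x) / n  # same as mean(Y) - m * mean(X)
    return m, c


# Illustrative data lying exactly on Y = 2X + 1
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]
m, c = slope_intercept(xs, ys)
print(m, c)  # 2.0 1.0
```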

**Q6. What is the purpose of the least squares method in Simple Linear Regression?**  
To choose the m and c that minimize the sum of squared differences (residuals) between the actual and predicted Y values.

**Q7. How is the coefficient of determination (R²) interpreted in Simple Linear Regression?**  
R² is the proportion of the variance in Y explained by X. For a least-squares fit with an intercept, it ranges from 0 (no explanatory power) to 1 (perfect fit).
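
As a rough illustration, R² can be computed directly from its definition, 1 - SS_res / SS_tot. The helper name `r_squared` and the numbers below are assumptions made for the example.

```python
def r_squared(y_true, y_pred):
    """R² = 1 - (residual sum of squares) / (total sum of squares)."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    if ss_tot == 0:
        raise ValueError("y_true is constant; R² is undefined")
    return 1 - ss_res / ss_tot


# Illustrative values: two predictions are off by 0.5, two are exact
y_true = [3, 5, 7, 9]
y_pred = [2.5, 5.5, 7.0, 9.0]
r2 = r_squared(y_true, y_pred)
print(r2)  # 0.975
```

Predicting the mean of Y for every observation gives R² = 0, which is why 0 is read as "no better than the mean".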

---

### 🟨 **Multiple Linear Regression**

**Q8. What is Multiple Linear Regression?**  
An extension of simple linear regression that models Y using two or more independent variables.

**Q9. What is the main difference between Simple and Multiple Linear Regression?**  
Simple regression has one independent variable; multiple has two or more.

**Q10. What are the key assumptions of Multiple Linear Regression?**  
Same as simple regression, plus:  
- No perfect multicollinearity (no predictor is an exact linear combination of the others)  
- No autocorrelation of residuals (most relevant for time-ordered data)

**Q11. What is heteroscedasticity, and how does it affect regression results?**  
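Heteroscedasticity means the residuals have unequal variance across the range of predicted values. The coefficient estimates remain unbiased but are no longer efficient, and the usual standard errors are biased, which makes hypothesis tests and confidence intervals unreliable.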

**Q12. How can you improve a Multiple Linear Regression model with high multicollinearity?**  
- Remove or combine correlated predictors  
- Use PCA to replace them with uncorrelated components  
- Apply Ridge or Lasso regression to shrink unstable coefficients
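
To make the Ridge option concrete, here is a minimal NumPy sketch on synthetic data with two nearly identical predictors. The helper `ridge_coef`, the data, and the choice `alpha=1.0` are assumptions for illustration only; in practice a library implementation with a tuned penalty would normally be used.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100
x1 = rng.normal(size=n)
x2 = x1 + rng.normal(scale=0.01, size=n)  # almost an exact copy of x1
X = np.column_stack([x1, x2])
y = 3 * x1 + rng.normal(scale=0.5, size=n)

# Center the data so that no intercept column is needed
Xc = X - X.mean(axis=0)
yc = y - y.mean()


def ridge_coef(X, y, alpha):
    """Closed-form ridge solution (X'X + alpha*I)^-1 X'y; alpha=0 is plain OLS."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + alpha * np.eye(p), X.T @ y)


ols = ridge_coef(Xc, yc, alpha=0.0)
ridge = ridge_coef(Xc, yc, alpha=1.0)
print("OLS:  ", ols)    # typically large, offsetting coefficients
print("Ridge:", ridge)  # shrunk toward a more stable split between x1 and x2
```

The penalty always reduces the overall size of the coefficient vector, which is what stabilizes the estimates when predictors are nearly collinear.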

**Q13. What are common techniques for transforming categorical variables in regression models?**  
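- One-hot encoding (drop one level when the model has an intercept, to avoid perfect multicollinearity)  
- Label encoding (suitable only for ordinal categories, since it imposes a numeric order)  
- Target encoding (useful for high-cardinality features; fit it on training data only to avoid leakage)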
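
One-hot encoding is simple enough to sketch without any library. The function `one_hot` below is a hypothetical helper written for this example; it drops the first category (in sorted order) so that the remaining columns are not perfectly collinear with an intercept. Library helpers such as `pandas.get_dummies` do the same job in practice.

```python
def one_hot(values, drop_first=True):
    """Return (column_names, rows) of 0/1 indicators for a list of category labels."""
    categories = sorted(set(values))
    if drop_first:
        categories = categories[1:]  # the dropped level becomes the baseline
    rows = [[1 if v == cat else 0 for cat in categories] for v in values]
    return categories, rows


colors = ["red", "green", "blue", "green"]
names, encoded = one_hot(colors)
print(names)    # ['green', 'red']
print(encoded)  # [[0, 1], [1, 0], [0, 0], [1, 0]]
```

Here "blue" is the baseline: it is represented by a row of zeros, and the coefficients for "green" and "red" are read as differences from it.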

**Q14. What is the role of interaction terms in Multiple Linear Regression?**  
They let the effect of one predictor on the response depend on the value of another, typically by adding a product term such as `X1 * X2` to the model.
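
A small NumPy sketch of the idea: the interaction is just an extra column holding the product of two predictors. The data are synthetic and noise-free, and the true coefficients (1, 2, 3, 4) are chosen purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
x1 = rng.uniform(0, 5, size=50)
x2 = rng.uniform(0, 5, size=50)

# Illustrative, noise-free response with a genuine interaction effect
y = 1 + 2 * x1 + 3 * x2 + 4 * x1 * x2

# Design matrix: intercept, x1, x2, and the interaction column x1 * x2
A = np.column_stack([np.ones_like(x1), x1, x2, x1 * x2])
coef, *_ = np.linalg.lstsq(A, y, rcond=None)
print(np.round(coef, 6))  # approximately [1. 2. 3. 4.]
```

With the interaction in the model, the effect of a one-unit increase in `x1` is 2 + 4 * `x2` rather than a single constant.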

**Q15. How can the interpretation of intercept differ between Simple and Multiple Regression?**  
Simple: the predicted Y when X = 0  
Multiple: the predicted Y when all predictors = 0 (may not be meaningful if that point lies outside the observed data)

**Q16. What is the significance of the slope in regression, and how does it affect predictions?**  
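The slope is the expected change in the dependent variable per one-unit increase in a predictor (holding the other predictors constant in multiple regression). The larger its absolute value, the more sensitive predictions are to that predictor.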

**Q17. How does the intercept provide context for variable relationships?**  
It's the predicted value of Y when all predictors are zero; serves as a baseline.

**Q18. What are the limitations of using R² alone as a performance metric?**  
- Never decreases when variables are added, even irrelevant ones  
- Doesn't penalize model complexity  
- Adjusted R², which penalizes extra predictors, is a better basis for comparing models
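
The usual adjustment is 1 - (1 - R²)(n - 1)/(n - p - 1), where n is the number of observations and p the number of predictors. The sketch below uses made-up R² values to show how a small gain in R² can still lower the adjusted figure; the function name `adjusted_r2` is an assumption for the example.

```python
def adjusted_r2(r2, n_samples, n_predictors):
    """Adjusted R² = 1 - (1 - R²) * (n - 1) / (n - p - 1)."""
    dof = n_samples - n_predictors - 1
    if dof <= 0:
        raise ValueError("need more observations than predictors + 1")
    return 1 - (1 - r2) * (n_samples - 1) / dof


# Illustrative numbers: adding 7 predictors raises R² only from 0.80 to 0.81
small_model = adjusted_r2(0.80, n_samples=50, n_predictors=3)
large_model = adjusted_r2(0.81, n_samples=50, n_predictors=10)
print(round(small_model, 4))  # 0.787
print(round(large_model, 4))  # 0.7613
```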

**Q19. How would you interpret a large standard error for a regression coefficient?**  
It indicates that the coefficient is estimated imprecisely, which may be due to noisy data, a small sample, or multicollinearity.

**Q20. How can heteroscedasticity be identified in residual plots, and why address it?**  
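Residuals plotted against fitted values form a fan or funnel shape instead of an even band. It should be addressed because it violates the constant-variance assumption and distorts standard errors, and therefore inference.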
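
The fan shape can also be seen numerically. The sketch below simulates data whose noise grows with X (an assumption built into the example), fits a line, and compares the residual spread in the lower and upper halves of X. It is an informal check rather than a formal test.

```python
import numpy as np

rng = np.random.default_rng(2)
x = np.linspace(1, 10, 400)
y = 2 * x + rng.normal(scale=0.5 * x)  # noise standard deviation grows with x

# Fit a straight line and compute residuals
m, c = np.polyfit(x, y, 1)
resid = y - (m * x + c)

# Compare residual spread for small versus large x
median_x = np.median(x)
low_spread = resid[x <= median_x].std()
high_spread = resid[x > median_x].std()
print(low_spread, high_spread)  # the second value should be clearly larger
```

With homoscedastic data the two spreads would be roughly equal; a large gap is the numeric counterpart of the funnel in a residual plot.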

**Q21. What does it mean if R² is high but Adjusted R² is low?**  
The model is likely overfitted: it contains predictors that add little explanatory power, and Adjusted R² penalizes them.

**Q22. Why is it important to scale variables in Multiple Linear Regression?**  
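Ordinary least squares predictions don't depend on the scale of the predictors, but scaling puts coefficients on a comparable footing and matters for regularized methods such as Ridge and Lasso, whose penalties depend on coefficient magnitude.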
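
A minimal sketch of standardization (z-scoring) with NumPy, assuming a small made-up feature matrix whose columns differ in scale by three orders of magnitude. The helper `standardize` is hypothetical; it also returns the mean and standard deviation so that the same transform can be applied to new data.

```python
import numpy as np


def standardize(X):
    """Scale each column to mean 0 and standard deviation 1."""
    X = np.asarray(X, dtype=float)
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0  # constant columns become all zeros instead of NaN
    return (X - mean) / std, mean, std


# Illustrative data: second column is on a much larger scale than the first
X = np.array([[1.0, 1000.0],
              [2.0, 3000.0],
              [3.0, 5000.0]])
X_scaled, mean, std = standardize(X)
print(X_scaled)
```

After scaling, both columns contain the same values, so a Ridge or Lasso penalty treats them equally.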

---

### 🟩 **Polynomial Regression**

**Q23. What is polynomial regression?**  
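A regression model in which the relationship between X and Y is modeled as an nth-degree polynomial in X. It is still linear in its coefficients, so it can be fit with ordinary least squares.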

**Q24. How does polynomial regression differ from linear regression?**  
Polynomial regression adds powers of X as extra features so it can fit curves, while plain linear regression fits a straight line.

**Q25. When is polynomial regression used?**  
When data shows a non-linear relationship between X and Y.

**Q26. What is the general equation for polynomial regression?**  
\[
Y = a + b_1 X + b_2 X^2 + \dots + b_n X^n
\]

**Q27. Can polynomial regression be applied to multiple variables?**  
Yes, by including polynomial terms for each predictor and, optionally, cross (interaction) terms between predictors.

**Q28. What are the limitations of polynomial regression?**  
- Prone to overfitting  
- Less interpretable  
- Poor extrapolation outside data range

**Q29. How to select the degree of a polynomial model?**  
Use cross-validation, adjusted R², AIC, or BIC, and prefer the lowest degree that performs well.
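
The cross-validation approach might look like the sketch below, which scores degrees 1 through 6 with 5-fold cross-validation on synthetic data generated from a quadratic. The data, the degree range, and the use of mean squared error as the metric are all assumptions for the example.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

# Synthetic data: a quadratic trend plus noise
rng = np.random.default_rng(3)
X = rng.uniform(-3, 3, size=(200, 1))
y = 1 + 2 * X[:, 0] - 0.5 * X[:, 0] ** 2 + rng.normal(scale=0.3, size=200)

cv_mse = {}
for degree in range(1, 7):
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    # scikit-learn reports negated MSE so that higher scores are better
    scores = cross_val_score(model, X, y, cv=5, scoring="neg_mean_squared_error")
    cv_mse[degree] = -scores.mean()

best_degree = min(cv_mse, key=cv_mse.get)
for degree, mse in cv_mse.items():
    print(degree, round(mse, 4))
print("lowest cross-validated error at degree", best_degree)
```

Degree 1 should show a much higher error than degree 2 here. Degrees above 2 usually give nearly the same error, so the lowest error is not always at the true degree; when scores are close, the simpler model is the safer choice.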

**Q30. Why is visualization important in polynomial regression?**  
Helps detect overfitting, assess fit, and communicate non-linear trends.

**Q31. How is polynomial regression implemented in Python?**  
```python
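import numpy as np
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline

# Illustrative data: X must be 2-D (n_samples, n_features), y is 1-D
X = np.array([[1.0], [2.0], [3.0], [4.0], [5.0]])
y = np.array([1.0, 4.0, 9.0, 16.0, 25.0])

# Expand X into [1, X, X^2], then fit ordinary least squares on those features
model = make_pipeline(PolynomialFeatures(degree=2), LinearRegression())
model.fit(X, y)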
```
