# Introduction to Statistical Learning - Chapter 3

- [3. Classification](#3.-Classification)
    * [3.1. Logistic Regression](#3.1.-Logistic-Regression)
        + [3.1.1. The Logistic Model](#3.1.1.-The-Logistic-Model)

# 3. Classification

- A situation where the response variable is `qualitative` instead of quantitative
    * Techniques used to predict a qualitative response include:
        + Logistic regression
        + Linear discriminant analysis
        + K-nearest neighbours
    * Other more computer-intensive methods include:
        + Generalized additive models
        + Trees and random forests and boosting
        + Support vector machines
- Coding a qualitative response as numbers (e.g. 1, 2, 3) and fitting a linear regression wrongly assumes that the categories are ordered and that the difference between each pair of adjacent categories is the same
- Two types of qualitative data:
    * Ordinal
        + Variables with a specific order (Low, Medium, High)
    * Nominal
        + Variables with no specific order (Book, Television, Car)

## 3.1. Logistic Regression

- Logistic regression models the probability that the response $Y$ belongs to a particular category
    * Values will range between 0 and 1
    
$$ p(X) = \Pr(Y = 1 \mid X) $$

### 3.1.1. The Logistic Model

- To model $p(X)$ so that its output lies between 0 and 1 for all values of $X$, we can use the logistic function

$$ p(X) = \frac{e^{\beta_{0} + \beta_{1}X}}{1 + e^{\beta_{0} + \beta_{1}X}} $$

Rearranging gives

$$ \frac{p(X)}{1-p(X)} = e^{\beta_{0} + \beta_{1}X} $$

where $\frac{p(X)}{1-p(X)}$ is the `odds` and can take on any value between 0 and $\infty$

- Values of the odds close to 0 and $\infty$ indicate *very low* and *very high* probabilities of the response belonging to a particular category, respectively
- By taking the logarithm of both sides of the odds equation, we get the log-odds or logit

$$ \log\left(\frac{p(X)}{1-p(X)}\right) = \beta_{0} + \beta_{1}X $$

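where increasing $X$ by one unit changes the log odds by $\beta_{1}$.

- However, $\beta_{1}$ does not correspond to the change in $p(X)$ associated with a one-unit increase in $X$, because the relationship between $p(X)$ and $X$ is not a straight line
    * The amount that $p(X)$ changes depends on the current value of $X$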
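
The relationships in this section can be checked numerically. The following is a minimal sketch in Python, using only the standard library. The coefficient values and the helper names `logistic`, `odds` and `logit` are assumptions chosen for illustration and do not come from a fitted model. It prints, for several values of $X$, the probability, the odds, and how much the log odds and the probability change when $X$ increases by one unit.

```python
import math

def logistic(x, b0, b1):
    """p(X) = e^(b0 + b1*x) / (1 + e^(b0 + b1*x)), in a numerically stable form."""
    t = b0 + b1 * x
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)

def odds(p):
    """Odds p / (1 - p) for a probability strictly between 0 and 1."""
    return p / (1.0 - p)

def logit(p):
    """Log-odds of a probability strictly between 0 and 1."""
    return math.log(odds(p))

# Hypothetical coefficients, chosen only for illustration
b0, b1 = -2.0, 0.8

for x in [0, 1, 2, 3, 4]:
    p_now = logistic(x, b0, b1)
    p_next = logistic(x + 1, b0, b1)
    print(
        f"x={x}: p={p_now:.3f}, odds={odds(p_now):.3f}, "
        f"log-odds step={logit(p_next) - logit(p_now):.3f}, "
        f"p step={p_next - p_now:.3f}"
    )
```

The log-odds step equals $\beta_{1}$ on every row, while the step in $p(X)$ differs from row to row.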

### 3.1.2. Estimating the Regression Coefficients

- Maximum likelihood is used to fit a logistic regression model
    * It seeks estimates for $\beta_{0}$ and $\beta_{1}$ such that the predicted probability $\hat{p}(X_{i})$ for each observation corresponds as closely as possible to the observed response
        + A one-unit increase in the predictor $X$ is associated with a change of $\hat{\beta}_{1}$ in the log odds of the response ($Y = 1$)
- To measure the accuracy of the coefficient estimates, their standard errors can be computed
    * The *z-statistic* associated with $\beta_{1}$ is equal to $\hat{\beta}_{1}/SE(\hat{\beta}_{1})$
        + A large absolute value of the *z-statistic* (and a correspondingly small p-value, e.g. p < 0.05) is evidence against the null hypothesis $H_{0}: \beta_{1} = 0$
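
As an illustration of the fitting procedure, here is a minimal sketch in standard-library Python. It assumes a single predictor and maximises the log-likelihood $\sum_{i} \left[ y_{i} \log p(x_{i}) + (1 - y_{i}) \log(1 - p(x_{i})) \right]$ with Newton-Raphson steps, which is one common way to compute the maximum likelihood estimates. The data are synthetic, and the "true" coefficients, the sample size and the function names (`sigmoid`, `score_and_information`, `fit_logistic`) are assumptions made for this example. Standard errors are taken from the inverse of the information matrix at the estimate.

```python
import math
import random

def sigmoid(t):
    """Logistic function, written to avoid overflow for large |t|."""
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)

def score_and_information(xs, ys, b0, b1):
    """Gradient of the log-likelihood and the entries of the 2x2 information matrix."""
    g0 = g1 = 0.0
    i00 = i01 = i11 = 0.0
    for x, y in zip(xs, ys):
        p = sigmoid(b0 + b1 * x)
        w = p * (1.0 - p)
        g0 += y - p
        g1 += (y - p) * x
        i00 += w
        i01 += w * x
        i11 += w * x * x
    return (g0, g1), (i00, i01, i11)

def fit_logistic(xs, ys, n_iter=25):
    """Newton-Raphson maximum likelihood; returns estimates and standard errors."""
    b0 = b1 = 0.0
    for _ in range(n_iter):
        (g0, g1), (i00, i01, i11) = score_and_information(xs, ys, b0, b1)
        det = i00 * i11 - i01 * i01
        # Newton step: beta <- beta + I^{-1} * gradient (2x2 inverse written out)
        b0 += (i11 * g0 - i01 * g1) / det
        b1 += (i00 * g1 - i01 * g0) / det
    # Re-evaluate the information matrix at the final estimate
    _, (i00, i01, i11) = score_and_information(xs, ys, b0, b1)
    det = i00 * i11 - i01 * i01
    # Standard errors are the square roots of the diagonal of I^{-1}
    se0 = math.sqrt(i11 / det)
    se1 = math.sqrt(i00 / det)
    return (b0, b1), (se0, se1)

# Synthetic data generated from assumed "true" coefficients
random.seed(0)
true_b0, true_b1 = -0.5, 1.2
xs = [random.uniform(-3.0, 3.0) for _ in range(500)]
ys = [1 if random.random() < sigmoid(true_b0 + true_b1 * x) else 0 for x in xs]

(b0_hat, b1_hat), (se0, se1) = fit_logistic(xs, ys)
z1 = b1_hat / se1
print(f"b0_hat={b0_hat:.3f} (SE {se0:.3f}), b1_hat={b1_hat:.3f} (SE {se1:.3f}), z={z1:.2f}")
```

In practice a statistics library would normally do this fit. The hand-written version is only meant to show where the estimates, standard errors and z-statistic come from.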

### 3.1.3. Making Predictions

- Qualitative predictors can be used in the logistic regression model through the dummy variable approach
    * We assign a dummy variable $X$ that takes on a value of either 1 or 0

**For a dummy value of 1:**

$$ p(1) = \frac{e^{\beta_{0} + \beta_{1}}}{1 + e^{\beta_{0} + \beta_{1}}} $$

**For a dummy value of 0:**

$$ p(0) = \frac{e^{\beta_{0}}}{1 + e^{\beta_{0}}} $$
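
To make the two cases concrete, the sketch below plugs a dummy value of 1 and then 0 into the logistic function. The coefficient estimates are hypothetical values picked for illustration, and `predicted_probability` is a helper name introduced here.

```python
import math

def predicted_probability(dummy, b0, b1):
    """p(X) for a single 0/1 dummy predictor under the logistic model."""
    t = b0 + b1 * dummy
    return math.exp(t) / (1.0 + math.exp(t))

# Hypothetical coefficient estimates, not taken from any real fit
b0_hat, b1_hat = -3.5, 0.4

p_one = predicted_probability(1, b0_hat, b1_hat)   # dummy variable equal to 1
p_zero = predicted_probability(0, b0_hat, b1_hat)  # dummy variable equal to 0

print(f"dummy=1: {p_one:.4f}")
print(f"dummy=0: {p_zero:.4f}")
```

With a positive $\hat{\beta}_{1}$ the predicted probability is higher when the dummy equals 1, and the ratio of the two odds is $e^{\hat{\beta}_{1}}$.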

### 3.1.4. Multiple Logistic Regression