# **Logistic Regression Overview**

Logistic regression is a statistical and machine learning technique used for binary classification problems. It models the probability of a dependent variable belonging to a particular class based on one or more independent variables. Unlike linear regression, logistic regression outputs probabilities that can be mapped to class labels using a decision threshold.

---

## **Approximation**

Logistic regression models the probability of the target variable belonging to a class as follows:

\\[ P(y=1|x) = \sigma(wx + b) \\]

\\[ P(y=0|x) = 1 - \sigma(wx + b) \\]

**Where:**

- \\(\sigma(z)\\): Sigmoid function, defined as \\(\sigma(z) = \frac{1}{1 + e^{-z}}\\), maps the linear combination of features to a probability.
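- \\(w\\): Weight (slope). With several independent variables, \\(w\\) holds one weight per variable and \\(wx\\) denotes their dot product.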
- \\(x\\): Independent variable(s).
- \\(b\\): Intercept (bias term).
- \\(P(y=1|x)\\): Probability that the target variable \\(y\\) belongs to class 1.
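
The definitions above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not a reference implementation: the function names `sigmoid`, `predict_proba`, and `predict_label`, the example weights, and the default threshold of 0.5 are all assumptions made for the example.

```python
import math


def sigmoid(z):
    """Map any real number z into the open interval (0, 1)."""
    # Branch on the sign of z so math.exp never receives a large
    # positive argument, which would raise OverflowError.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict_proba(x, w, b):
    """Return P(y=1 | x) = sigmoid(w . x + b) for one feature vector x."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return sigmoid(z)


def predict_label(x, w, b, threshold=0.5):
    """Map the probability to a class label with a decision threshold."""
    return 1 if predict_proba(x, w, b) >= threshold else 0


# Hypothetical parameters, chosen so that the linear term is exactly zero.
w = [2.0, -1.0]
b = 0.0
x = [1.0, 2.0]                    # z = 2*1 + (-1)*2 + 0 = 0

p1 = predict_proba(x, w, b)       # P(y=1 | x) = sigmoid(0) = 0.5
p0 = 1.0 - p1                     # P(y=0 | x)
label = predict_label(x, w, b)    # 1, because 0.5 >= threshold
```

Raising or lowering `threshold` trades false positives against false negatives without changing the fitted probabilities themselves.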

---

## **Evaluation Metric**

### **Log Loss (Cross-Entropy Loss):**

Logistic regression is trained by minimizing the log loss, which also measures how well the predicted probabilities match the actual labels:

\\[
LogLoss = J(w, b) = -\frac{1}{N} \sum_{i=1}^N \left[ y_i \log(\hat{y}_i) + (1 - y_i) \log(1 - \hat{y}_i) \right]
\\]

**Where:**

- \\(y_i\\): Actual class label of the \\(i\\)-th observation (0 or 1).
- \\(\hat{y}_i\\): Predicted probability for the \\(i\\)-th observation.
- \\(N\\): Number of observations.
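
As a rough sketch, the formula above translates directly into Python. The function name `log_loss`, the clipping constant `eps`, and the sample labels and probabilities are assumptions for illustration. Clipping is a common practical safeguard rather than part of the formula: it keeps `math.log` from receiving exactly 0.

```python
import math


def log_loss(y_true, y_prob, eps=1e-15):
    """Average cross-entropy between 0/1 labels and predicted probabilities."""
    if len(y_true) != len(y_prob) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    total = 0.0
    for y, p in zip(y_true, y_prob):
        # Clip p into [eps, 1 - eps] so that log(0) is never evaluated.
        p = min(max(p, eps), 1.0 - eps)
        total += y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return -total / len(y_true)


# Hypothetical labels and predicted probabilities for four observations.
y_true = [1, 0, 1, 0]
y_prob = [0.9, 0.1, 0.8, 0.3]

loss = log_loss(y_true, y_prob)   # roughly 0.1976
```

Only one of the two terms inside the sum is non-zero for each observation: the first when \\(y_i = 1\\), the second when \\(y_i = 0\\).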

---

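Log loss penalizes predictions that are far from the true labels, and the penalty grows without bound as a predicted probability approaches the wrong extreme (for example, \\(\hat{y}_i \to 0\\) when \\(y_i = 1\\)). Minimizing it therefore pushes the model to assign high probability to the correct class.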
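
To make the size of the penalty concrete, the short sketch below evaluates the per-observation loss for a true label of 1 at three predicted probabilities. The helper name `sample_loss` and the chosen probabilities are hypothetical, picked only to contrast a confident correct, a hesitant, and a confident wrong prediction.

```python
import math


def sample_loss(y, p):
    """Log loss contribution of a single observation with label y (0 or 1)."""
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))


# True label is 1 in every case; only the predicted probability changes.
confident_right = sample_loss(1, 0.99)   # about 0.01
hesitant = sample_loss(1, 0.60)          # about 0.51
confident_wrong = sample_loss(1, 0.01)   # about 4.61

for name, value in [("confident, correct", confident_right),
                    ("hesitant", hesitant),
                    ("confident, wrong", confident_wrong)]:
    print(f"{name:20s} loss = {value:.2f}")
```

The confident wrong prediction costs several hundred times more than the confident correct one, which is why a model trained on this loss is discouraged from being overconfident in the wrong class.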

---