
## 1

**Linear Regression:**
- **Purpose:** Predicts a continuous output variable based on input features.
- **Model:** $\hat{y} = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \dots + \beta_n x_n$
- **Use Case:** Predicting house prices based on features like size, location, and number of bedrooms.

**Logistic Regression:**
- **Purpose:** Predicts a binary (0 or 1) output variable based on input features.
- **Model:** Outputs the probability that the dependent variable belongs to the positive class by passing a linear combination of the features through the logistic (sigmoid) function:

  $$P(y = 1 \mid x) = \frac{1}{1 + e^{-(\beta_0 + \beta_1 x_1 + \dots + \beta_n x_n)}}$$
- **Use Case:** Predicting whether an email is spam (1) or not spam (0) based on features like the presence of certain keywords.
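As a rough illustration of how the logistic function turns a linear score into a class probability, here is a minimal sketch in plain Python. The coefficients, the two keyword-count features, and the 0.5 decision threshold are made-up assumptions for the spam example, not values from a fitted model.

```python
import math

def sigmoid(z):
    """Logistic function: maps any real number into the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(features, weights, bias):
    """Probability of class 1 for one sample under a logistic regression model."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)

# Hypothetical spam model: two keyword-count features with invented coefficients.
weights = [1.2, 0.8]
bias = -2.0

p_spam = predict_proba([2, 1], weights, bias)  # z = -2.0 + 2.4 + 0.8 = 1.2
label = 1 if p_spam >= 0.5 else 0              # 0.5 is a conventional default threshold
print(round(p_spam, 3), label)                 # 0.769 1
```

The threshold is a separate choice from the model itself: lowering it flags more emails as spam, raising it flags fewer.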

**Example Scenario for Logistic Regression:**
- **Scenario:** A healthcare application predicting whether a patient has diabetes (1) or not (0) based on features like age, BMI, blood pressure, and blood sugar levels.
- **Reason:** The output is binary (diabetes or no diabetes), making logistic regression more appropriate than linear regression.

## 2

**Cost Function:**
- The cost function in logistic regression is the **logistic loss** (also known as log-loss or binary cross-entropy):

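$$J(\beta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log \hat{p}^{(i)} + \left(1 - y^{(i)}\right) \log\left(1 - \hat{p}^{(i)}\right) \right]$$

where $m$ is the number of training examples, $y^{(i)}$ is the true label of example $i$, and $\hat{p}^{(i)}$ is the predicted probability that example $i$ belongs to class 1.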
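To make the log-loss concrete, the sketch below computes it for two sets of predictions on the same labels. The labels and probabilities are invented for illustration, and the clipping constant `eps` is an assumption added to avoid taking `log(0)`.

```python
import math

def log_loss(y_true, y_prob, eps=1e-15):
    """Mean binary cross-entropy over a set of predictions."""
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1 - eps)  # clip so log() never receives 0
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return -total / len(y_true)

y_true = [1, 0, 1, 0]
mostly_right = [0.9, 0.1, 0.8, 0.2]  # confident and correct
mostly_wrong = [0.1, 0.9, 0.2, 0.8]  # confident and incorrect

print(log_loss(y_true, mostly_right))
print(log_loss(y_true, mostly_wrong))
```

The second value is much larger than the first, which shows the key property of this loss: confident wrong predictions are penalized heavily.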

**Optimization:**
- The cost function has no closed-form minimizer, so it is minimized iteratively using **Gradient Descent** or related optimization algorithms such as:
  - **Stochastic Gradient Descent (SGD)**
  - **Mini-batch Gradient Descent**
  - **Advanced Optimizers:** such as Adam or L-BFGS
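A minimal sketch of batch gradient descent on the log-loss, written in plain Python so it has no dependencies. The toy data, learning rate, and epoch count are arbitrary assumptions; a real project would more likely rely on a library optimizer.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(X, y, lr=0.1, epochs=2000):
    """Batch gradient descent on the mean log-loss; returns (weights, bias)."""
    n_samples, n_features = len(X), len(X[0])
    weights, bias = [0.0] * n_features, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_features, 0.0
        for xi, yi in zip(X, y):
            # For log-loss with a sigmoid, the per-sample gradient is (p - y) * x.
            error = sigmoid(bias + sum(w * v for w, v in zip(weights, xi))) - yi
            for j in range(n_features):
                grad_w[j] += error * xi[j]
            grad_b += error
        weights = [w - lr * g / n_samples for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n_samples
    return weights, bias

# Toy, made-up data: one feature, class 1 for larger values.
X = [[0.5], [1.0], [1.5], [3.0], [3.5], [4.0]]
y = [0, 0, 0, 1, 1, 1]

weights, bias = fit_logistic(X, y)
predictions = [1 if sigmoid(bias + weights[0] * x[0]) >= 0.5 else 0 for x in X]
print(predictions)
```

Stochastic and mini-batch variants follow the same update rule but compute the gradient on one sample or a small subset per step instead of the full dataset.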

## 3

**Regularization:**
- Regularization involves adding a penalty term to the cost function to constrain the model complexity, discouraging the model from fitting the noise in the training data.

**Types:**
1. **L1 Regularization (Lasso):** Adds the sum of the absolute values of the coefficients as a penalty term:

   $$J_{L1}(\beta) = J(\beta) + \lambda \sum_{j=1}^{n} |\beta_j|$$

2. **L2 Regularization (Ridge):** Adds the sum of the squared coefficients as a penalty term:

   $$J_{L2}(\beta) = J(\beta) + \lambda \sum_{j=1}^{n} \beta_j^2$$

   Here $J(\beta)$ is the unregularized cost and $\lambda \ge 0$ controls the strength of the penalty.

**Prevention of Overfitting:**
- Regularization discourages large coefficients, which can cause the model to be overly sensitive to small fluctuations in the training data. This leads to a more generalized model that performs better on unseen data.
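The sketch below shows how the two penalties grow with coefficient size. The coefficient vectors, the base cost, and the strength `lam` are illustrative assumptions; it also assumes the intercept is excluded from the penalty, which is a common convention rather than something stated above.

```python
def l1_penalty(coefs, lam):
    """Lasso penalty: lam times the sum of absolute coefficient values."""
    return lam * sum(abs(b) for b in coefs)

def l2_penalty(coefs, lam):
    """Ridge penalty: lam times the sum of squared coefficient values."""
    return lam * sum(b * b for b in coefs)

def regularized_cost(base_cost, coefs, lam, kind="l2"):
    """Unregularized cost plus the chosen penalty (intercept assumed excluded)."""
    penalty = l1_penalty(coefs, lam) if kind == "l1" else l2_penalty(coefs, lam)
    return base_cost + penalty

coefs_small = [0.5, -0.3, 0.2]
coefs_large = [5.0, -3.0, 2.0]
lam = 0.1

for name, coefs in [("small", coefs_small), ("large", coefs_large)]:
    print(name, l1_penalty(coefs, lam), l2_penalty(coefs, lam))
```

Scaling the coefficients by 10 multiplies the L1 penalty by 10 and the L2 penalty by 100, so L2 punishes large coefficients disproportionately, while L1 applies the same pressure at every magnitude and can drive small coefficients exactly to zero.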


## 4

**ROC Curve:**
- The Receiver Operating Characteristic (ROC) curve plots the **True Positive Rate (TPR)** against the **False Positive Rate (FPR)** at various threshold settings.

**True Positive Rate (TPR) / Sensitivity:**

TPR = TP/(TP + FN)

**False Positive Rate (FPR):**

FPR = FP/(FP + TN)

**Usage:**
- The area under the ROC curve (AUC-ROC) summarizes how well the model separates the positive and negative classes across all thresholds. An AUC of 0.5 corresponds to random guessing and 1.0 to perfect separation, so a higher AUC indicates better discrimination.
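The following sketch builds ROC points by sweeping the threshold over the predicted scores and then estimates the AUC with the trapezoidal rule. The labels and scores are invented, and the code assumes both classes are present.

```python
def roc_points(y_true, scores):
    """Return (FPR, TPR) pairs, from the strictest threshold to the loosest."""
    positives = sum(y_true)
    negatives = len(y_true) - positives
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 0)
        points.append((fp / negatives, tp / positives))
    return points

def auc(points):
    """Area under a piecewise-linear curve by the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area

y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

points = roc_points(y_true, scores)
print(points)
print(auc(points))  # 8/9, about 0.889
```

The result matches the ranking interpretation of AUC: of the 9 positive-negative pairs in this toy data, 8 have the positive example scored higher.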

## 5

**Common Techniques:**
1. **Filter Methods:**
   - **Statistical Tests:** Use chi-square tests, ANOVA, or correlation coefficients to select features.
2. **Wrapper Methods:**
   - **Recursive Feature Elimination (RFE):** Iteratively builds the model and removes the least significant features.
3. **Embedded Methods:**
   - **Regularization:** Use Lasso (L1) regularization, which can shrink some coefficients to zero, effectively performing feature selection.
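4. **Information Gain:**
   - Rank features by their information gain with respect to the target variable and keep the highest-scoring ones. In practice this works as a filter-style criterion, since it scores features without training the final model.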

**Benefits:**
- Reduces model complexity.
- Improves model interpretability.
- Decreases overfitting by removing irrelevant or redundant features.
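As one simple instance of a filter method, the sketch below ranks features by the absolute Pearson correlation with the target and keeps the top `k`. The feature names and values are hypothetical, and correlation is only one of the scoring choices listed above.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def select_top_k(features, target, k):
    """features: dict of name -> values. Returns the k names with the largest |r|."""
    ranked = sorted(
        features,
        key=lambda name: abs(pearson(features[name], target)),
        reverse=True,
    )
    return ranked[:k]

# Hypothetical patient data; the target is 1 for diabetes, 0 otherwise.
features = {
    "blood_sugar": [90, 95, 100, 150, 160, 170],
    "bmi": [22, 30, 24, 31, 27, 33],
    "shoe_size": [42, 38, 38, 42, 38, 42],
}
target = [0, 0, 0, 1, 1, 1]

print(select_top_k(features, target, k=2))  # ['blood_sugar', 'bmi']
```

A filter like this scores each feature in isolation, so it is cheap but cannot detect features that are only useful in combination; wrapper and embedded methods address that at higher cost.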

## 6

**Strategies:**
1. **Resampling Techniques:**
   - **Oversampling:** Increase the number of instances in the minority class (e.g., SMOTE).
   - **Undersampling:** Decrease the number of instances in the majority class.
2. **Class Weighting:**
   - Assign higher weights to the minority class to penalize misclassifications more.
3. **Synthetic Data Generation:**
   - Generate new synthetic minority-class samples, rather than duplicating existing ones, using techniques like SMOTE.
4. **Anomaly Detection Models:**
   - When the minority class is extremely rare, treat it as an anomaly and use models such as one-class SVM.
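Two of the strategies above can be sketched in a few lines of plain Python. The weighting formula `n_samples / (n_classes * class_count)` is the heuristic scikit-learn uses for `class_weight="balanced"`; the oversampler here is plain random duplication, not SMOTE, and the data and seed are made up for illustration.

```python
import random
from collections import Counter

def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}

def random_oversample(X, y, seed=0):
    """Duplicate randomly chosen rows of smaller classes until all counts match."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for cls, count in counts.items():
        rows = [x for x, label in zip(X, y) if label == cls]
        for _ in range(target - count):
            X_out.append(rng.choice(rows))
            y_out.append(cls)
    return X_out, y_out

# Made-up imbalanced dataset: 8 negatives, 2 positives.
y = [0] * 8 + [1] * 2
X = [[i] for i in range(10)]

class_weights = balanced_class_weights(y)
X_bal, y_bal = random_oversample(X, y)
print(class_weights)   # {0: 0.625, 1: 2.5}
print(Counter(y_bal))
```

Resampling should be applied to the training split only; resampling before splitting lets duplicated rows leak into the evaluation data.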

## 7

**Common Issues and Challenges:**
1. **Multicollinearity:**
   - Occurs when independent variables are highly correlated with each other, which leads to unstable coefficient estimates that are hard to interpret.
   - **Solutions:**
     - Remove or combine correlated predictors.
     - Use **Ridge Regression** (L2 regularization) to mitigate the effects of multicollinearity.
2. **Imbalanced Data:**
   - Class imbalance can bias the model toward the majority class, so overall accuracy looks good while the minority class is predicted poorly.
   - **Solutions:** Apply resampling techniques, use class weighting, or implement synthetic data generation.
3. **Overfitting:**
   - The model performs well on training data but poorly on new data.
   - **Solutions:** Use regularization (L1, L2), simplify the model, or collect more training data.
4. **Feature Scaling:**
   - Features on very different scales slow the convergence of gradient-based optimizers and cause regularization to penalize coefficients unevenly.
   - **Solutions:** Normalize or standardize the features before training the model.
5. **Non-linearity:**
   - Logistic regression assumes a linear relationship between the independent variables and the log-odds of the dependent variable.
   - **Solutions:** Use polynomial features or transformations, or consider more complex models like decision trees or neural networks if non-linearity is significant.
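For the feature scaling item above, standardization (z-scoring) can be sketched as follows. The `ages` column is made-up data, and the function uses the population standard deviation, which is an assumption about the preferred convention.

```python
import math

def standardize(column):
    """Rescale a column to zero mean and unit (population) standard deviation."""
    mean = sum(column) / len(column)
    std = math.sqrt(sum((v - mean) ** 2 for v in column) / len(column))
    if std == 0:
        return [0.0 for _ in column]  # constant column: nothing to scale
    return [(v - mean) / std for v in column]

ages = [25, 35, 45, 55, 65]
scaled_ages = standardize(ages)
print([round(v, 3) for v in scaled_ages])  # [-1.414, -0.707, 0.0, 0.707, 1.414]
```

In practice the mean and standard deviation should be computed on the training data only and then reused to transform validation and test data.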