Q1. What is the mathematical formula for a linear SVM?

The standard formulation of a linear Support Vector Machine (SVM) is the soft-margin optimization problem below:

1. **Objective Function:**
   Minimize the cost function:
   \[
   \frac{1}{2} \| \mathbf{w} \|^2 + C \sum_{i=1}^{N} \xi_i
   \]
   where:
   - \(\mathbf{w}\) is the weight vector.
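   - \(C > 0\) is the regularization parameter; a larger \(C\) penalizes margin violations more heavily.
   - \(\xi_i\) are the slack variables measuring how far each of the \(N\) training examples violates the margin.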

2. **Constraints:**
   For each training example \((\mathbf{x}_i, y_i)\), where \(y_i \in \{+1, -1\}\) is the class label, the constraints are:
   \[
   y_i (\mathbf{w}^T \mathbf{x}_i + b) \geq 1 - \xi_i, \qquad \xi_i \geq 0, \qquad i = 1, \dots, N
   \]
   where:
   - \(\mathbf{x}_i\) is the feature vector.
   - \(b\) is the bias term.

In summary, the linear SVM finds the weight vector \(\mathbf{w}\) and bias \(b\) that maximize the margin between the two classes while keeping the total margin violation small. A new point \(\mathbf{x}\) is then classified by the sign of \(\mathbf{w}^T \mathbf{x} + b\).
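The formulas above can be evaluated directly for a given \(\mathbf{w}\) and \(b\). The following is a minimal sketch in plain Python, not a training routine: the toy data, the values of `w`, `b`, and `C`, and the function names are all hypothetical and chosen only for illustration. Each slack is set to the smallest value that satisfies its constraint.

```python
def decision_value(w, b, x):
    """Return w^T x + b for a single feature vector x."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def predict(w, b, x):
    """Classify x as +1 or -1 by the sign of the decision value."""
    return 1 if decision_value(w, b, x) >= 0 else -1


def primal_objective(w, b, X, y, C):
    """Evaluate (1/2)||w||^2 + C * sum(xi_i) for fixed w and b.

    Each slack xi_i is the smallest value satisfying
    y_i (w^T x_i + b) >= 1 - xi_i and xi_i >= 0.
    """
    slacks = [max(0.0, 1.0 - yi * decision_value(w, b, xi))
              for xi, yi in zip(X, y)]
    objective = 0.5 * sum(wj * wj for wj in w) + C * sum(slacks)
    return objective, slacks


# Hypothetical toy data and parameters (assumptions, not from the text).
X = [(2.0, 0.0), (3.0, 1.0), (0.0, 0.0), (0.5, 0.0)]
y = [1, 1, -1, -1]
w, b, C = (1.0, 0.0), -1.0, 1.0

objective, slacks = primal_objective(w, b, X, y, C)
print("slacks:", slacks)        # only the last point violates the margin
print("objective:", objective)
```

Here the point `(0.5, 0.0)` is correctly classified but sits inside the margin, so it is the only one with a non-zero slack.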

Q2. What is the objective function of a linear SVM?

The objective function of a linear Support Vector Machine (SVM) is:

\[
\text{Minimize } \frac{1}{2} \| \mathbf{w} \|^2 + C \sum_{i=1}^{N} \xi_i
\]

where:

- \(\mathbf{w}\) is the weight vector.
- \(C\) is the regularization parameter.
- \(\xi_i\) are the slack variables representing margin violations.

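The first term rewards a wide margin: the margin width is \(2 / \| \mathbf{w} \|\), so a smaller \(\| \mathbf{w} \|\) means a larger margin. The second term penalizes margin violations and misclassifications. The minimization is over \(\mathbf{w}\), the bias \(b\), and the slack variables, with \(C\) setting the balance between the two terms.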
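The slack variables can be eliminated to give an equivalent unconstrained form, often called the hinge-loss form. As a short sketch, assuming the usual constraints \(y_i (\mathbf{w}^T \mathbf{x}_i + b) \geq 1 - \xi_i\) and \(\xi_i \geq 0\): for fixed \(\mathbf{w}\) and \(b\), the objective is smallest when each \(\xi_i\) takes the smallest value its constraints allow, so

\[
\xi_i = \max\bigl(0,\; 1 - y_i (\mathbf{w}^T \mathbf{x}_i + b)\bigr)
\quad \Longrightarrow \quad
\min_{\mathbf{w},\, b} \; \frac{1}{2} \| \mathbf{w} \|^2 + C \sum_{i=1}^{N} \max\bigl(0,\; 1 - y_i (\mathbf{w}^T \mathbf{x}_i + b)\bigr)
\]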

Q3. What is the kernel trick in SVM?

The kernel trick in Support Vector Machines (SVM) is a technique used to handle non-linearly separable data. It involves using a kernel function to implicitly map the original feature space into a higher-dimensional space where the data may be linearly separable, without explicitly computing the coordinates in that higher-dimensional space.

**Key Points:**

- **Kernel Function:** A function \( K(\mathbf{x}, \mathbf{x}') = \phi(\mathbf{x}) \cdot \phi(\mathbf{x}') \) computes the dot product in the higher-dimensional space directly from the original features \(\mathbf{x}\) and \(\mathbf{x}'\), where \(\phi\) is the implicit feature map.
- **Common Kernels:** Examples include polynomial kernels \( K(\mathbf{x}, \mathbf{x}') = (\mathbf{x} \cdot \mathbf{x}' + c)^d \) and radial basis function (RBF) kernels \( K(\mathbf{x}, \mathbf{x}') = \exp(-\gamma \| \mathbf{x} - \mathbf{x}' \|^2) \).

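The kernel trick works because the SVM dual problem and its decision function depend on the training data only through dot products. Replacing each dot product with the kernel function yields a non-linear decision boundary in the original space without ever computing the higher-dimensional coordinates.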
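The equivalence between a kernel and a dot product in a higher-dimensional space can be checked numerically. The sketch below does so for the polynomial kernel with \(c = 1\) and \(d = 2\) on 2D inputs, where the explicit feature map (called `phi` here) has six components. The function names, the sample points, and the default `gamma` are illustrative assumptions.

```python
import math


def dot(u, v):
    """Plain dot product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def poly_kernel(x, z, c=1.0, d=2):
    """Polynomial kernel K(x, z) = (x . z + c)^d."""
    return (dot(x, z) + c) ** d


def phi(x):
    """Explicit feature map matching poly_kernel for 2D input, c=1, d=2."""
    x1, x2 = x
    s = math.sqrt(2.0)
    return (1.0, s * x1, s * x2, x1 * x1, x2 * x2, s * x1 * x2)


def rbf_kernel(x, z, gamma=0.5):
    """RBF kernel K(x, z) = exp(-gamma * ||x - z||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-gamma * sq_dist)


# Hypothetical sample points.
x, z = (1.0, 2.0), (3.0, -1.0)

k_implicit = poly_kernel(x, z)      # computed in the original 2D space
k_explicit = dot(phi(x), phi(z))    # computed in the 6D feature space

print(k_implicit, k_explicit)       # equal up to floating-point rounding
print(rbf_kernel(x, z))
```

The kernel needs one 2D dot product, while the explicit route builds two 6D vectors first. For the RBF kernel the corresponding feature space is infinite-dimensional, so only the kernel route is practical.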

Q4. What is the role of support vectors in SVM? Explain with an example.

Support vectors are the data points that lie closest to the decision boundary in a Support Vector Machine (SVM) model. They play a crucial role in defining the position and orientation of the hyperplane that separates different classes.

**Role of Support Vectors:**

1. **Defining the Margin:** Support vectors are the points that lie exactly on the margin planes and, in a soft-margin SVM, also the points that fall inside the margin or on the wrong side of the hyperplane. They determine the maximum-margin hyperplane.
2. **Influencing the Model:** The hyperplane depends only on the support vectors. Removing or moving a support vector can change the decision boundary, whereas removing any other training point leaves it unchanged.

**Example:**

In a 2D classification problem where two classes are separated by a line, the support vectors are the data points closest to the line. When the data is linearly separable, these points are correctly classified and sit exactly on the edge of the margin. They are critical in determining the position of the line (hyperplane). The distance between these support vectors and the line defines the margin, and the SVM aims to maximize this margin.

In summary, support vectors are essential for constructing the optimal decision boundary in SVMs and directly influence the model's performance.
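The 2D example can be made concrete with a hand-picked dataset. In the sketch below the hyperplane is stated rather than trained: for these four hypothetical points, the line \(x_1 = 1\) (that is, `w = (1, 0)`, `b = -1`) is assumed to be the maximum-margin separator, since it is the perpendicular bisector of the closest pair of opposite-class points. Support vectors are then the points whose functional margin \(y_i (\mathbf{w}^T \mathbf{x}_i + b)\) equals 1.

```python
import math


def functional_margin(w, b, x, y):
    """Return y * (w^T x + b); equals 1 for points on a margin plane."""
    return y * (sum(wj * xj for wj, xj in zip(w, x)) + b)


# Hypothetical, linearly separable toy data.
X = [(0.0, 0.0), (-1.0, 0.0), (2.0, 0.0), (3.0, 1.0)]
y = [-1, -1, 1, 1]

# Assumed maximum-margin solution for this data (not the output of a solver).
w, b = (1.0, 0.0), -1.0

margins = [functional_margin(w, b, xi, yi) for xi, yi in zip(X, y)]

# Points sitting exactly on a margin plane are the support vectors.
support_vectors = [xi for xi, m in zip(X, margins) if abs(m - 1.0) < 1e-9]

# Total width of the margin band is 2 / ||w||.
margin_width = 2.0 / math.sqrt(sum(wj * wj for wj in w))

print("functional margins:", margins)
print("support vectors:", support_vectors)
print("margin width:", margin_width)
```

The points `(-1, 0)` and `(3, 1)` have functional margin 2, so they lie strictly outside the margin; deleting them would leave this hyperplane unchanged.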

Q5. Illustrate, with examples and graphs, the hyperplane, marginal plane, soft margin and hard margin in SVM.

Here's a brief illustration of key concepts in SVM, with a description of each:

1. **Hyperplane:**
   - **Definition:** The decision boundary that separates different classes in the feature space.
   - **Example:** In a 2D space, a hyperplane is a line that separates two classes.
   - **Graph:** A line (in 2D) or a plane (in 3D) that divides the feature space into two regions.

2. **Margin Plane:**
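   - **Definition:** The two planes parallel to the hyperplane, one on each side, that bound the margin. With the hyperplane written as \(\mathbf{w}^T \mathbf{x} + b = 0\), they are \(\mathbf{w}^T \mathbf{x} + b = +1\) and \(\mathbf{w}^T \mathbf{x} + b = -1\).
   - **Example:** For a linear SVM, there are two margin planes (one for each class), each at distance \(1 / \| \mathbf{w} \|\) from the hyperplane, so the margin is \(2 / \| \mathbf{w} \|\) wide.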
   - **Graph:** Lines (or planes) parallel to the hyperplane, one on each side, marking the boundary of the margin.

3. **Soft Margin:**
   - **Definition:** Allows some margin violations and misclassifications, at a cost controlled by \(C\), so that the model can be fit to data that is not linearly separable.
   - **Example:** A soft margin SVM will tolerate some data points lying inside the margin or on the wrong side of the hyperplane.
   - **Graph:** The hyperplane and its two margin planes, with a few data points inside the margin band or on the wrong side of the hyperplane.

4. **Hard Margin:**
   - **Definition:** Enforces that all data points are correctly classified with no margin violations. It has a solution only when the data is perfectly linearly separable.
   - **Example:** All data points lie on or outside the margin planes, and no points are allowed to be within the margin.
   - **Graph:** The hyperplane and margin planes are positioned so that all data points are correctly classified, with the band between the margin planes empty.

These illustrations help to visualize how SVMs separate classes with different approaches depending on whether the data is linearly separable or not.
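In place of a plot, the difference between the two margins can be shown by computing where each point falls relative to a fixed hyperplane. The sketch below uses the slack \(\xi_i = \max(0, 1 - y_i (\mathbf{w}^T \mathbf{x}_i + b))\) to label each point. The data, the hyperplane `w`, `b`, and the helper names are hypothetical and chosen for illustration only.

```python
def slack(w, b, x, y):
    """Smallest slack xi >= 0 with y (w^T x + b) >= 1 - xi."""
    return max(0.0, 1.0 - y * (sum(wj * xj for wj, xj in zip(w, x)) + b))


def region(xi):
    """Describe a point's position from its slack value."""
    if xi == 0.0:
        return "on or outside the margin"
    if xi < 1.0:
        return "inside the margin, correct side"
    return "on the hyperplane or misclassified"


# Hypothetical hyperplane x1 = 1 and toy points (x, label).
w, b = (1.0, 0.0), -1.0
points = [((3.0, 0.0), 1), ((1.5, 0.0), 1), ((0.5, 0.0), 1), ((0.0, 0.0), -1)]

slacks = [slack(w, b, x, y) for x, y in points]
labels = [region(xi) for xi in slacks]

for (x, y), xi, label in zip(points, slacks, labels):
    print(x, y, xi, label)

# A hard margin accepts this (w, b) only if no point needs any slack.
hard_margin_ok = all(xi == 0.0 for xi in slacks)
print("satisfies hard-margin constraints:", hard_margin_ok)
```

For this particular `w` and `b`, two points need slack, so the hard-margin constraints are violated, while a soft margin would accept the same hyperplane and charge \(C\) times the total slack.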