### Q1. What is Min-Max scaling, and how is it used in data preprocessing? Provide an example to illustrate its application.

**Min-Max scaling** is a normalization technique used in data preprocessing to rescale each feature to a fixed range, typically [0, 1]. Putting features on a common scale keeps those with large numeric ranges from dominating the others, which matters most for algorithms that rely on distance calculations, such as k-nearest neighbors and k-means.

**How it works**: The Min-Max scaling formula is given by:

\[ 
X' = \frac{X - X_{\text{min}}}{X_{\text{max}} - X_{\text{min}}} 
\]

where \(X\) is the original value, \(X_{\text{min}}\) is the minimum value of the feature, and \(X_{\text{max}}\) is the maximum value of the feature.

**Example**: For a dataset with values [1, 5, 10, 15, 20]:

1. Minimum value \(X_{\text{min}} = 1\)
2. Maximum value \(X_{\text{max}} = 20\)

Transforming the value 10:

\[ 
X' = \frac{10 - 1}{20 - 1} = \frac{9}{19} \approx 0.474 
\]
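
The calculation above can be reproduced in a few lines of NumPy. This is a minimal sketch: the function name `min_max_scale` and its handling of a constant feature (returning zeros) are assumptions made for illustration, not part of the formula itself.

```python
import numpy as np

def min_max_scale(values):
    """Rescale a 1-D sequence to [0, 1] using X' = (X - min) / (max - min)."""
    x = np.asarray(values, dtype=float)
    x_min, x_max = x.min(), x.max()
    if x_max == x_min:
        # A constant feature has no spread; returning zeros is an assumed convention.
        return np.zeros_like(x)
    return (x - x_min) / (x_max - x_min)

scaled = min_max_scale([1, 5, 10, 15, 20])
print(np.round(scaled, 3))  # the value 10 maps to about 0.474
```

The smallest value always maps to 0 and the largest to 1; every other value lands in between in proportion to its distance from the minimum.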

### Q2. What is the Unit Vector technique in feature scaling, and how does it differ from Min-Max scaling? Provide an example to illustrate its application.

The **Unit Vector technique** (also known as vector normalization or scaling to unit length) rescales a feature vector, typically one sample (one row of the dataset), so that it has a length (or magnitude) of 1. This is done by dividing each component of the vector by the vector's Euclidean norm (L2 norm).

**How it works**: The formula for unit vector scaling is:

\[ 
X' = \frac{X}{\|X\|} 
\]

where \(\|X\| = \sqrt{\sum_i X_i^2}\) is the Euclidean norm of the feature vector, which must be nonzero.

**Difference from Min-Max Scaling**: Min-Max scaling maps each feature to a fixed range (e.g., [0, 1]) using that feature's minimum and maximum. The Unit Vector technique instead rescales a whole vector by its own norm, which preserves its direction (the relative proportions of its components) while fixing its length at 1.

**Example**: For a feature vector [3, 4]:

1. Calculate the Euclidean norm:

\[ 
\|X\| = \sqrt{3^2 + 4^2} = \sqrt{9 + 16} = 5 
\]

2. Transforming the values:

\[ 
X' = \left[\frac{3}{5}, \frac{4}{5}\right] = [0.6, 0.8] 
\]
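
The same result can be checked numerically. The sketch below assumes NumPy; the helper name `to_unit_vector` is hypothetical, and leaving a zero vector unchanged is an assumed convention, since a zero vector has no direction to preserve.

```python
import numpy as np

def to_unit_vector(vector):
    """Divide a vector by its Euclidean (L2) norm so that its length becomes 1."""
    x = np.asarray(vector, dtype=float)
    norm = np.linalg.norm(x)  # sqrt of the sum of squared components
    if norm == 0:
        # Assumed convention: a zero vector is returned unchanged.
        return x.copy()
    return x / norm

unit = to_unit_vector([3, 4])
print(unit)                  # [0.6 0.8]
print(np.linalg.norm(unit))  # length of the rescaled vector
```

By contrast, Min-Max scaling the same two values would give [0, 1], because it uses only the minimum and maximum and discards the 3:4 proportion.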

### Q3. What is PCA (Principal Component Analysis), and how is it used in dimensionality reduction? Provide an example to illustrate its application.

**Principal Component Analysis (PCA)** is a statistical technique for dimensionality reduction that preserves as much variance as possible. PCA transforms the original features into a new set of orthogonal, uncorrelated features called principal components, ordered so that each one captures as much of the remaining variance as possible.

**How it works**:
1. Standardize the dataset.
2. Compute the covariance matrix.
3. Calculate the eigenvalues and eigenvectors of the covariance matrix.
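4. Sort the eigenvalues in descending order and select the eigenvectors belonging to the k largest; these eigenvectors are the principal components.
5. Project the standardized data onto the selected components to obtain the reduced dataset.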

**Example**: Given a dataset with two features (height and weight), PCA can replace them with a single principal component. Because height and weight are usually strongly correlated, the first component captures most of the total variance, so the dimensionality drops from two to one with little loss of information.
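
The procedure described above can be sketched directly in NumPy. The function name `pca_eig` is hypothetical, the height and weight values are made up for illustration, and the sketch assumes no feature is constant (otherwise standardization would divide by zero).

```python
import numpy as np

def pca_eig(data, k):
    """Project `data` (n_samples x n_features) onto its top-k principal components."""
    x = np.asarray(data, dtype=float)
    # Standardize each feature to zero mean and unit variance.
    z = (x - x.mean(axis=0)) / x.std(axis=0, ddof=1)
    # Covariance matrix of the standardized features (columns are features).
    cov = np.cov(z, rowvar=False)
    # eigh is meant for symmetric matrices and returns eigenvalues in ascending order.
    eigenvalues, eigenvectors = np.linalg.eigh(cov)
    # Reorder to descending and keep the eigenvectors of the k largest eigenvalues.
    order = np.argsort(eigenvalues)[::-1]
    eigenvalues, eigenvectors = eigenvalues[order], eigenvectors[:, order]
    components = eigenvectors[:, :k]
    explained_ratio = eigenvalues[:k] / eigenvalues.sum()
    # Project the standardized data onto the selected components.
    return z @ components, explained_ratio

# Made-up height (cm) and weight (kg) values, for illustration only.
height_weight = np.array([
    [150.0, 50.0], [160.0, 58.0], [165.0, 63.0],
    [170.0, 68.0], [180.0, 80.0], [185.0, 84.0],
])
scores, ratio = pca_eig(height_weight, k=1)
print(scores.shape)  # (6, 1): two features reduced to one component
print(ratio)         # share of total variance kept by that component
```

Because the two made-up columns rise together almost linearly, the single retained component should account for nearly all of the variance.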

### Q4. What is the relationship between PCA and Feature Extraction, and how can PCA be used for Feature Extraction? Provide an example to illustrate this concept.

**Relationship**: PCA is a feature extraction method. Rather than selecting a subset of the original features, it constructs new features (the principal components) as linear combinations of the originals, ordered by how much of the data's variance each one captures.

**Using PCA for Feature Extraction**: 
- By applying PCA, we can reduce the number of features in the dataset while retaining the most important information.
- The principal components obtained from PCA can be used as new features for modeling.

**Example**: In a dataset with features such as age, income, and spending score, PCA could extract two principal components that together capture most of the variance. Instead of using all the original features, we can use these two components for subsequent modeling, which simplifies the model and can reduce the risk of overfitting.

### Q5. You are working on a project to build a recommendation system for a food delivery service. The dataset contains features such as price, rating, and delivery time. Explain how you would use Min-Max scaling to preprocess the data.

In building a recommendation system for a food delivery service, Min-Max scaling would be applied as follows:

1. **Identify Features**: Determine the numerical features to scale, such as price, rating, and delivery time.
2. **Calculate Min-Max**: For each feature, calculate the minimum and maximum values.
3. **Apply Scaling**: Transform each feature using the Min-Max scaling formula so that all features fall within the same range (e.g., [0, 1]). This keeps features with larger numeric scales from dominating the others.
4. **Build the Model**: Use the scaled features to train the recommendation model, improving the effectiveness of distance-based algorithms.
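
Steps 2 and 3 can be sketched column by column in NumPy. The rows and values below are made up for illustration, and the column order (price, rating, delivery time in minutes) is an assumption. The sketch also reuses the stored minimum and range for a new row, which is the usual practice so that new data is scaled consistently with the data the model was built on.

```python
import numpy as np

# Hypothetical rows: [price, rating, delivery_time_minutes]; all values are made up.
orders = np.array([
    [8.5, 4.2, 25.0],
    [15.0, 3.8, 40.0],
    [22.0, 4.9, 55.0],
    [12.0, 4.5, 30.0],
])

# Per-feature minimum and maximum: one value per column.
col_min = orders.min(axis=0)
col_max = orders.max(axis=0)

# Guard against constant columns, whose range would otherwise be zero.
col_range = np.where(col_max > col_min, col_max - col_min, 1.0)

# Apply the Min-Max formula to every column at once via broadcasting.
scaled_orders = (orders - col_min) / col_range

# Scale a new row with the stored statistics instead of recomputing them.
new_order = np.array([[18.0, 4.0, 35.0]])
scaled_new = (new_order - col_min) / col_range

print(scaled_orders.min(axis=0), scaled_orders.max(axis=0))
print(scaled_new)
```

A new row whose values fall outside the stored minimum or maximum will scale to slightly below 0 or above 1; whether to clip such values is a design choice left open here.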

### Q6. You are working on a project to build a model to predict stock prices. The dataset contains many features, such as company financial data and market trends. Explain how you would use PCA to reduce the dimensionality of the dataset.

To use PCA for reducing dimensionality in a stock price prediction project:

1. **Preprocess Data**: Standardize the features to have zero mean and unit variance, ensuring that PCA is not biased toward features with larger scales.
2. **Compute Covariance Matrix**: Calculate the covariance matrix of the standardized data to understand feature relationships.
3. **Perform PCA**: Extract the eigenvalues and eigenvectors, sort them by eigenvalue in descending order, and select the smallest number of components k that together explain a target share of the variance (e.g., 95% of the total variance).
4. **Transform Dataset**: Project the standardized dataset onto the selected principal components, reducing the number of features while retaining critical information.
5. **Build the Model**: Use the transformed dataset with reduced dimensions for training the prediction model, improving computational efficiency and potentially enhancing performance.
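
One way to sketch this pipeline is with a singular value decomposition (SVD) of the standardized data, which yields the same components as the eigen-decomposition of its covariance matrix. The function name `pca_by_variance` is hypothetical, the input is synthetic random data standing in for real financial features, and the sketch assumes no feature is constant.

```python
import numpy as np

def pca_by_variance(features, threshold=0.95):
    """Keep the fewest components whose cumulative explained variance reaches `threshold`."""
    x = np.asarray(features, dtype=float)
    # Standardize to zero mean and unit variance.
    z = (x - x.mean(axis=0)) / x.std(axis=0, ddof=1)
    # Rows of vt are the principal directions; squared singular values are
    # proportional to the covariance eigenvalues, already sorted descending.
    _, singular_values, vt = np.linalg.svd(z, full_matrices=False)
    explained = singular_values**2 / np.sum(singular_values**2)
    cumulative = np.cumsum(explained)
    # First index where the cumulative share reaches the threshold, as a count.
    k = int(np.searchsorted(cumulative, threshold)) + 1
    k = min(k, len(cumulative))  # guard against floating-point shortfall at 100%
    # Project the standardized data onto the top-k components.
    return z @ vt[:k].T, cumulative, k

# Synthetic stand-in data: 10 features driven by 3 hidden factors plus small noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 3))
mixing = rng.normal(size=(3, 10))
features = latent @ mixing + 0.01 * rng.normal(size=(200, 10))

reduced, cumulative, k = pca_by_variance(features, threshold=0.95)
print(k, reduced.shape)
print(np.round(cumulative, 3))
```

Because the synthetic features are driven by only a few hidden factors, far fewer than ten components should be needed to reach the threshold; real market data may behave quite differently.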

### Q7. For a dataset containing the following values: [1, 5, 10, 15, 20], perform Min-Max scaling to transform the values to a range of -1 to 1.

To perform Min-Max scaling to a range of -1 to 1:

1. **Identify Min and Max**: 
   - \(X_{\text{min}} = 1\)
   - \(X_{\text{max}} = 20\)

2. **Apply Transformation**:
The formula for scaling to a range [a, b] is:

\[ 
X' = a + \frac{(X - X_{\text{min}})(b - a)}{X_{\text{max}} - X_{\text{min}}} 
\]

For the range [-1, 1], we set \(a = -1\) and \(b = 1\):

\[ 
X' = -1 + \frac{(X - 1)(1 - (-1))}{20 - 1} 
\]

Calculating for each value:

- For 1: 
\[ 
X' = -1 + \frac{(1 - 1)(2)}{19} = -1 
\]
- For 5: 
\[ 
X' = -1 + \frac{(5 - 1)(2)}{19} \approx -0.579 
\]
- For 10: 
\[ 
X' = -1 + \frac{(10 - 1)(2)}{19} \approx -0.053 
\]
- For 15: 
\[ 
X' = -1 + \frac{(15 - 1)(2)}{19} \approx 0.474 
\]
- For 20: 
\[ 
X' = -1 + \frac{(20 - 1)(2)}{19} = 1 
\]
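
The general formula for a range [a, b] can be checked numerically. This is a minimal sketch assuming NumPy; the function name `min_max_scale_to_range` is hypothetical, and raising an error for a constant feature is an assumed design choice.

```python
import numpy as np

def min_max_scale_to_range(values, a=-1.0, b=1.0):
    """Rescale values to [a, b] using X' = a + (X - min)(b - a) / (max - min)."""
    x = np.asarray(values, dtype=float)
    x_min, x_max = x.min(), x.max()
    if x_max == x_min:
        # Assumed behavior: a constant feature cannot be spread over a range.
        raise ValueError("cannot scale a constant feature")
    return a + (x - x_min) * (b - a) / (x_max - x_min)

scaled = min_max_scale_to_range([1, 5, 10, 15, 20], a=-1.0, b=1.0)
print(np.round(scaled, 3))
```

Setting `a=0.0` and `b=1.0` reduces this to the standard [0, 1] form of Min-Max scaling.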

### Q8. For a dataset containing the following features: [height, weight, age, gender, blood pressure], perform Feature Extraction using PCA. How many principal components would you choose to retain, and why?

In performing Feature Extraction using PCA on the dataset [height, weight, age, gender, blood pressure]:

1. **Preprocessing**: Convert categorical features (gender) into numerical format (e.g., one-hot encoding), then standardize the numerical features (height, weight, age, and blood pressure) so that none dominates simply because of its units.

2. **PCA Execution**: After applying PCA, calculate the explained variance ratio for each principal component.

3. **Choosing Components**: Typically, you would choose enough principal components to explain a substantial amount of variance (e.g., 90-95%). This decision can be based on a cumulative explained variance plot.
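
The selection rule in step 3 can be written compactly. The symbols here are introduced for illustration only: \(\lambda_i\) is the i-th largest eigenvalue of the covariance matrix, \(p\) is the number of features after encoding, and \(\tau\) is the chosen variance threshold (for example 0.90 or 0.95).

\[ 
k = \min\left\{ m : \frac{\sum_{i=1}^{m} \lambda_i}{\sum_{j=1}^{p} \lambda_j} \ge \tau \right\} 
\]

In words, k is the smallest number of components whose eigenvalues add up to at least a fraction \(\tau\) of the total.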

**Conclusion**: The number of principal components to retain would depend on the cumulative explained variance. If the first three principal components explain 95% of the variance, I would retain those three for further analysis, as they capture the most significant patterns in the data while reducing dimensionality.