<span style=color:red;font-size:55px>ASSIGNMENT</span>

<span style=color:pink;font-size:50px>FEATURE ENGINEERING-2</span>

## Q1. What is Min-Max scaling, and how is it used in data preprocessing? Provide an example to illustrate its application.

## Ans-

## Min-Max Scaling in Data Preprocessing

Min-Max scaling is a data normalization technique used to rescale numerical features to a fixed range, typically between 0 and 1. It transforms the data in such a way that the minimum value of the feature becomes 0, the maximum value becomes 1, and all other values are scaled proportionally within this range.

### How Min-Max Scaling Works

- **Formula**:
  - For each feature \( x \):
    - \( x_{\text{scaled}} = \frac{{x - \text{min}(x)}}{{\text{max}(x) - \text{min}(x)}} \)

- **Range**:
  - Scaled values are between 0 and 1.
  - If \( x = \text{min}(x) \), then \( x_{\text{scaled}} = 0 \).
  - If \( x = \text{max}(x) \), then \( x_{\text{scaled}} = 1 \).

### Example Application

Suppose we have a dataset containing the following numerical feature representing the age of individuals:



```python
from sklearn.preprocessing import MinMaxScaler
import numpy as np

# Sample dataset
data = np.array([[25], [30], [40], [35], [28]])

# Initialize MinMaxScaler
scaler = MinMaxScaler()

# Fit scaler to the data and transform
scaled_data = scaler.fit_transform(data)

# Print original and scaled data
print("Original Data:")
print(data)
print("\nScaled Data:")
print(scaled_data)
```

```text
Original Data:
[[25]
 [30]
 [40]
 [35]
 [28]]

Scaled Data:
[[0.        ]
 [0.33333333]
 [1.        ]
 [0.66666667]
 [0.2       ]]
```
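
As a cross-check on the formula, the same scaling can be written by hand in NumPy. This is a minimal sketch, not scikit-learn's implementation: the helper name `min_max_scale` is hypothetical, and mapping a constant column to 0 is an assumption made here to avoid division by zero.

```python
import numpy as np


def min_max_scale(values: np.ndarray) -> np.ndarray:
    """Rescale each column to [0, 1] using (x - min) / (max - min)."""
    values = np.asarray(values, dtype=float)
    col_min = values.min(axis=0)
    col_max = values.max(axis=0)
    span = col_max - col_min
    # Assumption: a constant column (span == 0) is mapped to 0
    # instead of dividing by zero.
    safe_span = np.where(span == 0, 1.0, span)
    return (values - col_min) / safe_span


# Same ages as in the example above
ages = np.array([[25], [30], [40], [35], [28]])
scaled_ages = min_max_scale(ages)
print(scaled_ages.ravel())
```

For example, the age 28 maps to (28 - 25) / (40 - 25) = 0.2, matching the scikit-learn output.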


## Q2. What is the Unit Vector technique in feature scaling, and how does it differ from Min-Max scaling? Provide an example to illustrate its application.

## Ans-

## Unit Vector Scaling in Feature Scaling

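Unit Vector scaling, also known as vector normalization, rescales each sample (row) of the dataset so that its Euclidean (L2) norm equals 1. Only the length of the sample vector changes; its direction is preserved.

### How Unit Vector Scaling Works

- **Formula**:
  - For each sample vector \( x \):
    - \( x_{\text{scaled}} = \frac{x}{\|x\|_2} \), where \( \|x\|_2 = \sqrt{\sum_i x_i^2} \)

- **Normalization**:
  - After scaling, the Euclidean norm (magnitude) of each sample is 1.

- **Difference from Min-Max Scaling**:
  - Min-Max scaling works per feature (column) and maps each feature to a fixed range such as [0, 1]. Unit Vector scaling works per sample (row) and fixes the length of each sample to 1, so individual values are not pinned to a fixed minimum and maximum.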

### Example Application

Suppose we have a dataset containing two numerical features representing the length and width of objects:



```python
import numpy as np

# Sample dataset
data = np.array([[3, 4], [1, 2], [4, 6]])

# Calculate Euclidean norms for each sample
norms = np.linalg.norm(data, axis=1, keepdims=True)

# Apply Unit Vector scaling
scaled_data = data / norms

# Print original and scaled data
print("Original Data:")
print(data)
print("\nScaled Data:")
print(scaled_data)
```

```text
Original Data:
[[3 4]
 [1 2]
 [4 6]]

Scaled Data:
[[0.6        0.8       ]
 [0.4472136  0.89442719]
 [0.5547002  0.83205029]]
```
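
To make the contrast with Min-Max scaling concrete, the sketch below applies both techniques to the same three samples. It is only an illustration in plain NumPy: Min-Max scaling is computed along columns (`axis=0`), Unit Vector scaling along rows (`axis=1`).

```python
import numpy as np

# Same length/width samples as in the example above
data = np.array([[3, 4], [1, 2], [4, 6]], dtype=float)

# Min-Max scaling: statistics are taken per feature (column)
col_min = data.min(axis=0)
col_max = data.max(axis=0)
min_max_scaled = (data - col_min) / (col_max - col_min)

# Unit Vector scaling: the norm is taken per sample (row)
row_norms = np.linalg.norm(data, axis=1, keepdims=True)
unit_scaled = data / row_norms

print("Min-Max scaled (each column spans 0 to 1):")
print(min_max_scaled)
print("Row norms after Unit Vector scaling:")
print(np.linalg.norm(unit_scaled, axis=1))
```

After Min-Max scaling, every column has minimum 0 and maximum 1; after Unit Vector scaling, every row has length 1, while the columns have no fixed range.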


## Q3. What is PCA (Principal Component Analysis), and how is it used in dimensionality reduction? Provide an example to illustrate its application.

## Ans-

## PCA (Principal Component Analysis) for Dimensionality Reduction

PCA, or Principal Component Analysis, is a popular technique used for dimensionality reduction in machine learning and data analysis. It transforms high-dimensional data into a lower-dimensional representation by identifying the directions, or principal components, that capture the maximum variance in the data. These principal components are orthogonal to each other and are ordered by the amount of variance they explain.

### How PCA Works

1. **Compute Covariance Matrix**:
   - Center the data by subtracting each feature's mean, then calculate the covariance matrix of the centered data.

2. **Eigenvalue Decomposition**:
   - Perform eigenvalue decomposition on the covariance matrix to obtain eigenvectors and eigenvalues. Equivalently, apply Singular Value Decomposition (SVD) directly to the centered data matrix.

3. **Select Principal Components**:
   - Sort the eigenvectors by their corresponding eigenvalues in descending order.
   - Select the top \( k \) eigenvectors (principal components) that explain the most variance in the data, where \( k \) is the desired dimensionality of the reduced data.

4. **Transform Data**:
   - Project the original data onto the selected principal components to obtain the lower-dimensional representation.
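
The four steps above can be sketched in plain NumPy. This is a minimal illustration rather than a production implementation: the function name `pca_project` is hypothetical, and the demo data is synthetic, generated only to exercise the code.

```python
import numpy as np


def pca_project(X: np.ndarray, k: int):
    """Project X onto its top-k principal components."""
    X = np.asarray(X, dtype=float)
    # Step 1: center the data and compute the covariance matrix
    X_centered = X - X.mean(axis=0)
    cov = np.cov(X_centered, rowvar=False)
    # Step 2: eigendecomposition (eigh is meant for symmetric matrices)
    eigenvalues, eigenvectors = np.linalg.eigh(cov)
    # Step 3: sort by eigenvalue, descending, and keep the top k
    order = np.argsort(eigenvalues)[::-1]
    eigenvalues = eigenvalues[order]
    eigenvectors = eigenvectors[:, order]
    components = eigenvectors[:, :k]
    # Step 4: project the centered data onto the selected components
    projected = X_centered @ components
    explained_ratio = eigenvalues[:k] / eigenvalues.sum()
    return projected, components, explained_ratio


# Synthetic demo data: the second column is nearly a multiple of the first
rng = np.random.default_rng(0)
base = rng.normal(size=(100, 1))
X_demo = np.hstack([
    base,
    2 * base + 0.1 * rng.normal(size=(100, 1)),
    rng.normal(size=(100, 1)),
])

projected, components, ratio = pca_project(X_demo, k=2)
print(projected.shape)
print(ratio)
```

The columns of `components` are orthonormal, and `ratio` gives the share of total variance explained by each retained component.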

### Example Application

Suppose we have a dataset containing two numerical features representing the height and weight of individuals:




To apply PCA to this dataset:

1. **Compute Covariance Matrix**:
   - Calculate the covariance matrix of the original data.

2. **Eigenvalue Decomposition**:
   - Perform eigenvalue decomposition on the covariance matrix to obtain eigenvectors and eigenvalues.

3. **Select Principal Components**:
   - Sort the eigenvectors by eigenvalue. The eigenvector with the highest eigenvalue is the first principal component, and the one with the second-highest eigenvalue is the second. To reduce the two features to one dimension, keep only the first principal component.

4. **Transform Data**:
   - Project the original data onto the selected principal components to obtain the lower-dimensional representation.
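
A short scikit-learn sketch of this two-feature case follows. The height and weight values are hypothetical, invented only for illustration. Standardizing first is an added assumption (the steps above do not mention it), made because the two features are in different units.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Hypothetical values for illustration only: [height_cm, weight_kg]
height_weight = np.array([
    [150.0, 50.0],
    [160.0, 58.0],
    [170.0, 66.0],
    [180.0, 77.0],
    [190.0, 85.0],
])

# Assumption: standardize so neither unit dominates the covariance
standardized = StandardScaler().fit_transform(height_weight)

# Keep only the first principal component (2 features -> 1)
pca = PCA(n_components=1)
reduced = pca.fit_transform(standardized)

print(reduced.shape)
print(pca.explained_variance_ratio_)
```

Because height and weight rise together in this made-up sample, a single component is expected to capture nearly all of the variance.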

### Benefits of PCA for Dimensionality Reduction

- **Dimensionality Reduction**: PCA helps reduce the number of features (dimensions) in the data while preserving as much variance as possible.
- **Feature Interpretability**: Each principal component is a linear combination of the original features, so its weights (loadings) can indicate which features drive it, providing insights into the underlying structure of the data.
- **Computational Efficiency**: PCA simplifies the data representation, making it computationally more efficient for subsequent analysis and modeling.

PCA is widely used in various fields, including image processing, genetics, finance, and natural language processing, for reducing the dimensionality of high-dimensional datasets while retaining essential information.


## Q4. What is the relationship between PCA and Feature Extraction, and how can PCA be used for Feature Extraction? Provide an example to illustrate this concept.

## Ans-

## Relationship between PCA and Feature Extraction

PCA (Principal Component Analysis) can be used for feature extraction, especially in cases where the original features are highly correlated or redundant. Feature extraction involves transforming the original features into a new set of features (or components) that capture the most important information in the data.

### How PCA Works for Feature Extraction

1. **Dimensionality Reduction**:
   - PCA identifies the directions (principal components) in the feature space that capture the maximum variance in the data.

2. **Feature Transformation**:
   - The original features are projected onto the principal components, effectively transforming the data into a lower-dimensional space.

3. **Feature Extraction**:
   - The principal components themselves serve as the extracted features, representing combinations of the original features that explain the most variance in the data.

### Example Application

Suppose we have a dataset containing three numerical features representing different aspects of a product:




To apply PCA for feature extraction to this dataset:

1. **Standardize Features**:
   - Standardize the features by subtracting the mean and dividing by the standard deviation.

2. **Perform PCA**:
   - Use PCA to identify the principal components that capture the most variance in the standardized data.

3. **Select Principal Components**:
   - Select the top \( k \) principal components that explain the desired amount of variance (e.g., 95%).

4. **Transform Data**:
   - Project the standardized data onto the selected principal components to obtain the lower-dimensional representation.
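
These steps can be chained in a scikit-learn pipeline, sketched below. The three product features (`size`, `weight`, `price`) are synthetic and hypothetical, generated only so the example runs. Passing a float to `n_components` asks `PCA` to keep the smallest number of components that reaches that share of variance; `svd_solver="full"` is set explicitly for that mode.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic, hypothetical product features for illustration only
rng = np.random.default_rng(42)
size = rng.normal(loc=10.0, scale=2.0, size=200)
weight = 3.0 * size + rng.normal(scale=0.5, size=200)  # nearly collinear with size
price = rng.normal(loc=50.0, scale=10.0, size=200)     # unrelated to the others
products = np.column_stack([size, weight, price])

# Standardize, then keep enough components to explain 95% of the variance
extractor = make_pipeline(
    StandardScaler(),
    PCA(n_components=0.95, svd_solver="full"),
)
extracted = extractor.fit_transform(products)

pca_step = extractor.named_steps["pca"]
print(extracted.shape)
print(pca_step.n_components_)
print(pca_step.explained_variance_ratio_)
```

Since two of the three synthetic columns carry almost the same information, the extracted features should need fewer columns than the original data.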

### Benefits of PCA for Feature Extraction

- **Dimensionality Reduction**: PCA reduces the dimensionality of the data by transforming it into a smaller set of principal components.
- **Noise Reduction**: PCA can help remove noise and irrelevant information from the data, focusing on the most significant features.
- **Interpretability**: Each principal component is a linear combination of the original features, so its weights (loadings) can indicate which features drive it, providing insights into the underlying structure of the data.

PCA for feature extraction is widely used in various domains, including image processing, signal processing, and natural language processing, to extract meaningful features from high-dimensional datasets while reducing computational complexity.


## Q5. You are working on a project to build a recommendation system for a food delivery service. The dataset contains features such as price, rating, and delivery time. Explain how you would use Min-Max scaling to preprocess the data.

## Ans-

## Preprocessing Data for a Food Delivery Recommendation System Using Min-Max Scaling

When working on a project to build a recommendation system for a food delivery service, preprocessing the dataset is essential to ensure that the features are on a similar scale. Min-Max scaling is a common technique used to rescale numerical features to a fixed range, typically between 0 and 1. Here's how you could use Min-Max scaling to preprocess the data:

1. **Identify Numerical Features**:
   - Review the dataset to identify numerical features such as price, rating, and delivery time.

2. **Compute Min and Max Values**:
   - For each numerical feature, calculate the minimum and maximum values in the dataset. This step helps determine the range to which the features will be scaled.

3. **Apply Min-Max Scaling**:
   - For each numerical feature \( x \), apply the Min-Max scaling formula:
     - \( x_{\text{scaled}} = \frac{{x - \text{min}(x)}}{{\text{max}(x) - \text{min}(x)}} \)
   - This formula rescales each feature to a range between 0 and 1, where the minimum value becomes 0, and the maximum value becomes 1.

4. **Transform Data**:
   - Apply the Min-Max scaling transformation to all numerical features in the dataset.

5. **Normalized Dataset**:
   - After preprocessing, the dataset will contain the same features but with values scaled to a common range, making it suitable for modeling and analysis.

### Example:

Suppose we have a dataset with the following numerical features:

- Price: ranging from $5 to $20
- Rating: ranging from 3 to 5
- Delivery Time: ranging from 15 to 45 minutes

To preprocess this dataset using Min-Max scaling:

- Price (scaled) = \( \frac{{\text{Price} - 5}}{{20 - 5}} \)
- Rating (scaled) = \( \frac{{\text{Rating} - 3}}{{5 - 3}} \)
- Delivery Time (scaled) = \( \frac{{\text{Delivery Time} - 15}}{{45 - 15}} \)

After applying Min-Max scaling, all numerical features will be transformed to a range between 0 and 1, ensuring uniformity in scale across different features.
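
The three formulas above can be applied in one step with NumPy broadcasting. This is a minimal sketch: the three orders are hypothetical rows invented for illustration, and the minimum and maximum values are the ranges stated in the example.

```python
import numpy as np

# Hypothetical orders; columns: [price_usd, rating, delivery_minutes]
orders = np.array([
    [5.0, 3.0, 45.0],
    [12.5, 4.0, 30.0],
    [20.0, 5.0, 15.0],
])

# Ranges taken from the example above
feature_min = np.array([5.0, 3.0, 15.0])
feature_max = np.array([20.0, 5.0, 45.0])

# Broadcasting applies each column's own min and max
scaled_orders = (orders - feature_min) / (feature_max - feature_min)
print(scaled_orders)
```

The middle order sits halfway through every range, so all three of its scaled values are 0.5. Note that a shorter delivery time scales toward 0; whether to flip it (for example with `1 - scaled`) so that higher always means better is a modeling choice, not part of Min-Max scaling itself.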

By using Min-Max scaling to preprocess the data for the food delivery recommendation system, we ensure that the features are on a similar scale, which can improve the performance of machine learning models and recommendation algorithms.


## Q6. You are working on a project to build a model to predict stock prices. The dataset contains many features, such as company financial data and market trends. Explain how you would use PCA to reduce the dimensionality of the dataset.



## Ans-

## Using PCA for Dimensionality Reduction in Stock Price Prediction

When working on a project to predict stock prices, dealing with a dataset containing numerous features, including company financial data and market trends, can lead to high dimensionality. PCA (Principal Component Analysis) can be a valuable technique to reduce the dimensionality of such datasets while preserving essential information. Here's how you could use PCA for dimensionality reduction:

1. **Data Preprocessing**:
   - Standardize the dataset: It's crucial to standardize the features to have a mean of 0 and a standard deviation of 1 before applying PCA. This ensures that all features contribute equally to the analysis.

2. **Compute Covariance Matrix**:
   - Calculate the covariance matrix of the standardized dataset. The covariance matrix represents the relationships between different features, providing insights into their correlations.

3. **Perform PCA**:
   - Compute the eigenvectors and eigenvalues of the covariance matrix to obtain the principal components. These are the directions in the feature space that capture the maximum variance in the data.

4. **Select Principal Components**:
   - Determine the number of principal components to retain based on the desired level of variance explained. You can choose the number of principal components that collectively explain a significant portion of the total variance in the dataset (e.g., 95%).

5. **Transform Data**:
   - Project the standardized dataset onto the selected principal components. This transformation results in a lower-dimensional representation of the dataset, with fewer features capturing most of the variance.

### Example:

Suppose we have a dataset containing various features related to stock performance, such as earnings per share (EPS), price-to-earnings ratio (P/E), market capitalization, and industry sector indicators.

To apply PCA for dimensionality reduction to this dataset:

- Standardize the features to have a mean of 0 and a standard deviation of 1.
- Compute the covariance matrix of the standardized dataset.
- Compute the eigenvectors and eigenvalues of the covariance matrix to obtain the principal components.
- Select the top \( k \) principal components that collectively explain a significant portion of the total variance (e.g., 95%).
- Project the standardized dataset onto the selected principal components to obtain the lower-dimensional representation.

After applying PCA, the dataset is represented by a smaller number of principal components that capture most of the variance in the original features, making it more manageable for modeling.

By using PCA for dimensionality reduction in stock price prediction, you can handle high-dimensional datasets more effectively and reduce redundancy among correlated features, which can improve the efficiency of machine learning models.
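
The workflow can be sketched end to end in NumPy. No real stock data is used here: the feature matrix is a synthetic stand-in (12 columns driven by 3 hidden factors plus noise), and all variable names are hypothetical.

```python
import numpy as np

# Synthetic stand-in for a wide stock-feature table (not real market data)
rng = np.random.default_rng(7)
n_samples, n_features, n_factors = 300, 12, 3
factors = rng.normal(size=(n_samples, n_factors))
loadings = rng.normal(size=(n_factors, n_features))
features = factors @ loadings + 0.1 * rng.normal(size=(n_samples, n_features))

# 1. Standardize each feature to mean 0 and standard deviation 1
standardized = (features - features.mean(axis=0)) / features.std(axis=0)

# 2. Covariance matrix of the standardized data
cov = np.cov(standardized, rowvar=False)

# 3. Eigendecomposition, sorted by eigenvalue in descending order
eigenvalues, eigenvectors = np.linalg.eigh(cov)
order = np.argsort(eigenvalues)[::-1]
eigenvalues, eigenvectors = eigenvalues[order], eigenvectors[:, order]

# 4. Smallest k whose cumulative explained variance reaches 95%
cumulative = np.cumsum(eigenvalues) / eigenvalues.sum()
k = int(np.searchsorted(cumulative, 0.95) + 1)

# 5. Project the standardized data onto the top-k components
reduced = standardized @ eigenvectors[:, :k]

print("Components kept:", k)
print("Reduced shape:", reduced.shape)
```

In a real forecasting setting, the means, standard deviations, and components would typically be estimated on the training period only and then reused on later data, so that information from the future does not leak into the model.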


## Q7. For a dataset containing the following values: [1, 5, 10, 15, 20], perform Min-Max scaling to transform the values to a range of -1 to 1.

## Ans-

## Min-Max Scaling to Transform Values to a Range of -1 to 1

Given the dataset: [1, 5, 10, 15, 20]

### Min-Max Scaling Formula:

To scale the values to a range of -1 to 1, we can use the following formula:

\[ x_{\text{scaled}} = \frac{{x - \text{min}(x)}}{{\text{max}(x) - \text{min}(x)}} \times (b - a) + a \]

Where:
- \( x \) is the original value.
- \( \text{min}(x) \) is the minimum value in the dataset.
- \( \text{max}(x) \) is the maximum value in the dataset.
- \( a \) and \( b \) are the desired minimum and maximum values of the scaled range, respectively.

### Applying Min-Max Scaling:

Given the dataset: [1, 5, 10, 15, 20]

- \( \text{min}(x) = 1 \)
- \( \text{max}(x) = 20 \)
- \( a = -1 \)
- \( b = 1 \)

Using the Min-Max scaling formula, we can compute the scaled values as follows:

1. For \( x = 1 \):
   \[ x_{\text{scaled}} = \frac{{1 - 1}}{{20 - 1}} \times (1 - (-1)) + (-1) = 0 \times 2 - 1 = -1 \]

2. For \( x = 5 \):
   \[ x_{\text{scaled}} = \frac{{5 - 1}}{{20 - 1}} \times (1 - (-1)) + (-1) = \frac{4}{19} \times 2 - 1 \approx -0.5789 \]

3. For \( x = 10 \):
   \[ x_{\text{scaled}} = \frac{{10 - 1}}{{20 - 1}} \times (1 - (-1)) + (-1) = \frac{9}{19} \times 2 - 1 \approx -0.0526 \]

4. For \( x = 15 \):
   \[ x_{\text{scaled}} = \frac{{15 - 1}}{{20 - 1}} \times (1 - (-1)) + (-1) = \frac{14}{19} \times 2 - 1 \approx 0.4737 \]

5. For \( x = 20 \):
   \[ x_{\text{scaled}} = \frac{{20 - 1}}{{20 - 1}} \times (1 - (-1)) + (-1) = 1 \times 2 - 1 = 1 \]

### Scaled Values:

The scaled values of the dataset [1, 5, 10, 15, 20] in the range of -1 to 1 are approximately:

\[ [-1, -0.5789, -0.0526, 0.4737, 1] \]


```python
import numpy as np

# Given dataset
data = np.array([1, 5, 10, 15, 20])

# Define the desired range
a = -1
b = 1

# Perform Min-Max scaling
min_val = np.min(data)
max_val = np.max(data)
scaled_data = (data - min_val) / (max_val - min_val) * (b - a) + a

# Print the scaled values
print("Scaled Values:")
print(scaled_data)
```

```text
Scaled Values:
[-1.         -0.57894737 -0.05263158  0.47368421  1.        ]
```
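
The same result can be cross-checked with scikit-learn, whose `MinMaxScaler` accepts the target range through its `feature_range` parameter. This is a small sketch; the variable names are illustrative.

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

# MinMaxScaler expects a 2-D array: one column, five samples
values = np.array([1, 5, 10, 15, 20], dtype=float).reshape(-1, 1)

scaler = MinMaxScaler(feature_range=(-1, 1))
scaled = scaler.fit_transform(values).ravel()
print(np.round(scaled, 4))

# inverse_transform maps the scaled values back to the original ones
restored = scaler.inverse_transform(scaled.reshape(-1, 1)).ravel()
print(restored)
```

The scaled values agree with the NumPy output above, and `inverse_transform` recovers the original dataset.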


## Q8. For a dataset containing the following features: [height, weight, age, gender, blood pressure], perform Feature Extraction using PCA. How many principal components would you choose to retain, and why?

## Ans-

## Feature Extraction Using PCA for the Given Dataset

When performing feature extraction using PCA (Principal Component Analysis), it's essential to determine the number of principal components to retain. Here's how you can approach this for the given dataset containing features: [height, weight, age, gender, blood pressure].

### Steps for Feature Extraction Using PCA:

1. **Data Preprocessing**:
   - Standardize the dataset: Ensure that the features have a mean of 0 and a standard deviation of 1 before applying PCA.

2. **Perform PCA**:
   - Apply PCA to the standardized dataset to obtain the principal components.

3. **Determine the Number of Principal Components**:
   - Assess the cumulative explained variance ratio to decide how many principal components to retain. You can choose a threshold value (e.g., 95%) and select the minimum number of principal components that collectively explain this percentage of variance.




```python
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler
import numpy as np

# Given dataset features: [height, weight, age, gender, blood pressure]
data = np.array([
    [170, 68, 30, 1, 120],
    [165, 55, 25, 0, 110],
    [180, 75, 35, 1, 130],
    [160, 50, 40, 0, 115],
    # Add more data as needed
])

# Standardize the dataset
scaler = StandardScaler()
scaled_data = scaler.fit_transform(data)

# Perform PCA
pca = PCA()
pca.fit(scaled_data)

# Assess cumulative explained variance ratio
cumulative_variance_ratio = np.cumsum(pca.explained_variance_ratio_)

# Determine the number of principal components to retain
threshold_variance = 0.95  # Choose a threshold (e.g., 95%)
num_components = np.argmax(cumulative_variance_ratio >= threshold_variance) + 1

# Print the number of principal components to retain
print("Number of Principal Components to Retain:", num_components)
```

```text
Number of Principal Components to Retain: 2
```
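
To see why a given number of components is retained, it helps to print the variance explained by each component rather than only the final count. The sketch below does this in plain NumPy for the same four sample rows. It uses SVD on the standardized data, which is one common way to compute the explained variance ratios and is an assumption of this sketch rather than a description of scikit-learn's internals.

```python
import numpy as np

# Same four sample rows: [height, weight, age, gender, blood pressure]
sample = np.array([
    [170, 68, 30, 1, 120],
    [165, 55, 25, 0, 110],
    [180, 75, 35, 1, 130],
    [160, 50, 40, 0, 115],
], dtype=float)

# Standardize each feature (population standard deviation, ddof=0)
standardized = (sample - sample.mean(axis=0)) / sample.std(axis=0)

# Squared singular values are proportional to the variance per component
singular_values = np.linalg.svd(standardized, compute_uv=False)
explained_ratio = singular_values**2 / np.sum(singular_values**2)
cumulative = np.cumsum(explained_ratio)

for i, (ratio, total) in enumerate(zip(explained_ratio, cumulative), start=1):
    print(f"PC{i}: explained={ratio:.3f}, cumulative={total:.3f}")
```

The number to retain is the first component at which the cumulative value crosses the chosen threshold (95% here). With only four samples, the centered data spans at most three directions, so at most three components can carry any variance; a realistic dataset needs many more rows before this rule gives a trustworthy answer.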
