### Q1

Min-Max Scaling is a data preprocessing technique that rescales each feature to a fixed range, typically [0, 1] or [-1, 1]. For the range [0, 1], the formula is:

$$x_{\text{scaled}} = \frac{x - x_{\min}}{x_{\max} - x_{\min}}$$

$x$: Original value of the feature.

$x_{\min}$: Minimum value of the feature.

$x_{\max}$: Maximum value of the feature.

$x_{\text{scaled}}$: Scaled value of the feature.

This scaling technique ensures that all features have the same scale, which is particularly important for algorithms sensitive to the magnitude of feature values, such as:

Gradient-based models (e.g., logistic regression, neural networks).
Distance-based algorithms (e.g., k-Nearest Neighbors, k-Means Clustering).

Steps in Min-Max Scaling

1. Compute the minimum ($x_{\min}$) and maximum ($x_{\max}$) values of the feature.
2. Apply the scaling formula to each data point in the feature.
3. Repeat the process for all features in the dataset.


In [1]:
from sklearn.preprocessing import MinMaxScaler
import pandas as pd

# Example Data
data = {'Age': [25, 30, 35, 40, 50],
        'Salary': [50000, 80000, 120000, 150000, 200000]}
df = pd.DataFrame(data)

# Min-Max Scaling
scaler = MinMaxScaler()
scaled_data = scaler.fit_transform(df)

# Scaled Data
scaled_df = pd.DataFrame(scaled_data, columns=['Age', 'Salary'])
print(scaled_df)


   Age    Salary
0  0.0  0.000000
1  0.2  0.200000
2  0.4  0.466667
3  0.6  0.666667
4  1.0  1.000000
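
As a cross-check on the `MinMaxScaler` result, the same steps can be carried out by hand. The following is a minimal sketch using plain NumPy on the same Age and Salary values; it assumes every feature has a maximum strictly greater than its minimum, since otherwise the denominator would be zero. The variable names are illustrative.

```python
import numpy as np

# Same example values as the Age and Salary columns above
X = np.array([
    [25, 50000],
    [30, 80000],
    [35, 120000],
    [40, 150000],
    [50, 200000],
], dtype=float)

# Step 1: per-feature (column-wise) minimum and maximum
x_min = X.min(axis=0)
x_max = X.max(axis=0)

# Steps 2 and 3: apply (x - x_min) / (x_max - x_min) to every value;
# broadcasting covers all features in one expression
X_scaled = (X - x_min) / (x_max - x_min)

print(X_scaled)
```

The printed columns should agree with the `Age` and `Salary` columns of `scaled_df`.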


### Q2

The Unit Vector technique, also called Normalization, scales each data point (each sample vector) to have unit norm, i.e., a magnitude of 1. It does this by dividing each data point by its norm, most commonly the Euclidean (L2) norm.

For a feature vector $x = [x_1, x_2, \ldots, x_n]$, the normalized vector $x_{\text{normalized}}$ is calculated as:

$$x_{\text{normalized}} = \frac{x}{\|x\|}$$

where $\|x\|$ is the norm of the vector, computed as:

L2-Norm (Euclidean):

$$\|x\|_2 = \sqrt{x_1^2 + x_2^2 + \cdots + x_n^2}$$

L1-Norm (Manhattan):

$$\|x\|_1 = |x_1| + |x_2| + \cdots + |x_n|$$


In [2]:
from sklearn.preprocessing import normalize
import numpy as np

# Example data
data = np.array([[3, 4], [1, 2], [5, 12], [2, 1], [0, 3]])

# Normalize data (L2 norm by default)
normalized_data = normalize(data, norm='l2')

print(normalized_data)


[[0.6        0.8       ]
 [0.4472136  0.89442719]
 [0.38461538 0.92307692]
 [0.89442719 0.4472136 ]
 [0.         1.        ]]
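
To connect the L2 and L1 formulas to the output above, here is a small NumPy sketch that normalizes the same rows by hand. It assumes no row is all zeros (such a row has norm 0 and cannot be rescaled this way); the variable names are illustrative.

```python
import numpy as np

data = np.array([[3, 4], [1, 2], [5, 12], [2, 1], [0, 3]], dtype=float)

# L2 norm of each row: sqrt(x_1^2 + ... + x_n^2)
l2_norms = np.sqrt((data ** 2).sum(axis=1, keepdims=True))
l2_normalized = data / l2_norms

# L1 norm of each row: |x_1| + ... + |x_n|
l1_norms = np.abs(data).sum(axis=1, keepdims=True)
l1_normalized = data / l1_norms

print(l2_normalized)
print(l1_normalized)
```

After L2 normalization every row has Euclidean length 1; after L1 normalization the absolute values in every row sum to 1.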


### Q3

Principal Component Analysis (PCA) is a statistical technique used for dimensionality reduction while preserving as much variance as possible in the data. It achieves this by transforming the original features into a new set of uncorrelated variables called principal components, which are linear combinations of the original features. The principal components are ordered such that the first component captures the maximum variance, the second captures the next highest variance orthogonal to the first, and so on.

#### How PCA Works

1. Standardize the Data: Ensure all features are on the same scale by standardizing them.
2. Compute the Covariance Matrix: Calculate the covariance matrix of the standardized data to understand feature relationships.
3. Find Eigenvalues and Eigenvectors: Determine the eigenvalues (the variance along each direction) and eigenvectors (the directions) of the covariance matrix.
4. Select Principal Components: Choose the top $k$ eigenvectors corresponding to the $k$ largest eigenvalues.
5. Transform the Data: Project the standardized data onto the selected eigenvectors to form the reduced-dimensional representation.

#### Applications of PCA
Dimensionality Reduction: Simplifies datasets with many features while retaining the most critical information.

Visualization: Reduces high-dimensional data to 2 or 3 dimensions for easy visualization.

Noise Reduction: Discards low-variance components, which often capture noise rather than signal.

Preprocessing for Machine Learning: Reduces dimensionality, which can lower the risk of overfitting and speed up training.

In [3]:
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Example data
data = np.array([
    [2, 8, 4, 6],
    [3, 6, 5, 8],
    [4, 7, 6, 10],
    [5, 5, 7, 12],
    [6, 4, 8, 14]
])

# Step 1: Standardize the data
scaler = StandardScaler()
data_standardized = scaler.fit_transform(data)

# Step 2: Apply PCA
pca = PCA(n_components=2)  # Reduce to 2 dimensions
data_reduced = pca.fit_transform(data_standardized)

# Print results
print("Original Shape:", data.shape)
print("Reduced Shape:", data_reduced.shape)
print("Explained Variance Ratio:", pca.explained_variance_ratio_)
print("Principal Components:\n", data_reduced)


Original Shape: (5, 4)
Reduced Shape: (5, 2)
Explained Variance Ratio: [0.96300648 0.03699352]
Principal Components:
 [[-2.82765352  0.0661481 ]
 [-1.07469155 -0.58739941]
 [-0.33913521  0.62047346]
 [ 1.41382676 -0.03307405]
 [ 2.82765352 -0.0661481 ]]
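
The steps listed under "How PCA Works" can also be followed directly with NumPy. The sketch below is one possible manual implementation on the same array, not the routine scikit-learn uses internally (which relies on SVD). The signs of the projected values may differ from the output above, because an eigenvector is only defined up to sign.

```python
import numpy as np

X = np.array([
    [2, 8, 4, 6],
    [3, 6, 5, 8],
    [4, 7, 6, 10],
    [5, 5, 7, 12],
    [6, 4, 8, 14],
], dtype=float)

# Step 1: standardize each feature (zero mean, unit variance)
Z = (X - X.mean(axis=0)) / X.std(axis=0)

# Step 2: covariance matrix of the standardized features
cov = np.cov(Z, rowvar=False)

# Step 3: eigen-decomposition (eigh is meant for symmetric matrices)
eigvals, eigvecs = np.linalg.eigh(cov)

# Step 4: sort by decreasing eigenvalue and keep the top k directions
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]
k = 2
W = eigvecs[:, :k]

# Step 5: project the standardized data onto those directions
Z_reduced = Z @ W

explained_ratio = eigvals / eigvals.sum()
print("Explained variance ratio:", explained_ratio[:k])
print("Projected data:\n", Z_reduced)
```

The explained variance ratio should agree with `pca.explained_variance_ratio_` from the scikit-learn cell.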


### Q4

#### Relationship Between PCA and Feature Extraction
Feature Extraction refers to the process of creating new features that are informative and represent the underlying patterns in the data. These new features are derived from the original dataset.

Principal Component Analysis (PCA) is a feature extraction method that transforms the original features into a new set of uncorrelated features called principal components. These components are ordered by the amount of variance they capture in the data.

The relationship lies in PCA's ability to derive a small set of new features (principal components) from a dataset, summarizing the original data in fewer dimensions while retaining as much variance as possible.

#### How PCA is Used for Feature Extraction
Identify Key Variance: PCA identifies the directions (principal components) that maximize variance in the dataset.

Transform Data: PCA projects the data onto these principal components, creating new features that are combinations of the original features.

Select Top Components: Only the top $k$ components that capture most of the variance are retained as extracted features.

This reduces the dimensionality of the data and removes redundancy while preserving the most critical information.
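
To make "combinations of the original features" concrete, the sketch below inspects the weights that scikit-learn stores in `pca.components_` and rebuilds the extracted features from them. It reuses the small array from the PCA example in Q3 purely for illustration.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X = np.array([
    [2, 8, 4, 6],
    [3, 6, 5, 8],
    [4, 7, 6, 10],
    [5, 5, 7, 12],
    [6, 4, 8, 14],
], dtype=float)
X_std = StandardScaler().fit_transform(X)

pca = PCA(n_components=2)
extracted = pca.fit_transform(X_std)

# Each row of components_ holds the weights of one principal component
# over the (standardized) original features
print("Component weights:\n", pca.components_)

# The extracted features are the centered data multiplied by those weights
manual = (X_std - pca.mean_) @ pca.components_.T
print("Matches fit_transform:", np.allclose(manual, extracted))
```

Each extracted feature is therefore a weighted sum of all four original features rather than a subset of them, which is what distinguishes feature extraction from feature selection.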

### Q5

Using Min-Max Scaling for Preprocessing Data in a Recommendation System
When building a recommendation system for a food delivery service, features like price, rating, and delivery time may have different ranges. Min-Max scaling is a preprocessing technique that rescales these features to a uniform range (e.g., 0 to 1), ensuring they contribute equally to the model.

#### Steps to Apply Min-Max Scaling
Understand the Data: Identify the range of each feature. For example:

Price: \$5 to \$50

Rating: 1 to 5

Delivery Time: 10 to 60 minutes

Formula for Min-Max Scaling:

$$X_{\text{scaled}} = \frac{X - X_{\min}}{X_{\max} - X_{\min}}$$

This formula rescales each feature to a range of 0 to 1.

Preprocess Features: Apply the Min-Max scaling to each feature separately.

Use the Scaled Data: Feed the scaled data into the recommendation system model for consistent weighting.


In [4]:
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

# Dataset
data = {
    "Food Item": ["Pizza", "Burger", "Sushi", "Sandwich", "Pasta"],
    "Price": [15, 8, 40, 10, 25],
    "Rating": [4.5, 4.2, 4.8, 3.9, 4.7],
    "Delivery Time": [30, 25, 50, 20, 40]
}

# Create a DataFrame
df = pd.DataFrame(data)

# Step 1: Initialize the Min-Max Scaler
scaler = MinMaxScaler()

# Step 2: Scale the numerical features
scaled_data = scaler.fit_transform(df[["Price", "Rating", "Delivery Time"]])

# Step 3: Create a DataFrame for scaled features
scaled_df = pd.DataFrame(scaled_data, columns=["Price", "Rating", "Delivery Time"])

# Combine scaled features with Food Item
result = pd.concat([df[["Food Item"]], scaled_df], axis=1)

print(result)


  Food Item    Price    Rating  Delivery Time
0     Pizza  0.21875  0.666667       0.333333
1    Burger  0.00000  0.333333       0.166667
2     Sushi  1.00000  1.000000       1.000000
3  Sandwich  0.06250  0.000000       0.000000
4     Pasta  0.53125  0.888889       0.666667
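
In a running recommendation system, items that arrive later should be scaled with the minimum and maximum learned from the original data, not refitted on their own. The following sketch assumes a hypothetical new menu item (its values are invented for illustration) and applies the already fitted scaler with `transform`.

```python
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

features = ["Price", "Rating", "Delivery Time"]
train = pd.DataFrame({
    "Price": [15, 8, 40, 10, 25],
    "Rating": [4.5, 4.2, 4.8, 3.9, 4.7],
    "Delivery Time": [30, 25, 50, 20, 40],
})

# Learn each feature's min and max from the existing items only
scaler = MinMaxScaler().fit(train[features])

# Hypothetical new item, not part of the original dataset
new_item = pd.DataFrame({"Price": [12], "Rating": [4.0], "Delivery Time": [35]})

# Reuse the fitted min/max instead of calling fit_transform again
new_scaled = scaler.transform(new_item[features])
print(new_scaled)
```

A new value outside the fitted range (for example, a price above 40) would map outside [0, 1], which is worth keeping in mind when the model expects bounded inputs.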


#### Why Use Min-Max Scaling?
Uniform Weighting: Without scaling, features with larger ranges (e.g., delivery time) might dominate features with smaller ranges (e.g., ratings).

Improved Model Performance: Many machine learning algorithms (e.g., KNN, SVM) are sensitive to feature magnitudes. Min-Max scaling helps these models perform better.

Interpretability: Scaled data makes it easier to interpret feature contributions during analysis or visualization.
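
The uniform-weighting point can be illustrated with a small experiment. The sketch below assumes, purely for illustration, that the recommender ranks items by Euclidean distance to a reference item (Pizza); the helper `neighbors_of` is hypothetical and not part of any library.

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

items = ["Pizza", "Burger", "Sushi", "Sandwich", "Pasta"]
# Columns: Price, Rating, Delivery Time
X = np.array([
    [15, 4.5, 30],
    [8, 4.2, 25],
    [40, 4.8, 50],
    [10, 3.9, 20],
    [25, 4.7, 40],
])

def neighbors_of(features, query_index=0):
    """Return the other items ordered from nearest to farthest (Euclidean)."""
    dists = np.linalg.norm(features - features[query_index], axis=1)
    order = np.argsort(dists)
    return [items[i] for i in order if i != query_index]

raw_order = neighbors_of(X)
scaled_order = neighbors_of(MinMaxScaler().fit_transform(X))

print("Nearest to Pizza (raw):   ", raw_order)
print("Nearest to Pizza (scaled):", scaled_order)
```

On the raw values the distances are driven almost entirely by price and delivery time, because rating differences are at most 0.9. After scaling, rating carries comparable weight, so the neighbour ordering can change.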

#### Application in Recommendation System
Price: Helps recommend items within the customer's budget.

Rating: Ensures highly rated items are prioritized.

Delivery Time: Optimizes for faster delivery.

By using Min-Max scaling, you put all features on a comparable scale, so that no single feature dominates the recommendation system merely because of its units or range.








### Q6

Using PCA for Dimensionality Reduction in Stock Price Prediction
When building a predictive model for stock prices, the dataset often contains numerous features such as company financial data (e.g., revenue, profit margins) and market trends (e.g., indices, sector performance). Principal Component Analysis (PCA) can help reduce the dataset's dimensionality, making it more manageable while retaining the most important information.

#### Steps to Apply PCA for Stock Price Prediction
1. Understand the Data
High Dimensionality: A dataset with many features may lead to computational inefficiency and overfitting.

Correlated Features: Features like "revenue" and "profit margin" may be correlated, contributing redundant information.

2. Preprocess the Data
Standardize Features: PCA is sensitive to the scale of the data, so standardization is necessary to ensure all features contribute equally:

$$z = \frac{x - \mu}{\sigma}$$

where $\mu$ is the mean and $\sigma$ is the standard deviation of the feature.

3. Apply PCA
Compute Principal Components: PCA identifies the directions (principal components) that capture the maximum variance in the data.

Select the Number of Components: Use the explained variance ratio to decide how many principal components to retain (e.g., 95% variance).

4. Use Reduced Data for Modeling
Train the stock price prediction model using the reduced dataset with fewer components.
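
The standardization formula in step 2 can be checked by hand. This minimal sketch uses illustrative Revenue and Profit Margin values and compares the manual z-scores with `StandardScaler`, which uses the population standard deviation (`ddof=0`).

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Columns: Revenue, Profit Margin (illustrative values)
X = np.array([
    [500, 0.15],
    [600, 0.20],
    [700, 0.18],
    [800, 0.22],
    [900, 0.25],
])

mu = X.mean(axis=0)
sigma = X.std(axis=0)  # population std (ddof=0), as StandardScaler uses
Z = (X - mu) / sigma

print("Column means:", Z.mean(axis=0))
print("Column stds: ", Z.std(axis=0))
print("Matches StandardScaler:", np.allclose(Z, StandardScaler().fit_transform(X)))
```

After standardization both columns have mean 0 and standard deviation 1 (up to floating-point error), even though revenue and profit margin started out several orders of magnitude apart.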


In [5]:
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Example: Financial and market data
data = {
    "Revenue": [500, 600, 700, 800, 900],
    "Profit Margin": [0.15, 0.20, 0.18, 0.22, 0.25],
    "Market Index": [3000, 3100, 3200, 3300, 3400],
    "Sector Growth": [2.1, 2.3, 2.2, 2.4, 2.5]
}

# Step 1: Create a DataFrame
df = pd.DataFrame(data)

# Step 2: Standardize the data
scaler = StandardScaler()
scaled_data = scaler.fit_transform(df)

# Step 3: Apply PCA
pca = PCA(n_components=2)  # Reduce to 2 components
pca_data = pca.fit_transform(scaled_data)

# Step 4: Create a DataFrame for PCA results
pca_df = pd.DataFrame(pca_data, columns=["PC1", "PC2"])

print("Explained Variance Ratio:", pca.explained_variance_ratio_)
print(pca_df)


Explained Variance Ratio: [0.95294948 0.04631811]
        PC1       PC2
0 -2.855419 -0.026639
1 -0.707446  0.705701
2 -0.646636 -0.651113
3  1.354082 -0.054588
4  2.855419  0.026639
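
The cell above fixes `n_components=2`. To apply the variance-threshold rule from step 3 instead, scikit-learn accepts a fraction between 0 and 1 for `n_components`. The sketch below reuses the same example data and passes `svd_solver="full"` explicitly, which is the solver this option is documented for.

```python
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

df = pd.DataFrame({
    "Revenue": [500, 600, 700, 800, 900],
    "Profit Margin": [0.15, 0.20, 0.18, 0.22, 0.25],
    "Market Index": [3000, 3100, 3200, 3300, 3400],
    "Sector Growth": [2.1, 2.3, 2.2, 2.4, 2.5],
})
scaled = StandardScaler().fit_transform(df)

# A float in (0, 1) asks for the smallest number of components whose
# cumulative explained variance exceeds that fraction
pca = PCA(n_components=0.95, svd_solver="full")
reduced = pca.fit_transform(scaled)

print("Components kept:", pca.n_components_)
print("Variance retained:", pca.explained_variance_ratio_.sum())
```

This leaves the choice of dimensionality to the data, which is convenient when the feature set changes over time.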


### Q7

In [6]:
import numpy as np

# Original dataset
data = np.array([1, 5, 10, 15, 20])

# Desired range for Min-Max scaling: [-1, 1]
min_val, max_val = -1, 1

# Min-Max scaling formula
scaled_data = min_val + (data - data.min()) * (max_val - min_val) / (data.max() - data.min())

print("Scaled Data:", scaled_data)


Scaled Data: [-1.         -0.57894737 -0.05263158  0.47368421  1.        ]
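
The cell above implements the general form of Min-Max scaling for a target range $[a, b]$:

$$x_{\text{scaled}} = a + \frac{(x - x_{\min})(b - a)}{x_{\max} - x_{\min}}$$

As a cross-check, the same result can be obtained with scikit-learn's `MinMaxScaler` and its `feature_range` parameter. This is a minimal sketch; the reshape is needed because the scaler expects a 2D array.

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

data = np.array([1, 5, 10, 15, 20], dtype=float)

# MinMaxScaler expects shape (n_samples, n_features), so use one column
scaler = MinMaxScaler(feature_range=(-1, 1))
scaled = scaler.fit_transform(data.reshape(-1, 1)).ravel()

print("Scaled Data:", scaled)
```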


### Q8

#### Performing Feature Extraction Using PCA
For a dataset containing the features [height, weight, age, gender, blood pressure], the process of applying Principal Component Analysis (PCA) involves the following steps:

#### Steps for PCA-Based Feature Extraction
1. Preprocessing
Standardization: PCA is sensitive to feature scaling. Therefore, all features must be standardized to have a mean of 0 and a standard deviation of 1.

2. Apply PCA
Compute the principal components to determine how much variance each component explains.

3. Determine Number of Components
Retain the principal components that collectively explain a significant percentage of the variance (e.g., 95%).

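#### Choosing the Number of Principal Components
Explanation

Original Features: The dataset has 5 features, so PCA can produce up to 5 principal components, each explaining a part of the variance. With only 5 samples, as in the example that follows, the centered data has rank at most 4, so the fifth component explains no variance.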
Explained Variance Ratio: Calculate the explained variance ratio to determine the contribution of each principal component.

Threshold: Choose the number of components that explain a cumulative variance of 95% or more.



In [7]:
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Example dataset (height, weight, age, gender, blood pressure)
data = np.array([
    [170, 65, 25, 1, 120],
    [160, 70, 30, 0, 130],
    [180, 80, 35, 1, 140],
    [165, 55, 28, 0, 125],
    [175, 75, 40, 1, 135]
])

# Step 1: Standardize the data
scaler = StandardScaler()
scaled_data = scaler.fit_transform(data)

# Step 2: Apply PCA
pca = PCA()
pca_data = pca.fit_transform(scaled_data)

# Step 3: Determine explained variance
explained_variance_ratio = pca.explained_variance_ratio_
cumulative_variance = np.cumsum(explained_variance_ratio)

print("Explained Variance Ratio:", explained_variance_ratio)
print("Cumulative Variance:", cumulative_variance)

# Select components explaining at least 95% of the variance
n_components = np.argmax(cumulative_variance >= 0.95) + 1
print("Number of Components to Retain:", n_components)


Explained Variance Ratio: [0.71497306 0.19496105 0.05512943 0.03493645 0.        ]
Cumulative Variance: [0.71497306 0.90993411 0.96506355 1.         1.        ]
Number of Components to Retain: 3
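
Once the number of components is known, the extracted features are simply the first columns of the transformed data. The sketch below wraps the whole procedure in a hypothetical helper, `reduce_to_threshold` (the name and signature are illustrative), and assumes the threshold is actually reachable, which holds for any value below 1.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

def reduce_to_threshold(X, threshold=0.95):
    """Hypothetical helper: keep the fewest components reaching `threshold`."""
    Z = StandardScaler().fit_transform(X)
    pca = PCA().fit(Z)
    cumulative = np.cumsum(pca.explained_variance_ratio_)
    # First index where the cumulative ratio reaches the threshold, plus one
    k = int(np.argmax(cumulative >= threshold)) + 1
    return pca.transform(Z)[:, :k], cumulative[k - 1]

# Same example dataset (height, weight, age, gender, blood pressure)
data = np.array([
    [170, 65, 25, 1, 120],
    [160, 70, 30, 0, 130],
    [180, 80, 35, 1, 140],
    [165, 55, 28, 0, 125],
    [175, 75, 40, 1, 135],
], dtype=float)

reduced, retained = reduce_to_threshold(data)
print("Reduced shape:", reduced.shape)
print("Variance retained:", retained)
```

The returned array holds the extracted features that would be passed to a downstream model in place of the five original columns.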
