Q1. Write a Python code to implement the KNN classifier algorithm on load_iris dataset in
sklearn.datasets.


Ans:
    
Here's an example of how to implement the K-Nearest Neighbors (KNN) classifier
algorithm on the `load_iris` dataset from `sklearn.datasets`:


import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

# Load the iris dataset
iris = load_iris()
X = iris.data
y = iris.target

# Split the dataset into a training set and a test set
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Initialize the KNN classifier with a specified number of neighbors (k)
k = 3
knn_classifier = KNeighborsClassifier(n_neighbors=k)

# Fit the classifier on the training data
knn_classifier.fit(X_train, y_train)

# Predict the labels for the test set
y_pred = knn_classifier.predict(X_test)

# Calculate the accuracy of the classifier
accuracy = accuracy_score(y_test, y_pred)
print(f"Accuracy of KNN with k={k}: {accuracy}")


In this code:

1. We import the necessary libraries and load the Iris dataset with `load_iris`.
2. We split the dataset into a training set and a test set using `train_test_split`.
3. We initialize a KNN classifier with a specified number of neighbors 
(`k` in this case is set to 3).
4. We fit the classifier on the training data using the `fit` method.
5. We use the trained classifier to make predictions on the test set.
6. Finally, we calculate and print the accuracy of the KNN classifier on the test set.

You can adjust the value of `k` to experiment with different numbers of 
neighbors for the KNN algorithm.
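
To see what the classifier does conceptually, here's a minimal pure-Python sketch of
KNN prediction by majority vote. It is only an illustration: the function name
`knn_predict` and the toy data are made up for this example, and scikit-learn's
`KNeighborsClassifier` uses optimized neighbor searches rather than this brute-force loop.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest training points."""
    # Pair each training point's Euclidean distance to the query with its label
    distances = [
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    ]
    # Keep the k closest neighbors
    distances.sort(key=lambda pair: pair[0])
    nearest_labels = [label for _, label in distances[:k]]
    # Return the most common label among them
    return Counter(nearest_labels).most_common(1)[0][0]


# Toy data (made up for illustration): two well-separated clusters
train_points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
train_labels = ["a", "a", "a", "b", "b", "b"]

print(knn_predict(train_points, train_labels, (1.1, 1.0), k=3))  # a
print(knn_predict(train_points, train_labels, (5.1, 5.0), k=3))  # b
```

A larger `k` smooths the decision at the cost of blurring class boundaries, which is
why it is worth trying several values.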











Q2. Write a Python code to implement the KNN regressor algorithm on load_boston dataset in
sklearn.datasets.

Ans:
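Here's a Python code example to implement the K-Nearest Neighbors (KNN)
regressor algorithm on the `load_boston` dataset from scikit-learn. Note that
`load_boston` was deprecated in scikit-learn 1.0 and removed in 1.2, so this code
only runs on older versions: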


# Import necessary libraries
import numpy as np
from sklearn.datasets import load_boston
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsRegressor
from sklearn.metrics import mean_squared_error, r2_score

# Load the Boston Housing dataset
boston = load_boston()

# Split the dataset into features and target variable
X = boston.data
y = boston.target

# Split the dataset into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Standardize the features (important for KNN)
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

# Initialize the KNN regressor
knn_regressor = KNeighborsRegressor(n_neighbors=5)  # You can change the number of neighbors

# Fit the model to the training data
knn_regressor.fit(X_train, y_train)

# Predict the target variable on the test data
y_pred = knn_regressor.predict(X_test)

# Evaluate the model
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)

print("Mean Squared Error:", mse)
print("R-squared:", r2)


In this code:

1. We load the Boston Housing dataset using `load_boston()`.

2. We split the dataset into features (X) and the target variable (y).

3. We further split the data into training and testing sets using `train_test_split()`.

4. Standardization of features is performed using `StandardScaler()` so that
features measured on larger scales do not dominate the distance calculation.

5. We initialize the KNN regressor with the desired number of neighbors 
(in this case, 5) using `KNeighborsRegressor`.

6. We fit the KNN regressor model to the training data using `fit()`.

7. We make predictions on the test data using `predict()`.

8. Finally, we evaluate the model using mean squared error (`mse`) and 
R-squared (`r2`) metrics to measure its performance.

You can adjust the number of neighbors and other hyperparameters to optimize
the KNN regressor for your specific problem.
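
For intuition, here's a minimal pure-Python sketch of how a KNN regressor with uniform
weights forms a prediction: it averages the targets of the k nearest training points.
The function name `knn_regress` and the toy data are assumptions made for this example,
not part of scikit-learn.

```python
import math


def knn_regress(train_points, train_targets, query, k=5):
    """Predict a numeric target as the mean target of the k nearest training points."""
    # Sort training indices by Euclidean distance to the query
    order = sorted(
        range(len(train_points)),
        key=lambda i: math.dist(train_points[i], query),
    )
    nearest = order[:k]
    # Uniform weights: a plain average of the neighbors' targets
    return sum(train_targets[i] for i in nearest) / len(nearest)


# Toy data (made up for illustration): the target is 10 * x
train_points = [(1.0,), (2.0,), (3.0,), (4.0,), (5.0,), (6.0,)]
train_targets = [10.0, 20.0, 30.0, 40.0, 50.0, 60.0]

# The three nearest points to x = 3.4 are x = 3, 4 and 2, so the mean is 30.0
prediction = knn_regress(train_points, train_targets, (3.4,), k=3)
print(prediction)  # 30.0
```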












Q3. Write a Python code snippet to find the optimal value of K for the KNN classifier algorithm using
cross-validation on load_iris dataset in sklearn.datasets.


Ans:
    
You can use cross-validation to find the optimal value of K for the K-Nearest
Neighbors (KNN) classifier algorithm using the `load_iris` dataset from scikit-learn.
Here's a Python code snippet to do that:


import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Load the iris dataset
iris = load_iris()
X = iris.data
y = iris.target

# Hold out a test set; the cross-validation below uses only the training portion
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Define a range of K values to test
k_values = list(range(1, 31))  # You can adjust this range based on your needs

# Create an empty list to store cross-validation scores
cv_scores = []

# Perform 10-fold cross-validation for each K value
for k in k_values:
    knn = KNeighborsClassifier(n_neighbors=k)
    scores = cross_val_score(knn, X_train, y_train, cv=10, scoring='accuracy')
    cv_scores.append(scores.mean())

# Find the K value with the highest mean cross-validation score (ties go to the smallest K)
optimal_k = k_values[cv_scores.index(max(cv_scores))]

# Print the optimal K value
print(f"The optimal K value is: {optimal_k}")

# You can also plot the cross-validation scores to visualize the results
import matplotlib.pyplot as plt

plt.figure(figsize=(10, 6))
plt.plot(k_values, cv_scores, marker='o', linestyle='-')
plt.xlabel('Number of Neighbors (K)')
plt.ylabel('Cross-Validation Accuracy')
plt.title('KNN Cross-Validation Performance')
plt.grid(True)
plt.show()


This code will output the optimal value of K for the KNN classifier using 10-fold
cross-validation on the training split of the iris dataset and also provide a plot to
visualize the cross-validation accuracy for different K values. The held-out test set
is not used here; it remains available for a final evaluation of the chosen K. Adjust
the `k_values` range as needed for your specific analysis.
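
To show what the cross-validation loop is doing under the hood, here's a minimal
pure-Python sketch that scores several K values with a hand-rolled k-fold split. The
names `knn_predict` and `cross_val_accuracy` and the toy data are assumptions for this
example. The interleaved fold assignment is a simplification; `cross_val_score` uses
stratified folds for classifiers.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k):
    """Majority vote among the k training points closest to `query`."""
    nearest = sorted(
        zip(train_points, train_labels),
        key=lambda pair: math.dist(pair[0], query),
    )[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def cross_val_accuracy(points, labels, k, n_folds=3):
    """Mean KNN accuracy over n_folds folds; fold f holds out samples with i % n_folds == f."""
    fold_scores = []
    for fold in range(n_folds):
        val_idx = [i for i in range(len(points)) if i % n_folds == fold]
        train_idx = [i for i in range(len(points)) if i % n_folds != fold]
        train_points = [points[i] for i in train_idx]
        train_labels = [labels[i] for i in train_idx]
        # Count validation samples whose predicted label matches the true label
        correct = sum(
            knn_predict(train_points, train_labels, points[i], k) == labels[i]
            for i in val_idx
        )
        fold_scores.append(correct / len(val_idx))
    return sum(fold_scores) / n_folds


# Toy data (made up for illustration): two clusters with interleaved labels
points = [(1.0, 1.0), (5.0, 5.0), (1.2, 0.9), (5.1, 4.8), (0.8, 1.1), (4.9, 5.2),
          (1.1, 1.3), (5.3, 5.0), (0.9, 0.7), (4.7, 4.9), (1.3, 1.0), (5.0, 5.3)]
labels = ["a", "b"] * 6

scores = {k: cross_val_accuracy(points, labels, k) for k in (1, 3, 5)}
best_k = max(scores, key=scores.get)  # ties go to the first (smallest) k
print(scores, best_k)
```

On data this cleanly separated every K scores the same, so the tie-break picks the
smallest K. On real data the scores usually differ.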
    
    
    
    
    
    
    
    
    
    
    
    
    
    
Q4. Implement the KNN regressor algorithm with feature scaling on load_boston dataset in
sklearn.datasets.   


Ans:
    
To implement the K-Nearest Neighbors (KNN) regressor algorithm with feature
scaling on the `load_boston` dataset from scikit-learn, you can follow these steps:

1. Import the necessary libraries and load the dataset.
2. Split the dataset into training and testing sets.
3. Scale the features to have zero mean and unit variance.
4. Create and train the KNN regressor model.
5. Make predictions on the test set.
6. Evaluate the model's performance using appropriate regression metrics.

Here's a Python code implementation of these steps:


import numpy as np
import matplotlib.pyplot as plt
# load_boston was removed in scikit-learn 1.2; this import needs an older version
from sklearn.datasets import load_boston
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsRegressor
from sklearn.metrics import mean_squared_error, r2_score

# Step 1: Load the Boston Housing dataset
boston = load_boston()
X = boston.data
y = boston.target

# Step 2: Split the dataset into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Step 3: Scale the features using StandardScaler
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)

# Step 4: Create and train the KNN regressor model
k = 5  # Number of neighbors (you can adjust this value)
knn_regressor = KNeighborsRegressor(n_neighbors=k)
knn_regressor.fit(X_train_scaled, y_train)

# Step 5: Make predictions on the test set
y_pred = knn_regressor.predict(X_test_scaled)

# Step 6: Evaluate the model's performance
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)

print(f"Mean Squared Error (MSE): {mse:.2f}")
print(f"R-squared (R2) Score: {r2:.2f}")

# You can also plot the predicted vs. actual values to visualize the performance
plt.scatter(y_test, y_pred)
plt.xlabel("Actual Values")
plt.ylabel("Predicted Values")
plt.title("Actual vs. Predicted Values (KNN Regressor)")
plt.show()


Make sure to adjust the `k` value (number of neighbors) and other hyperparameters as 
needed for your specific regression task. Additionally, you can explore 
different scaling techniques or other regression algorithms to further
improve performance if necessary.
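
On scikit-learn 1.2 or later, where `load_boston` is no longer available, the same
scale-then-regress pattern can be sketched with a bundled dataset. The example below
swaps in `load_diabetes` purely as a stand-in (an assumption, not the dataset the
question asks for) and wraps the scaler and regressor in a pipeline. Its features come
pre-scaled, so the scaler changes little here; the point is the pattern.

```python
from sklearn.datasets import load_diabetes
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Assumption: load_diabetes stands in for the removed load_boston dataset
X, y = load_diabetes(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# The pipeline fits the scaler on the training data only, then applies it before KNN
model = make_pipeline(StandardScaler(), KNeighborsRegressor(n_neighbors=5))
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

print(f"MSE: {mean_squared_error(y_test, y_pred):.2f}")
print(f"R2: {r2_score(y_test, y_pred):.2f}")
```

Keeping the scaler inside the pipeline also means it is refit on each training fold
if you later pass `model` to `cross_val_score`.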











Q5. Write a Python code snippet to implement the KNN classifier algorithm with weighted voting on
load_iris dataset in sklearn.datasets.


Ans:
    
    
    
To implement the K-Nearest Neighbors (KNN) classifier algorithm with weighted voting
on the `load_iris` dataset from the `sklearn.datasets` module, you can use the 
following Python code snippet. We will use the `KNeighborsClassifier` 
class from scikit-learn and assign weights based on the inverse 
of distance for weighted voting.


import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

# Load the Iris dataset
iris = load_iris()
X = iris.data
y = iris.target

# Split the dataset into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Define the KNN classifier with weighted voting
k = 5  # Number of neighbors to consider
weights = 'distance'  # Weight each neighbor's vote by the inverse of its distance
knn_classifier = KNeighborsClassifier(n_neighbors=k, weights=weights)

# Fit the classifier to the training data
knn_classifier.fit(X_train, y_train)

# Make predictions on the test data
y_pred = knn_classifier.predict(X_test)

# Calculate accuracy
accuracy = accuracy_score(y_test, y_pred)
print(f"Accuracy: {accuracy * 100:.2f}%")


In this code:

1. We load the Iris dataset using `load_iris()` and split it into training and testing sets.
2. We create a KNN classifier with `KNeighborsClassifier` and set `n_neighbors` to the
number of neighbors (in this case, 5) and `weights` to 'distance' to enable weighted voting.
3. We fit the classifier to the training data.
4. We make predictions on the test data and calculate the accuracy of the classifier.

This code will output the accuracy of the KNN classifier with weighted voting on
the Iris dataset. You can adjust the `k` value and weighting method as
needed for your specific application.
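
To illustrate how inverse-distance weighting can change the outcome, here's a minimal
pure-Python sketch. The function name `weighted_knn_predict` and the toy data are
assumptions for this example, and the handling of exact matches is a simplification
of what scikit-learn does.

```python
import math
from collections import defaultdict


def weighted_knn_predict(train_points, train_labels, query, k=5):
    """Predict by inverse-distance-weighted vote among the k nearest neighbors."""
    nearest = sorted(
        zip(train_points, train_labels),
        key=lambda pair: math.dist(pair[0], query),
    )[:k]
    votes = defaultdict(float)
    for point, label in nearest:
        distance = math.dist(point, query)
        if distance == 0.0:
            # An exact match decides the label outright (avoids dividing by zero)
            return label
        votes[label] += 1.0 / distance
    return max(votes, key=votes.get)


# Toy data (made up for illustration): one very close "a", three farther "b"s
train_points = [(0.1, 0.0), (2.0, 0.0), (0.0, 2.0), (-2.0, 0.0)]
train_labels = ["a", "b", "b", "b"]

# A plain majority vote over k=4 would pick "b"; the weights favor the close "a"
print(weighted_knn_predict(train_points, train_labels, (0.0, 0.0), k=4))  # a
```

Here the single "a" neighbor contributes a weight of 10, while the three "b" neighbors
contribute 0.5 each, so "a" wins despite being outnumbered.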












Q6. Implement a function to standardise the features before applying KNN classifier.


Ans:
    
Standardizing the features is an important preprocessing step when using K-Nearest
Neighbors (KNN) and many other machine learning algorithms because it puts
all features on the same scale. Standardization transforms each feature so
that it has a mean of 0 and a standard deviation of 1. Because KNN relies on
distance metrics, this keeps features with large numeric ranges from dominating the result.

You can use Python and the `StandardScaler` class from the `scikit-learn` library to
standardize the features. Here's how you can implement a function to standardize
the features before applying a KNN classifier:

from sklearn.preprocessing import StandardScaler

def standardize_features(X_train, X_test):
    """
    Standardize the features in X_train and apply the same transformation to X_test.

    Parameters:
    - X_train: Training feature matrix (2D numpy array or DataFrame).
    - X_test: Testing feature matrix (2D numpy array or DataFrame).

    Returns:
    - X_train_scaled: Standardized training feature matrix.
    - X_test_scaled: Standardized testing feature matrix.
    """
    # Initialize a StandardScaler object
    scaler = StandardScaler()

    # Fit the scaler to the training data and transform it
    X_train_scaled = scaler.fit_transform(X_train)

    # Use the same scaler to transform the testing data
    X_test_scaled = scaler.transform(X_test)

    return X_train_scaled, X_test_scaled




You can use this function to standardize your training and testing feature
matrices before applying the KNN classifier. Here's how you might use it:


from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

# Load your training and testing data
# X_train, y_train, X_test, y_test = load_data()

# Standardize the features
X_train_scaled, X_test_scaled = standardize_features(X_train, X_test)

# Initialize and fit a KNN classifier
knn_classifier = KNeighborsClassifier(n_neighbors=5)
knn_classifier.fit(X_train_scaled, y_train)

# Make predictions on the standardized testing data
y_pred = knn_classifier.predict(X_test_scaled)

# Calculate accuracy or other evaluation metrics
accuracy = accuracy_score(y_test, y_pred)
print("Accuracy:", accuracy)


Make sure to replace the `load_data()` function and data loading code with your actual 
data loading process. This code assumes that you have separate arrays or DataFrames for 
training and testing data (`X_train`, `y_train`, `X_test`, `y_test`).
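
To make the transformation itself visible, here's a minimal pure-Python sketch of
standardization without scikit-learn. The function name `standardize` and the toy data
are assumptions for this example. Like `StandardScaler`, it uses the population
standard deviation and takes the statistics from the training data only.

```python
import statistics


def standardize(train_rows, test_rows):
    """Scale each column to zero mean and unit variance using training statistics only."""
    columns = list(zip(*train_rows))
    means = [statistics.fmean(col) for col in columns]
    # Population standard deviation; fall back to 1.0 for constant columns
    stds = [statistics.pstdev(col) or 1.0 for col in columns]

    def scale(rows):
        return [
            [(value - m) / s for value, m, s in zip(row, means, stds)]
            for row in rows
        ]

    return scale(train_rows), scale(test_rows)


# Toy data (made up for illustration): the second feature is on a much larger scale
train_rows = [[1.0, 100.0], [2.0, 200.0], [3.0, 300.0]]
test_rows = [[2.0, 400.0]]

train_scaled, test_scaled = standardize(train_rows, test_rows)
print(train_scaled)
print(test_scaled)
```

After scaling, both training columns hold the same values, so neither feature
outweighs the other when distances are computed.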
    
    
    
    
    
    
    
    
    
 




Q7. Write a Python function to calculate the Euclidean distance between two points.
  

Ans:

You can calculate the Euclidean distance between two points (x1, y1)
and (x2, y2) in Python using the following function:


import math

def euclidean_distance(x1, y1, x2, y2):
    """
    Calculate the Euclidean distance between two points.
    
    Parameters:
    x1 (float): x-coordinate of the first point.
    y1 (float): y-coordinate of the first point.
    x2 (float): x-coordinate of the second point.
    y2 (float): y-coordinate of the second point.
    
    Returns:
    float: Euclidean distance between the two points.
    """
    return math.sqrt((x1 - x2)**2 + (y1 - y2)**2)

# Example usage:
point1 = (1, 2)
point2 = (4, 6)

distance = euclidean_distance(point1[0], point1[1], point2[0], point2[1])
print("Euclidean distance between point1 and point2:", distance)


Here, we first import the `math` module to use the square root function.
The function then calculates the Euclidean distance using the formula
sqrt((x1 - x2)^2 + (y1 - y2)^2) and returns the result. You can call this function
with the coordinates of the two points you want to find the distance between.
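
KNN feature vectors usually have more than two dimensions (the Iris data has four
features), so here's a sketch that generalizes the same formula to points of any
length. The name `euclidean_distance_nd` is an assumption for this example; on
Python 3.8 and later the standard library's `math.dist` gives the same result.

```python
import math


def euclidean_distance_nd(p, q):
    """Euclidean distance between two points given as equal-length sequences."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of dimensions")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


print(euclidean_distance_nd((1, 2), (4, 6)))        # 5.0
print(euclidean_distance_nd((1, 2, 3), (4, 6, 3)))  # 5.0
print(math.dist((1, 2), (4, 6)))                    # 5.0 (standard library, Python 3.8+)
```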
    
    
    
    
    
    
    
    
    
    
  



Q8. Write a Python function to calculate the Manhattan distance between two points.
    
    
Ans:
    
You can calculate the Manhattan distance between two points (x1, y1) and (x2, y2) using
the following Python function:


def manhattan_distance(x1, y1, x2, y2):
    """Return the Manhattan distance between the points (x1, y1) and (x2, y2)."""
    return abs(x1 - x2) + abs(y1 - y2)

# Example usage:
point1 = (1, 2)
point2 = (4, 6)
distance = manhattan_distance(point1[0], point1[1], point2[0], point2[1])
print("Manhattan distance:", distance)


This function takes four arguments: the x and y coordinates of the first point (x1, y1) and the
x and y coordinates of the second point (x2, y2). It calculates the absolute difference 
between the x coordinates and the absolute difference between the y coordinates and then
adds these two differences together to obtain the Manhattan distance.
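
The same idea extends to points with any number of coordinates. Here's a minimal
sketch; the name `manhattan_distance_nd` is an assumption for this example.

```python
def manhattan_distance_nd(p, q):
    """Manhattan (L1) distance between two points given as equal-length sequences."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of dimensions")
    return sum(abs(a - b) for a, b in zip(p, q))


print(manhattan_distance_nd((1, 2), (4, 6)))        # 7
print(manhattan_distance_nd((1, 2, 3), (4, 6, 5)))  # 9
```

In scikit-learn, passing `p=1` to `KNeighborsClassifier` or `KNeighborsRegressor`
selects this metric, while the default `p=2` corresponds to Euclidean distance.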




