### Q1. What is the KNN algorithm?

K-Nearest Neighbors (KNN) is a supervised machine learning algorithm used for both classification and regression tasks. 
It is simple but effective: the prediction for a new point is based directly on the most similar points in the training data.

Here's how the KNN algorithm works:

#### Training Phase:
In the training phase, KNN simply stores all the available data points and their corresponding class labels (for classification) or target values (for regression).

#### Prediction Phase:
When a new, unlabeled data point is given for prediction, KNN calculates the distance between this data point and all the data points in its training dataset. Common distance metrics include Euclidean distance, Manhattan distance, or others depending on the problem.

KNN then selects the top K data points (nearest neighbors) from the training dataset based on their calculated distances. K is a user-defined parameter.

   * ##### For classification:
KNN chooses the class label that occurs most frequently among the K nearest neighbors (majority vote) and assigns it to the new data point. An odd K avoids tied votes in binary classification. When a tie does occur (for example with an even K, or with more than two classes), it can be broken with distance-weighted voting, where closer neighbors count more.

   * ##### For regression:
KNN calculates the average of the target values of the K nearest neighbors and assigns it as the predicted target value for the new data point. A distance-weighted average, which gives closer neighbors more influence, is a common variant. Whether K is odd or even does not matter here, because there is no vote that could tie.

#### Output:

* For classification, KNN outputs the class label assigned to the new data point.
* For regression, KNN outputs the predicted target value.
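
The training and prediction phases described above can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation: the function names, the toy data, and the choice of Euclidean distance with an unweighted vote or average are all assumptions made for the example.

```python
import math
from collections import Counter

def euclidean(a, b):
    """Straight-line distance between two equal-length points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def k_nearest(train_X, train_y, query, k):
    """Return the targets of the k training points closest to `query`."""
    order = sorted(range(len(train_X)), key=lambda i: euclidean(train_X[i], query))
    return [train_y[i] for i in order[:k]]

def knn_classify(train_X, train_y, query, k=3):
    """Majority vote among the k nearest neighbors."""
    votes = Counter(k_nearest(train_X, train_y, query, k))
    return votes.most_common(1)[0][0]

def knn_regress(train_X, train_y, query, k=3):
    """Unweighted mean of the k nearest neighbors' target values."""
    neighbors = k_nearest(train_X, train_y, query, k)
    return sum(neighbors) / len(neighbors)

# "Training" is just keeping the data around (hypothetical toy data).
X = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (6.0, 6.0), (7.0, 7.5), (6.5, 6.0)]
labels = ["a", "a", "a", "b", "b", "b"]
values = [1.0, 1.2, 0.8, 5.0, 5.5, 5.2]

predicted_label = knn_classify(X, labels, (1.2, 1.4), k=3)
predicted_value = knn_regress(X, values, (6.4, 6.2), k=3)
print(predicted_label, predicted_value)
```

Note that all the work happens at prediction time: every query sorts the full training set by distance, which is why prediction cost grows with the size of the data.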
    
##### Key considerations when using KNN:

1. The choice of the distance metric and the value of K can significantly impact the algorithm's performance.
2. KNN can be sensitive to the scale of the features, so feature scaling (e.g., normalization) is often necessary.
3. It can be computationally expensive, especially with large datasets, as it requires calculating distances for all data points.
4. It's a lazy learner, meaning it doesn't build an explicit model during training, which can make it computationally efficient during training but potentially slower during prediction.

KNN is a versatile algorithm used in various fields, such as pattern recognition, image classification, recommendation systems, and more, but it may not always be the best choice for every problem, especially when dealing with high-dimensional data or imbalanced datasets.

### Q2. How do you choose the value of K in KNN?

Choosing the right value of K in the K-Nearest Neighbors (KNN) algorithm is a crucial step because it can significantly impact the algorithm's
performance. Selecting an appropriate K value depends on the specific dataset and problem you are working on. Here are some common methods and 
considerations for choosing the value of K:

#### Trial and Error:
One of the simplest approaches is to try different values of K and evaluate the algorithm's performance using techniques like 
cross-validation. Start with a small value of K (e.g., K=1) and gradually increase it while monitoring the algorithm's performance. Plot the model's accuracy or error rate as a function of K and choose the value that provides the best performance on your validation data.

#### Odd vs. Even K:
It's often recommended to use an odd value for K in binary classification tasks, because it rules out tied votes and ensures that the algorithm can make a clear decision. With more than two classes, ties can still occur even when K is odd, so a tie-breaking mechanism (e.g., distance-weighted voting) is useful in any case. With one in place, an even K can also work effectively.
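
As a sketch of how weighted voting can break a tie, the snippet below weights each neighbor by the inverse of its distance. The `weighted_vote` helper, the small `eps` constant that guards against division by zero, and the example neighbors are illustrative assumptions.

```python
from collections import defaultdict

def weighted_vote(neighbors, eps=1e-12):
    """Pick a label from (distance, label) pairs using inverse-distance weights."""
    scores = defaultdict(float)
    for distance, label in neighbors:
        scores[label] += 1.0 / (distance + eps)  # closer neighbors count more
    return max(scores, key=scores.get)

# K = 4 with a 2-2 split: a plain majority vote would tie here.
neighbors = [(0.5, "spam"), (0.7, "spam"), (2.0, "ham"), (2.5, "ham")]
winner = weighted_vote(neighbors)
print(winner)
```

The two "spam" neighbors are much closer than the two "ham" neighbors, so their combined weight decides the vote.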

#### Consider the Dataset Size:
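K controls the bias-variance trade-off: a small K (e.g., K=1 or K=3) follows local patterns closely but is sensitive to noise, while a larger K averages over more neighbors and gives smoother, more stable predictions.
Larger datasets can usually support a larger K. A common rule of thumb is to start the search near the square root of the number of training samples and then refine the value with validation.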

#### Cross-Validation:
Utilize techniques like k-fold cross-validation to assess the performance of different K values on various subsets of your data.
Cross-validation helps you estimate how well your model will generalize to unseen data and choose a K value that balances bias and variance.
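
A minimal sketch of this idea uses leave-one-out cross-validation, which is k-fold cross-validation with one sample per fold. The toy data (two clusters plus one deliberately mislabeled point) and the helper names are assumptions for illustration.

```python
import math
from collections import Counter

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def knn_classify(train_X, train_y, query, k):
    order = sorted(range(len(train_X)), key=lambda i: euclidean(train_X[i], query))
    votes = Counter(train_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]

def loo_accuracy(X, y, k):
    """Leave-one-out accuracy: predict each point from all the others."""
    correct = 0
    for i in range(len(X)):
        rest_X = X[:i] + X[i + 1:]
        rest_y = y[:i] + y[i + 1:]
        correct += knn_classify(rest_X, rest_y, X[i], k) == y[i]
    return correct / len(X)

# Two clusters; the point (1.2, 1.1) carries a noisy "b" label.
X = [(1.0, 1.0), (1.5, 1.2), (0.8, 1.4), (1.3, 0.7), (1.2, 1.1),
     (5.0, 5.0), (5.5, 5.2), (4.8, 5.4), (5.3, 4.7)]
y = ["a", "a", "a", "a", "b", "b", "b", "b", "b"]

scores = {k: loo_accuracy(X, y, k) for k in (1, 3, 5)}
best_k = max(scores, key=scores.get)  # first K with the highest accuracy
print(scores, best_k)
```

On this data K=1 is misled by the single noisy label, while K=3 and K=5 outvote it, which is the variance reduction that a larger K buys.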

#### Domain Knowledge:
Consider the domain-specific knowledge and problem characteristics. Some problems may have inherent properties that suggest an appropriate range of K values.

#### Grid Search:
If you are tuning KNN together with other choices, such as the distance metric, the weighting scheme, or preprocessing steps in a pipeline, you can perform a grid search or other hyperparameter optimization to find the best K value along with the other hyperparameters.

#### Distance Metrics:
The choice of distance metric (e.g., Euclidean, Manhattan) can also influence the selection of K. Different metrics may perform better with different K values, so it's a good idea to experiment with multiple combinations.

#### Visualizations:
Visualize your data and the decision boundaries created by different K values. This can provide insights into how K affects the model's behavior.

#### Error Analysis:
Analyze the errors made by the model for different K values. Understanding the types of mistakes it makes can help you select an appropriate K.

Keep in mind that there is no one-size-fits-all solution for choosing K in KNN. The optimal K value may vary from one dataset to another, so it's important to experiment and evaluate different options to find the best K for your specific problem.

### Q3. What is the difference between KNN classifier and KNN regressor?

The primary difference between K-Nearest Neighbors (KNN) classifier and KNN regressor lies in the type of machine learning task they are 
designed for and how they make predictions:

#### KNN Classifier:

* Task:
KNN classifier is used for classification tasks. In classification, the goal is to assign a label or category to a data point based on its similarity to neighboring data points.

* Prediction:
When making predictions, KNN classifier calculates the distances between the new data point and every point in the
training dataset and selects the K closest ones. It then assigns the class label that is most frequent among these K neighbors (i.e., majority vote). The predicted output is a class label representing the category to which the new data point is assigned.

#### KNN Regressor:

* Task:
KNN regressor is used for regression tasks. In regression, the goal is to predict a continuous numerical value (e.g., a real number) for a new data point based on the values of its K nearest neighbors.

* Prediction:
When making predictions, KNN regressor calculates the distances between the new data point and every point in the training dataset and selects the K closest ones. It then
calculates the average (or weighted average) of the target values (numerical values) of these K neighbors. The predicted output is a numerical value representing the estimated target value for the new data point.


In summary, KNN classifier and KNN regressor both use the K-nearest neighbors approach to make predictions, but they differ in the type of output they produce. KNN classifier assigns categorical class labels, while KNN regressor predicts continuous numerical values.

The choice between the two depends on the nature of your machine learning problem:

1. Use KNN Classifier for problems where you want to classify data points into distinct categories or classes (e.g., image classification, spam detection).
2. Use KNN Regressor for problems where you want to predict numerical values (e.g., predicting house prices based on features, estimating a person's age based on other attributes).

It's essential to choose the appropriate variant of KNN based on the problem's nature and the type of output you need.

### Q4. How do you measure the performance of KNN?

To measure the performance of a K-Nearest Neighbors (KNN) model, you can use a variety of evaluation metrics and techniques, depending on 
whether you are working on a classification or regression problem. Here are some common methods for evaluating the performance of KNN:

##### For Classification Problems:

* Accuracy: Accuracy is the most straightforward metric for classification tasks. It measures the proportion of correctly classified data points over the total number of data points in the test dataset. However, accuracy may not be the best choice for imbalanced datasets.

        Accuracy = (Number of Correct Predictions) / (Total Number of Predictions)

* Confusion Matrix: A confusion matrix provides a more detailed view of the model's performance. It breaks down the number of true positives, true negatives, false positives, and false negatives. From the confusion matrix, you can compute metrics such as precision, recall, and F1-score.


* Precision: Precision measures the proportion of true positive predictions among all positive predictions. It is useful when false positives are costly.

        Precision = (True Positives) / (True Positives + False Positives)

* Recall (Sensitivity): Recall measures the proportion of true positive predictions among all actual positive instances. It is useful when false negatives are costly.

        Recall = (True Positives) / (True Positives + False Negatives)

* F1-Score: The F1-score is the harmonic mean of precision and recall. It provides a balanced measure of a model's performance when precision and recall are both important.

        F1-Score = 2 * (Precision * Recall) / (Precision + Recall)

* ROC Curve and AUC: For binary classification problems, you can plot the Receiver Operating Characteristic (ROC) curve and calculate the Area Under the ROC Curve (AUC). AUC measures the model's ability to distinguish between positive and negative classes, and the ROC curve visually shows the trade-off between true positive rate and false positive rate at different decision thresholds.
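
The formulas above for accuracy, precision, recall, and F1-score can be computed directly from the confusion-matrix counts. The sketch below assumes binary labels with 1 as the positive class; the function name and the example predictions are made up for illustration.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall and F1 from binary true/predicted labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)

    accuracy = (tp + tn) / len(pairs)
    # Guard against empty denominators (no positive predictions or no positives).
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}

y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Here there are 3 true positives, 5 true negatives, 1 false positive, and 1 false negative, so accuracy is 0.8 while precision, recall, and F1 are all 0.75.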


##### For Regression Problems:

* Mean Absolute Error (MAE): MAE measures the average absolute difference between the predicted values and the actual target values. It gives equal weight to all errors.

        MAE = (1/n) * Σ|Actual - Predicted|

* Mean Squared Error (MSE): MSE measures the average of the squared differences between predicted values and actual target values. It penalizes larger errors more than MAE.

        MSE = (1/n) * Σ(Actual - Predicted)^2

* Root Mean Squared Error (RMSE): RMSE is the square root of the MSE and provides a more interpretable metric in the same unit as the target variable.

        RMSE = sqrt(MSE)

* R-squared (R2): R2 measures the proportion of the variance in the target variable that is explained by the model. It is at most 1, with higher values indicating a better fit. It equals 0 for a model that always predicts the mean and can be negative for a model that does worse than that.

        R2 = 1 - (MSE(Model) / MSE(Baseline))

Here, the baseline is a model that always predicts the mean of the target values.

* Adjusted R-squared: Adjusted R-squared adjusts the R2 value for the number of predictors in the model. It penalizes the addition of unnecessary predictors.

        Adjusted R2 = 1 - [(1 - R2) * ((n - 1) / (n - k - 1))]

where n is the number of data points and k is the number of predictors (features), not the K of KNN.
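
The regression formulas above translate directly into code. This is a small sketch with made-up numbers; `n_predictors` stands for the number of features (the k in the adjusted R2 formula), and the baseline for R2 is assumed to be the mean of the actual values.

```python
import math

def regression_metrics(actual, predicted, n_predictors):
    """MAE, MSE, RMSE, R2 and adjusted R2 for paired actual/predicted values."""
    n = len(actual)
    errors = [a - p for a, p in zip(actual, predicted)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)

    # Baseline: always predict the mean of the actual values.
    mean_actual = sum(actual) / n
    baseline_mse = sum((a - mean_actual) ** 2 for a in actual) / n
    r2 = 1 - mse / baseline_mse
    adjusted_r2 = 1 - (1 - r2) * (n - 1) / (n - n_predictors - 1)
    return {"mae": mae, "mse": mse, "rmse": rmse, "r2": r2, "adjusted_r2": adjusted_r2}

actual = [3.0, 5.0, 7.0, 9.0, 11.0]
predicted = [2.5, 5.5, 7.0, 8.0, 12.0]
metrics = regression_metrics(actual, predicted, n_predictors=2)
print(metrics)
```

For these values MAE is 0.6, MSE is 0.5, R2 is 0.9375, and adjusted R2 drops to 0.875 because two predictors are charged against only five data points.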


When evaluating the performance of a KNN model, it's essential to consider the specific goals and constraints of your problem, as well as the characteristics of your data. Different metrics may be more appropriate for different scenarios, and you should choose the one that best aligns with your objectives. Additionally, cross-validation can help you get a more reliable estimate of your model's performance by testing it on multiple subsets of your data.

### Q5. What is the curse of dimensionality in KNN?

The "curse of dimensionality" is a phenomenon that arises in various machine learning and data analysis tasks, including K-Nearest Neighbors
(KNN). It refers to the challenges and issues that arise as the dimensionality (the number of features or attributes) of the data increases. 
The curse of dimensionality can have significant implications for the performance and efficiency of algorithms like KNN. 
Here are some key aspects of the curse of dimensionality in the context of KNN:

#### Increased Computational Complexity:
As the number of dimensions (features) in the dataset grows, the cost of KNN increases. Each distance calculation takes time proportional to the number of features, and the tree-based indexes (such as KD-trees) that are often used to speed up neighbor search lose most of their advantage in high dimensions, so prediction tends toward a brute-force scan of the whole training set.

#### Data Sparsity: 
In high-dimensional spaces, data points become increasingly sparse. This means that, as the number of dimensions increases, the data points are spread farther apart from each other. Consequently, it becomes more challenging to find nearby neighbors, which is a fundamental concept in KNN. In extremely high-dimensional spaces, many data points may appear to be equally distant from a query point.
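
The "equally distant" effect can be observed with a small experiment. The sketch below draws uniform random points in a unit hypercube and reports the relative contrast, (farthest - nearest) / nearest, from one random query point. The sample size, the seed, and the chosen dimensions are arbitrary assumptions; `math.dist` requires Python 3.8 or newer.

```python
import math
import random

def relative_contrast(dim, n_points=200, seed=0):
    """(farthest - nearest) / nearest distance from a random query to random points."""
    rng = random.Random(seed)
    query = [rng.random() for _ in range(dim)]
    dists = []
    for _ in range(n_points):
        point = [rng.random() for _ in range(dim)]
        dists.append(math.dist(query, point))
    return (max(dists) - min(dists)) / min(dists)

contrast = {dim: relative_contrast(dim) for dim in (2, 20, 500)}
for dim, value in contrast.items():
    print(f"dim={dim:4d}  relative contrast={value:.2f}")
```

The contrast shrinks as the dimension grows: in 500 dimensions the farthest point is only slightly farther away than the nearest one, so "nearest" carries much less information.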

#### Increased Data Requirements:
To maintain a sufficient density of data points in high-dimensional spaces, you may need a much larger dataset. Gathering enough data to accurately represent the space becomes more challenging as the dimensionality increases.

#### Overfitting: 
High-dimensional spaces can lead to overfitting in KNN. When you have many features relative to the number of data points, the nearest neighbors are often determined by noisy or irrelevant features, so the model may fit the training data extremely well but generalize poorly to new, unseen data.

#### Curse of Choice: 
High dimensionality introduces challenges in selecting meaningful features and reducing the dimensionality of the dataset. Feature selection and dimensionality reduction techniques are often needed to mitigate the curse of dimensionality.




###### To address the curse of dimensionality when using KNN or other machine learning algorithms, you can consider the following strategies:

#### Feature Selection: 
Choose the most relevant features and eliminate irrelevant or redundant ones to reduce dimensionality.

#### Dimensionality Reduction: 
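Techniques like Principal Component Analysis (PCA) can be used to project 
high-dimensional data into a lower-dimensional space while preserving essential information. t-Distributed Stochastic Neighbor Embedding (t-SNE) also produces low-dimensional embeddings, but it is mainly a visualization tool and, in its standard form, does not provide a mapping for new, unseen points, which makes it awkward as a preprocessing step for prediction.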

#### Regularization: 
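KNN has no training objective to which a penalty term could be added, so regularization in the usual sense does not apply. The closest equivalents are increasing K or using distance-weighted neighbors, both of which smooth the predictions, and down-weighting or removing uninformative features.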

#### Feature Engineering: 
Create new features that capture meaningful relationships or patterns in the data, potentially reducing the need for high-dimensional representations.

#### Consider Alternative Algorithms: 
In some cases, it might be more appropriate to use algorithms that are less sensitive to high dimensionality, such as decision trees or ensemble methods.


Overall, the curse of dimensionality highlights the importance of careful feature selection, dimensionality reduction, and model evaluation when working with high-dimensional data in KNN and other machine learning models.

### Q6. How do you handle missing values in KNN?

Handling missing values in the K-Nearest Neighbors (KNN) algorithm requires careful consideration, as missing data can affect the distance
calculations and the accuracy of predictions. Here are some common approaches to dealing with missing values in KNN:

#### Remove Data Points with Missing Values:
One straightforward approach is to remove data points (rows) that contain missing values. This can be a viable option if you have a relatively small number of missing values and removing them doesn't significantly reduce the size of your dataset.
However, this approach can lead to data loss, especially if you have a large number of missing values.

#### Imputation:

Imputation involves filling in missing values with estimated or imputed values. The choice of imputation method depends on the nature of your data:

* Mean/Median Imputation: 
Replace missing numeric values with the mean or median of the non-missing values in the same feature. This method is simple; 
prefer the median for skewed distributions or features with outliers, where the mean can be misleading.

* Mode Imputation: 
Replace missing categorical values with the mode (most frequent category) of the non-missing values in the same feature.

* KNN Imputation: 
Use KNN to impute missing values. For each missing value, find its K nearest neighbors (based on the other features with available data) and use their values to impute the missing one.

* Regression Imputation: 
Fit a regression model (e.g., linear regression) to predict the missing values based on the other features. This method is suitable for cases where the relationships between variables are linear.

* Interpolation: 
For time series or sequential data, you can use interpolation methods to estimate missing values based on adjacent data points.

When using imputation, compute the imputation statistics (means, medians, neighbor sets, regression models) from the training data only and apply them unchanged to the test data to avoid data leakage.
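
One way to keep imputation free of leakage is to learn the fill values from the training rows only and reuse them for the test rows. The sketch below does this for mean imputation; the helper names and the tiny dataset are hypothetical, and missing values are assumed to be encoded as NaN.

```python
import math

def fit_column_means(rows):
    """Per-column mean over the non-missing (non-NaN) entries of the given rows."""
    means = []
    for j in range(len(rows[0])):
        present = [row[j] for row in rows if not math.isnan(row[j])]
        means.append(sum(present) / len(present))
    return means

def impute(rows, fill_values):
    """Replace each NaN with the fill value of its column."""
    return [[fill_values[j] if math.isnan(v) else v for j, v in enumerate(row)]
            for row in rows]

nan = float("nan")
train = [[1.0, 10.0], [3.0, nan], [5.0, 30.0]]
test = [[nan, 20.0], [4.0, nan]]

means = fit_column_means(train)    # learned from the training rows only
train_filled = impute(train, means)
test_filled = impute(test, means)  # test rows reuse the training means
print(means, test_filled)
```

The test set never contributes to the statistics, so the evaluation still reflects how the model would treat genuinely unseen data.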


#### Indicator Variables (Dummy Variables):
In cases where missingness is informative (i.e., missing values convey some information), you can create indicator variables to capture the presence or absence of missing values in each feature.
This approach allows you to preserve the information about missingness while still using the available data.

#### KNN-Based Imputation:
This is the KNN imputation listed above, described in more detail. Because a row with missing values cannot be compared on every feature, distances are computed using only the features that are present in both rows.
For each missing value, find the K nearest rows that have that feature observed and replace the missing value with the average of their values, optionally weighted by the inverse of the distances so that closer neighbors count more.
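
A sketch of KNN imputation in plain Python follows. It makes several assumptions for the sake of brevity: missing values are encoded as NaN, distances use only the features observed in both rows (rescaled to compensate for the skipped ones), and the neighbors' values are averaged without distance weights. The function names are illustrative.

```python
import math

def nan_distance(a, b):
    """Euclidean distance over features present in both rows, rescaled for the
    skipped features. Returns infinity if the rows share no observed feature."""
    pairs = [(x, y) for x, y in zip(a, b) if not (math.isnan(x) or math.isnan(y))]
    if not pairs:
        return math.inf
    squared = sum((x - y) ** 2 for x, y in pairs)
    return math.sqrt(squared * len(a) / len(pairs))

def knn_impute(rows, k=2):
    """Fill each NaN with the mean of that feature over the k nearest donor rows."""
    filled = [list(row) for row in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if not math.isnan(value):
                continue
            # Donors: other rows in which feature j is observed.
            donors = [(nan_distance(row, other), other[j])
                      for m, other in enumerate(rows)
                      if m != i and not math.isnan(other[j])]
            donors.sort(key=lambda d: d[0])
            nearest = [v for _, v in donors[:k]]
            if nearest:  # leave the NaN in place if no donor exists
                filled[i][j] = sum(nearest) / len(nearest)
    return filled

nan = float("nan")
data = [[1.0, 2.0, 10.0],
        [1.1, 2.1, nan],
        [0.9, 1.9, 12.0],
        [8.0, 9.0, 50.0]]
completed = knn_impute(data, k=2)
print(completed[1])
```

The second row is closest to the first and third rows on its observed features, so its missing value is filled with the mean of 10.0 and 12.0 and the distant fourth row is ignored.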

#### Multiple Imputation:
Multiple imputation involves creating multiple imputed datasets with different plausible values for missing data. You perform KNN or other analyses on each imputed dataset and then combine the results to account for the uncertainty introduced by imputation.


The choice of the appropriate method for handling missing values in KNN depends on the nature of your data, the extent of missingness, and the potential impact of missing values on your analysis. It's essential to carefully consider the implications of each approach and perform appropriate validation to ensure that the handling of missing data does not introduce bias or affect the quality of your KNN model's predictions.

### Q7. Compare and contrast the performance of the KNN classifier and regressor. Which one is better for which type of problem?

K-Nearest Neighbors (KNN) classifier and KNN regressor are two variants of the KNN algorithm used for different types of machine learning 
problems: classification and regression. Here, we'll compare and contrast their performance and discuss which one is better suited for 
different types of problems:


#### KNN Classifier:

* Problem Type: 
KNN classifier is used for classification problems, where the goal is to assign data points to predefined classes or categories.

* Output: 
The output of KNN classifier is a class label representing the category to which a data point belongs.

* Evaluation Metrics: 
Classification performance is typically evaluated using metrics like accuracy, precision, recall, F1-score, and the confusion matrix.

* Distance Calculation: 
KNN classifier uses distance metrics (e.g., Euclidean distance) to measure the similarity between data points.

* Applications: 
KNN classifier is suitable for problems such as image classification, spam detection, sentiment analysis, and identifying the species of plants or animals.


#### KNN Regressor:

* Problem Type: 
KNN regressor is used for regression problems, where the goal is to predict continuous numerical values.

* Output: 
The output of KNN regressor is a numerical value representing the estimated target value for a data point.

* Evaluation Metrics: 
Regression performance is evaluated using metrics like Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), R-squared (R2), and others.

* Distance Calculation: 
KNN regressor also uses distance metrics, but it calculates the average (or weighted average) of target values of the nearest neighbors instead of class labels.

* Applications: 
KNN regressor is useful for problems like predicting house prices, stock price forecasting, and estimating the age or income of
individuals based on other attributes.

##### Comparison and Considerations:

* Output Type: 
The most significant difference is the type of output they provide: classification (KNN classifier) or regression (KNN regressor).

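* Evaluation Metrics: 
Because the outputs differ, so do the metrics: classification metrics (accuracy, precision, recall, F1-score) apply to KNN classifier, and error-based metrics (MAE, MSE, RMSE, R2) apply to KNN regressor. Their scores are therefore not directly comparable, and the choice between the two variants depends on the target variable rather than on which one scores higher. If your target variable is categorical, you should use KNN classifier; if it's continuous, use KNN regressor.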

* Data Distribution: 
Consider the distribution of your target variable and the nature of your data. If your target variable has a wide range of continuous values, regression may be more appropriate. If it involves distinct categories or classes, classification is the way to go.

* Overfitting: 
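Both variants are prone to overfitting when K is small or when dealing with high-dimensional data, since the nearest neighbors then reflect noise more than signal. For KNN regressor this shows up as predictions that track noisy individual target values. It's important to consider a larger K, dimensionality reduction techniques, or other algorithms in such cases.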

* Data Interpretability: 
KNN regressor provides continuous numerical predictions, which may be more interpretable for some regression problems. KNN classifier provides categorical class labels.



In summary, the choice between KNN classifier and KNN regressor depends on the problem at hand. Use KNN classifier for classification problems where you need to assign data points to classes, and use KNN regressor for regression problems where you need to predict continuous numerical values. It's crucial to match the problem type and desired output when selecting the appropriate variant of KNN for your machine learning task.

### Q8. What are the strengths and weaknesses of the KNN algorithm for classification and regression tasks, and how can these be addressed?

K-Nearest Neighbors (KNN) is a versatile algorithm with its own strengths and weaknesses for both classification and regression tasks. Understanding these can help you make informed decisions when choosing or using KNN, and address its limitations effectively.

Here's an overview of the strengths and weaknesses of KNN:

#### Strengths of KNN:

1. Simplicity: KNN is conceptually simple and easy to understand. It doesn't require a complex training phase or model, making it a good choice for quick and straightforward implementations.

2. Non-parametric: KNN is non-parametric, meaning it makes no assumptions about the underlying data distribution. This flexibility allows it to work well with various types of data.

3. Versatility: KNN can be applied to both classification and regression problems, making it a versatile algorithm for a wide range of tasks.

4. Adaptability to Complex Decision Boundaries: KNN can capture complex decision boundaries in the data, as it relies on the proximity of data points rather than predefined shapes.

5. Effective for Small Datasets: KNN can perform well when you have a relatively small, low-dimensional dataset, as there are no parameters to estimate and prediction stays cheap when few points are stored.

#### Weaknesses of KNN:

1. Computational Complexity: Calculating distances between data points can be computationally expensive, especially for large datasets or high-dimensional data. This can lead to slow predictions and high memory usage.

2. Sensitivity to Feature Scaling: KNN is sensitive to the scale of features, so it's essential to normalize or scale your data before using it. Otherwise, features with larger scales can dominate the distance calculations.

3. Sensitivity to the Choice of K: The choice of the hyperparameter K (the number of neighbors) can significantly impact KNN's performance. Selecting an appropriate K requires experimentation and validation.

4. Outliers: KNN is sensitive to outliers, as they can disproportionately affect the nearest neighbor calculations. Outliers can lead to incorrect predictions, so it's important to handle them appropriately.

5. Imbalanced Datasets: In classification tasks with imbalanced class distributions, KNN may favor the majority class, especially when K is large, because majority-class points are more likely to appear among the neighbors. Techniques like oversampling, undersampling, or distance-weighted voting can help address this issue.

6. Curse of Dimensionality: KNN performance can degrade in high-dimensional spaces due to the curse of dimensionality. As the number of dimensions increases, the data points become more spread out, making it difficult to find meaningful neighbors.

##### Addressing Weaknesses of KNN:

###### To address the weaknesses of KNN, you can take the following steps:

1. Optimize Hyperparameters: Experiment with different values of K and choose the one that provides the best performance on your validation data.

2. Feature Scaling: Normalize or standardize your features so that they have a consistent scale and no single feature dominates the distance calculations.

3. Dimensionality Reduction: When dealing with high-dimensional data, consider dimensionality reduction techniques like Principal Component Analysis (PCA) or feature selection to reduce the number of features while preserving essential information.

4. Distance Metrics: Experiment with different distance metrics (e.g., Euclidean, Manhattan, cosine distance) to find the one that works best for your specific problem.

5. Outlier Handling: Detect and handle outliers in your dataset using appropriate techniques, such as removing them, transforming the data, or using robust distance metrics.

6. Parallelization: To address computational complexity, you can parallelize the distance calculations for faster predictions on multi-core processors or distributed computing environments.

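7. Ensemble Methods: Combine multiple KNN models, for example by bagging models built on different subsets of the samples or features, to improve predictive performance and reduce sensitivity to the choice of K. If KNN still underperforms, tree-based ensembles like Random Forest or Gradient Boosting are alternative algorithms worth comparing against.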


In summary, while KNN has its strengths and weaknesses, it can be a valuable tool when used appropriately and when its limitations are addressed effectively. Careful preprocessing, hyperparameter tuning, and consideration of the specific characteristics of your data can help you make the most of the KNN algorithm for classification and regression tasks.

### Q9. What is the difference between Euclidean distance and Manhattan distance in KNN?

Euclidean distance and Manhattan distance are two common distance metrics used in the K-Nearest Neighbors (KNN) algorithm and
other machine learning and data analysis tasks. They measure the dissimilarity or similarity between two data points, but they do so in 
different ways. Here are the key differences between Euclidean distance and Manhattan distance:

#### Euclidean Distance:

* Formula: The Euclidean distance between two points, often represented in a two-dimensional space (x, y), is calculated as the square root of the sum of the squared differences between their coordinates.

    Euclidean Distance = √[(x2 - x1)^2 + (y2 - y1)^2]

* In higher dimensions, for two points x = (x1, ..., xn) and y = (y1, ..., yn), the formula generalizes to:

    Euclidean Distance = √[Σ(xi - yi)^2]

* Geometric Interpretation: Euclidean distance measures the straight-line or shortest distance between two points. It corresponds to the length of the hypotenuse in a right triangle formed by the data points' coordinates.

* Sensitivity to Scale: Euclidean distance is sensitive to the scale of the features. If the features have different units or scales, the ones with larger scales can dominate the distance calculation.

#### Manhattan Distance:

* Formula: The Manhattan distance between two points is calculated as the sum of the absolute differences between their coordinates.

    Manhattan Distance = |x2 - x1| + |y2 - y1|

* In higher dimensions, for two points x = (x1, ..., xn) and y = (y1, ..., yn), the formula generalizes to:

    Manhattan Distance = Σ|xi - yi|

* Geometric Interpretation: Manhattan distance measures the distance traveled along the grid or city block. It corresponds to the total number of unit steps you need to take to move from one point to another, moving only horizontally or vertically.

* Sensitivity to Scale: Manhattan distance is also sensitive to the scale of features, because a feature with a larger range contributes larger absolute differences to the sum. Features with different units or scales should be scaled before using either metric.

##### Comparison:

* Euclidean distance squares each coordinate difference, so a large difference in a single feature has a disproportionate effect on the result. It is suitable when you want to capture the straight-line geometric distance between points.

* Manhattan distance, on the other hand, sums the absolute differences, so every unit of difference counts the same regardless of which feature it comes from. This makes it less affected by an outlying value in one feature. It can be more appropriate when you are constrained to moving along a grid-like path, and it is sometimes preferred for high-dimensional data.

The choice between Euclidean and Manhattan distance in KNN depends on the problem and the characteristics of the data. For example, if you're dealing with features that represent physical measurements (e.g., height and weight, after scaling), Euclidean distance may be more appropriate. If you're dealing with features that represent counts (e.g., number of rooms and number of bathrooms), Manhattan distance might be a better choice. These are rules of thumb rather than guarantees.

In practice, you can experiment with both distance metrics and choose the one that works best for your specific problem and dataset. Additionally, Minkowski distance generalizes both: its parameter p gives Manhattan distance at p = 1 and Euclidean distance at p = 2, and intermediate values interpolate between the two.
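
The relationship between the three metrics can be shown with one small function. The sketch below implements Minkowski distance and derives the other two from it; the points `c` and `d` are made-up examples chosen so that the two metrics disagree about which one is closer to the origin.

```python
def minkowski(a, b, p):
    """Minkowski distance of order p between two equal-length points."""
    return sum(abs(x - y) ** p for x, y in zip(a, b)) ** (1.0 / p)

def manhattan(a, b):
    return minkowski(a, b, 1)  # p = 1: sum of absolute differences

def euclidean(a, b):
    return minkowski(a, b, 2)  # p = 2: straight-line distance

origin = (0.0, 0.0)
print(manhattan(origin, (3.0, 4.0)))  # 7.0
print(euclidean(origin, (3.0, 4.0)))  # 5.0

# The metric can change which point counts as the nearest neighbor.
c, d = (3.0, 3.0), (0.0, 5.0)
nearest_manhattan = min((c, d), key=lambda pt: manhattan(origin, pt))
nearest_euclidean = min((c, d), key=lambda pt: euclidean(origin, pt))
print(nearest_manhattan, nearest_euclidean)
```

Point `d` differs from the origin in only one coordinate, so Manhattan distance ranks it closer (5 versus 6), while Euclidean distance ranks the diagonal point `c` closer (about 4.24 versus 5).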

### Q10. What is the role of feature scaling in KNN?

Feature scaling plays a crucial role in the K-Nearest Neighbors (KNN) algorithm and many other machine learning algorithms. 
It is the process of standardizing or normalizing the feature values of your dataset to ensure that all features have a similar scale. 
The role of feature scaling in KNN includes the following aspects:

#### Distance Metric Consistency:
KNN relies on distance metrics (e.g., Euclidean distance, Manhattan distance) to measure the similarity between data points. 
These distance metrics are sensitive to the scale of the features. Features with larger scales can dominate the distance calculations, making the algorithm biased toward those features. Feature scaling ensures that all features contribute equally to the distance calculations, preventing such bias.
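
A small experiment makes the bias concrete. In the sketch below the data, the column meanings (income in dollars, age in years), and the helper names are invented for illustration, and min-max scaling is applied to all rows at once only because this is a toy example. `math.dist` requires Python 3.8 or newer.

```python
import math

def min_max_scale(rows):
    """Rescale every column to the range [0, 1]."""
    columns = list(zip(*rows))
    lows = [min(col) for col in columns]
    highs = [max(col) for col in columns]
    return [[(v - lo) / (hi - lo) for v, lo, hi in zip(row, lows, highs)]
            for row in rows]

def nearest_index(points, query_index):
    """Index of the point closest (Euclidean) to points[query_index]."""
    query = points[query_index]
    others = [i for i in range(len(points)) if i != query_index]
    return min(others, key=lambda i: math.dist(points[i], query))

# Columns: annual income (dollars), age (years). Row 0 is the query.
people = [[50_000.0, 25.0],
          [50_500.0, 60.0],   # similar income, very different age
          [52_000.0, 26.0],   # slightly different income, similar age
          [90_000.0, 40.0]]

raw_neighbor = nearest_index(people, 0)
scaled_neighbor = nearest_index(min_max_scale(people), 0)
print(raw_neighbor, scaled_neighbor)
```

On the raw values the income column dominates, so the 60-year-old in row 1 is the nearest neighbor of the 25-year-old query. After scaling, age contributes on equal terms and row 2 becomes the nearest neighbor.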

#### Improving Model Performance: 
By scaling features, you make it easier for KNN to identify the nearest neighbors accurately. This can lead to improved model 
performance, as the algorithm can more effectively capture the underlying patterns in the data. Properly scaled features can result in a better representation of similarity between data points.

#### Computational Cost: 
KNN has no iterative training procedure, so scaling does not speed up convergence the way it does for gradient-based models. The cost of finding the nearest neighbors is essentially the same with or without scaling; the benefit of scaling lies in the quality of the neighbors found, not in speed.

#### Dimensionality Reduction: 
Scaling does not reduce the number of features, so it does not remove the curse of dimensionality, a problem where the performance of KNN degrades as the number of dimensions (features) increases. It is, however, a prerequisite for variance-based reduction techniques such as PCA, which would otherwise be dominated by the features with the largest scales.
    
##### Common methods for feature scaling include:

* Min-Max Scaling (Normalization): Scales the features to a specific range, usually between 0 and 1. It is defined as:

        Scaled Value = (Value - Min) / (Max - Min)

where Min and Max are the minimum and maximum values in the feature.

* Z-Score Standardization: Standardizes features to have a mean (average) of 0 and a standard deviation of 1. It is defined as:

        Scaled Value = (Value - Mean) / Standard Deviation

where Mean and Standard Deviation are calculated from the feature values.

* Robust Scaling: Scales features by subtracting the median and dividing by the interquartile range (IQR). It is robust to outliers.

        Scaled Value = (Value - Median) / IQR
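
The three formulas above can be tried on a single feature with Python's `statistics` module (Python 3.8 or newer). This sketch assumes the population standard deviation for z-scores and the "inclusive" quartile method for the IQR; other conventions give slightly different numbers. The feature values are made up, with one deliberate outlier.

```python
import statistics

def min_max(values):
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def z_score(values):
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)  # population standard deviation
    return [(v - mean) / std for v in values]

def robust(values):
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return [(v - median) / (q3 - q1) for v in values]

feature = [1.0, 2.0, 3.0, 4.0, 100.0]  # 100.0 is an outlier

print(min_max(feature))
print(z_score(feature))
print(robust(feature))
```

With min-max scaling the outlier squeezes the other four values into a narrow band near 0, whereas robust scaling keeps them evenly spread around 0 and leaves the outlier clearly separated. In a real workflow the minimum, maximum, mean, and quartiles would be computed on the training data and reused for new data.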


The choice of feature scaling method depends on the characteristics of your data and the requirements of your problem. In practice, scale your features before applying the KNN algorithm, using scaling parameters computed from the training data only, so that feature values with different scales do not introduce bias into the distance calculations.