# Measures of Central Tendency

In machine learning and statistics, a measure of central tendency is a single value that describes the center of a dataset. It summarizes the data by identifying a representative value around which the data points tend to cluster. There are three main measures of central tendency: the mean, median, and mode.

* Mean: The mean is the most common measure of central tendency. It is calculated by adding up all the values in the dataset and then dividing by the number of values. The mean is sensitive to outliers; if there are extreme values in the dataset, they can heavily influence the mean.

* Median: The median is the middle value of a dataset once it is sorted in ascending or descending order; when the dataset has an even number of values, it is the average of the two middle values. It is affected far less by extreme values (outliers) than the mean, which makes it a more robust measure of central tendency when outliers are present.

* Mode: The mode is the value that appears most frequently in the dataset. A dataset can have one mode (unimodal), two modes (bimodal), or more than two modes (multimodal). Unlike the mean and median, the mode is not necessarily unique, and a dataset in which every value occurs only once has no meaningful mode.
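
The three definitions above can be sketched in plain Python, without any library. This is a minimal illustration rather than a reference implementation: the helper names `mean`, `median`, and `modes` and the `sample` list are hypothetical and chosen only for this example.

```python
from collections import Counter


def mean(values):
    # Sum of all values divided by how many there are.
    return sum(values) / len(values)


def median(values):
    # Middle value of the sorted data; for an even count,
    # the average of the two middle values.
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


def modes(values):
    # Every value that reaches the highest frequency, in ascending order.
    counts = Counter(values)
    highest = max(counts.values())
    return sorted(v for v, c in counts.items() if c == highest)


sample = [2, 3, 3, 5, 7]  # hypothetical data, for illustration only
print(mean(sample))    # 4.0
print(median(sample))  # 3
print(modes(sample))   # [3]
```

Returning a list from `modes` is a design choice that keeps bimodal and multimodal data visible instead of silently picking one value.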



The choice of which measure of central tendency to use depends on the characteristics of the data and the specific analysis you are performing. In some cases, a combination of these measures might provide a more comprehensive understanding of the data's central tendency.

When working with machine learning algorithms or statistical analysis, it's important to consider these measures to better understand the distribution and characteristics of your data. Each measure provides valuable insights into the data's central behavior, and using them appropriately can help in making more informed decisions in various applications.






## Mean

In [1]:
import numpy as np

In [4]:
ages=[12,15,16,44,12,60,34]
np.mean(ages)

27.571428571428573

In [5]:
## When we have outliers
ages_with_outliers=[12,15,16,44,12,60,34,1000]
np.mean(ages_with_outliers)

149.125

In [6]:
## The mean changes drastically when an outlier is present, so next we use the median

## Median

In [7]:
np.median(ages)

16.0

In [8]:
np.median(ages_with_outliers)

25.0

In [9]:
## The median with the outlier (25.0) is close to the mean without the outlier (about 27.57)
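
To make the comparison concrete, the sketch below measures how far each statistic moves when the single value 1000 is appended. It uses the standard-library `statistics` module in place of NumPy so that it runs without extra dependencies; the variable names `mean_shift` and `median_shift` are introduced here only for illustration.

```python
import statistics

ages = [12, 15, 16, 44, 12, 60, 34]
ages_with_outliers = ages + [1000]

# How much each statistic changes once the outlier is added.
mean_shift = statistics.mean(ages_with_outliers) - statistics.mean(ages)
median_shift = statistics.median(ages_with_outliers) - statistics.median(ages)

print(round(mean_shift, 2))  # 121.55
print(median_shift)          # 9.0
```

The median still moves, because the extra value changes which positions count as the middle, but its shift is an order of magnitude smaller than the shift in the mean.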

## Mode

In [12]:
from scipy import stats as sp

In [13]:
sp.mode(ages)

ModeResult(mode=array([12]), count=array([2]))

In [14]:
sp.mode(ages_with_outliers)

ModeResult(mode=array([12]), count=array([2]))
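
Both results above report a single mode. When several values tie for the highest count, one way to list all of them is `statistics.multimode` from the standard library (Python 3.8 and later). The lists `bimodal_ages` and `no_repeats` below are hypothetical variations on `ages`, made up for this sketch.

```python
from statistics import multimode

# Hypothetical data: the ages list with one extra 34, so 12 and 34 both occur twice.
bimodal_ages = [12, 15, 16, 44, 12, 60, 34, 34]
print(multimode(bimodal_ages))  # [12, 34]

# When no value repeats, every value ties at a count of one and all are returned.
no_repeats = [12, 15, 16]
print(multimode(no_repeats))  # [12, 15, 16]
```

A result as long as the input is a signal that the data has no meaningful mode.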