Statistics is a cornerstone of data analysis, allowing researchers to summarize complex datasets, uncover patterns, and make data-driven decisions.

In physics and other scientific research, statistical tools are indispensable for interpreting experimental results, validating hypotheses, and quantifying uncertainty.

In this section, you’ll learn how to leverage NumPy to calculate essential statistical measures such as mean, median, and standard deviation. Additionally, we’ll explore foundational concepts like hypothesis testing and confidence intervals, equipping you with practical techniques to analyze data effectively and draw meaningful conclusions from your research.

## Calculating Mean, Median, and Standard Deviation Using NumPy

### Mean

The __mean__ is a measure of central tendency that represents the average value of a dataset. It is calculated by summing all values in the dataset and dividing by the number of observations.

In [1]:
import numpy as np

# Example dataset
data = np.array([2, 4, 6, 8, 10])

# Calculate the mean
mean = np.mean(data)

print("Mean:", mean)

Mean: 6.0
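
To connect the definition to the function call, the following minimal sketch recomputes the mean by hand, summing the values and dividing by the number of observations, and compares the result with `np.mean`. The name `manual_mean` is introduced here for illustration only.

```python
import numpy as np

# Same example dataset as above
data = np.array([2, 4, 6, 8, 10])

# Definition of the mean: sum of all values divided by the number of observations
manual_mean = data.sum() / data.size

print("Manual mean:", manual_mean)
print("Matches np.mean:", np.isclose(manual_mean, np.mean(data)))
```

Both approaches should agree; in practice `np.mean` is the more readable choice.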


### Median

The __median__ is the middle value of a dataset once its values are sorted. If the dataset has an odd number of observations, the median is the single middle value. If the dataset has an even number of observations, the median is the average of the two middle values.

In [2]:
# Calculate the median
median = np.median(data)
print("Median:", median)

Median: 6.0
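
The dataset above has an odd number of observations, so it does not exercise the even-length rule. As a small sketch, the hypothetical array `even_data` below (not part of the original example) has four values, and the median is computed by hand as the average of the two middle values after sorting.

```python
import numpy as np

# Hypothetical, unsorted dataset with an even number of observations
even_data = np.array([10, 2, 8, 4])

# Sort the values, then average the two middle ones
sorted_data = np.sort(even_data)
n = sorted_data.size
manual_median = (sorted_data[n // 2 - 1] + sorted_data[n // 2]) / 2

print("Manual median:", manual_median)
print("np.median:", np.median(even_data))
```

Note that `np.median` does not require the input to be sorted beforehand.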


### Standard Deviation

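The __standard deviation__ quantifies the dispersion of a dataset, that is, how far the values typically lie from the mean. A low standard deviation indicates that the data points tend to be close to the mean, while a high standard deviation indicates that they are spread out over a wider range. Note that `np.std` computes the population standard deviation by default (`ddof=0`, dividing by the number of observations N); to obtain the sample standard deviation, which divides by N - 1, pass `ddof=1`.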

For example, as a physics researcher, you may want to calculate the standard deviation of a dataset to understand the variability in repeated measurements of a particle’s velocity or the spread of values in an experiment measuring the acceleration due to gravity. This information helps you evaluate the precision of your measurements, assess the reliability of your data, and determine whether the observed variability is within acceptable limits for drawing meaningful conclusions.

In [3]:
# Calculate the standard deviation
# (np.std defaults to ddof=0, i.e. the population standard deviation)
std_dev = np.std(data)
print("Standard Deviation:", std_dev)

Standard Deviation: 2.8284271247461903
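
When the data are a finite sample of repeated measurements rather than a complete population, the sample standard deviation is commonly reported instead. The sketch below, which assumes the same example dataset, compares the default `np.std` result with the `ddof=1` variant and checks the latter against a manual calculation that divides by N - 1. The variable names are illustrative.

```python
import numpy as np

# Same example dataset as above
data = np.array([2, 4, 6, 8, 10])

# Population standard deviation: divides by N (NumPy's default, ddof=0)
population_std = np.std(data)

# Sample standard deviation: divides by N - 1 (ddof=1)
sample_std = np.std(data, ddof=1)

# Manual check of the sample form: sqrt(sum of squared deviations / (N - 1))
squared_deviations = (data - data.mean()) ** 2
manual_sample_std = np.sqrt(squared_deviations.sum() / (data.size - 1))

print("Population standard deviation:", population_std)
print("Sample standard deviation:", sample_std)
print("Manual check agrees:", np.isclose(sample_std, manual_sample_std))
```

For small datasets such as this one, the difference between the two forms is noticeable; it shrinks as the number of observations grows.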
