# Box Plot

## What is a box plot?
A box plot is a graphical summary of how a data set is distributed, drawn from its quartiles.

Some definitions:
- Minimum: lowest boundary, excluding outliers
- Q1 (first quartile): 25% of the data falls below this value
- Median: Middle value of the data set
- Q3 (third quartile): 75% of the data falls below this value
- Maximum: highest boundary, excluding outliers
- Whiskers: lines extending from the box to the minimum and maximum, covering the data points outside the middle 50% of data (outside of the box)
- IQR (Interquartile range): the distance between Q1 and Q3 (Q3 - Q1), which spans the middle 50% of the data
- Outlier: data point that lies beyond the whiskers, defined as below (Q1-1.5\*IQR) or above (Q3+1.5\*IQR)
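
The definitions above can be sketched in a few lines of Python. This is a minimal illustration and not part of the task: the sample `values` are made up, and the `inclusive` quantile method is an assumption (other quartile conventions give slightly different results).

```python
import statistics

# Hypothetical sample values, chosen only to illustrate the definitions
values = [1, 2, 3, 4, 5, 6, 7, 8, 9, 30]

# Quartiles; method="inclusive" is an assumption (other conventions differ slightly)
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

# Fences of the 1.5 * IQR rule
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# Points beyond the fences are outliers; the rest are kept for the whiskers
outliers = [v for v in values if v < lower_fence or v > upper_fence]
inliers = [v for v in values if lower_fence <= v <= upper_fence]

# Whisker ends: the most extreme points that are not outliers
minimum, maximum = min(inliers), max(inliers)

print(f"Q1={q1}, median={median}, Q3={q3}, IQR={iqr}")
print(f"minimum={minimum}, maximum={maximum}, outliers={outliers}")
```

Here the value 30 lies above the upper fence, so it would be drawn as a separate outlier point while the upper whisker stops at 9.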

More here: [https://en.wikipedia.org/wiki/Box_plot](https://en.wikipedia.org/wiki/Box_plot)

![Alt text](../../resources/images/image_1.png)

## Task
We are tasked with creating box plots for all Pokemon stats in a single graph.

In [None]:
import pandas as pd 
import numpy as np 
import matplotlib.pyplot as plt

In [None]:
df = pd.read_csv(r'../../resources/data/Pokemon.csv')
df.head()

In [None]:
# Generate list of columns 
labels = list(df.columns)[5:11]
print(labels)

# Build one data series per stat for the box plots
data = [df[stat] for stat in labels]


In [None]:
# Use the pandas describe method to review the data
df[labels].describe()

In [None]:
# Create plot
plt.title('Pokemon Stats Box Plots')
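# Note: Matplotlib 3.9+ renames `labels` to `tick_labels`; `labels` still works there but is deprecated
plt.boxplot(data, labels=labels)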

plt.show()
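
As a possible follow-up, the points drawn beyond the whiskers can be counted per stat with the same 1.5\*IQR rule. The sketch below assumes a small made-up DataFrame named `stats` as a stand-in; it does not use the real Pokemon.csv values, and the same steps could be applied to `df[labels]`.

```python
import pandas as pd

# Hypothetical stand-in for a few stat columns; not the real Pokemon.csv data
stats = pd.DataFrame({
    "HP": [45, 60, 80, 39, 58, 78, 44, 250],
    "Attack": [49, 62, 82, 52, 64, 84, 48, 65],
})

# Column-wise quartiles and interquartile range
q1 = stats.quantile(0.25)
q3 = stats.quantile(0.75)
iqr = q3 - q1

# Boolean mask of values outside the 1.5 * IQR fences, per column
is_outlier = (stats < q1 - 1.5 * iqr) | (stats > q3 + 1.5 * iqr)

# Number of outliers in each column
outlier_counts = is_outlier.sum()
print(outlier_counts)
```

In this made-up data only the HP value of 250 falls outside the fences, so `HP` has one outlier and `Attack` has none.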