# 05: Distributions

This notebook demonstrates **data distribution visualizations** using Seaborn.  
We explore the distribution of Iris dataset features and examine differences across species.


In [None]:
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Render figures inline in the notebook
%matplotlib inline

print("Setup Complete")


**Libraries Used:**
- `pandas` for data manipulation
- `matplotlib` for plotting
- `seaborn` for distribution plots such as histograms and kernel density estimate (KDE) plots


In [None]:
iris_data = pd.read_csv("Iris.csv")
# A cell renders only its last expression, so display both previews explicitly
display(iris_data.head())
display(iris_data.tail())


The **Iris dataset** contains measurements of **sepal and petal lengths and widths** for three Iris species (setosa, versicolor, and virginica).  
We will visualize the distributions of these features to understand their patterns.


In [None]:
sns.histplot(data=iris_data, x='PetalLengthCm')


A **histogram** shows the frequency distribution of **Petal Length** across all samples.
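
To make the idea of a frequency distribution concrete, here is a minimal plain-Python sketch of the counting a histogram performs. The helper `histogram_counts`, the sample values, and the choice of four equal-width bins are assumptions made for illustration; Seaborn chooses its own bins, and the numbers are not read from `Iris.csv`.

```python
# Illustrative values only; these are NOT read from Iris.csv.
values = [1.4, 1.5, 1.3, 4.7, 4.5, 5.1, 5.9, 6.0]


def histogram_counts(values, n_bins):
    """Count how many values fall into each of n_bins equal-width bins.

    Assumes the values are not all identical (the bin width must be > 0).
    """
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for v in values:
        # Clamp so the maximum value lands in the last bin.
        idx = min(int((v - lo) / width), n_bins - 1)
        counts[idx] += 1
    edges = [lo + i * width for i in range(n_bins + 1)]
    return edges, counts


edges, counts = histogram_counts(values, n_bins=4)
for left, right, count in zip(edges[:-1], edges[1:], counts):
    print(f"[{left:.2f}, {right:.2f}): {count}")
```

Each bar height in the plot corresponds to one of these counts; changing the number of bins changes how coarse or fine the picture looks.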


In [None]:
sns.jointplot(data=iris_data, x='PetalLengthCm', y='PetalWidthCm', kind='kde')


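A **joint plot** with `kind='kde'` draws the 2D density of **Petal Length vs Petal Width** as contours, with the density of each feature alone along the margins. Tightly packed contours mark the combinations of length and width that occur most often.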
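
A density plot shows where two features occur together, but it does not give a single number for how strongly they move together. As a rough complement, the sketch below computes a Pearson correlation coefficient by hand. The helper `pearson_r` and the paired values are illustrative assumptions, not values taken from `Iris.csv`.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Illustrative (length, width) pairs only; NOT read from Iris.csv.
lengths = [1.4, 1.5, 1.3, 4.7, 4.5, 5.1, 5.9, 6.0]
widths = [0.2, 0.2, 0.2, 1.4, 1.5, 1.9, 2.1, 2.5]

r = pearson_r(lengths, widths)
print(f"Pearson r = {r:.3f}")
```

With a DataFrame loaded, the same quantity should be available through `iris_data['PetalLengthCm'].corr(iris_data['PetalWidthCm'])`, assuming those column names.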


In [None]:
sns.histplot(data=iris_data, x='PetalLengthCm', hue='Species')


Using `hue='Species'`, we can compare the **Petal Length distribution** across different Iris species.
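
Conceptually, `hue` splits the rows by label and draws one distribution per group. The sketch below mimics that grouping step in plain Python and summarizes each group. The labels `species_a` to `species_c` and the values are hypothetical placeholders, not rows from `Iris.csv`.

```python
from collections import defaultdict

# Illustrative (label, petal length) pairs; labels and values are made up,
# not read from Iris.csv.
rows = [
    ("species_a", 1.4), ("species_a", 1.5), ("species_a", 1.3),
    ("species_b", 4.7), ("species_b", 4.5),
    ("species_c", 5.9), ("species_c", 6.0), ("species_c", 5.1),
]

# Split the values by label, as hue does before plotting each group.
groups = defaultdict(list)
for label, length in rows:
    groups[label].append(length)

# Summarize each group so the groups can be compared side by side.
summary = {
    label: {"count": len(vals), "min": min(vals), "max": max(vals)}
    for label, vals in groups.items()
}

for label, stats in summary.items():
    print(label, stats)
```

Groups whose ranges do not overlap would appear as separate clusters of bars in the histogram.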


In [None]:
# `shade` is deprecated in recent Seaborn releases; `fill` is its replacement
sns.kdeplot(data=iris_data, x='PetalLengthCm', hue='Species', fill=True)


A **KDE plot** with shading visualizes the **smoothed distribution** of Petal Length for each species.
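
The smoothing behind a KDE can be sketched in a few lines: place a Gaussian bump on every sample and average them. This is a minimal illustration, not Seaborn's implementation. The helper `gaussian_kde`, the fixed bandwidth of 0.5, and the sample values are assumptions; Seaborn selects a bandwidth automatically unless told otherwise.

```python
import math


def gaussian_kde(samples, bandwidth):
    """Return a function that estimates density as an average of Gaussian bumps."""
    n = len(samples)
    norm = 1.0 / (n * bandwidth * math.sqrt(2.0 * math.pi))

    def density(x):
        return norm * sum(
            math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
        )

    return density


# Illustrative values only; these are NOT read from Iris.csv.
samples = [1.4, 1.5, 1.3, 4.7, 4.5, 5.1, 5.9, 6.0]
density = gaussian_kde(samples, bandwidth=0.5)  # bandwidth is an assumed value

# Sanity check: a density should integrate to about 1 (Riemann sum on [-5, 11]).
step = 0.01
grid = [-5.0 + step * i for i in range(1601)]
area = sum(density(x) for x in grid) * step

print(f"density near the short petals (x=1.4): {density(1.4):.3f}")
print(f"density in the gap (x=3.0): {density(3.0):.3f}")
print(f"approximate area under the curve: {area:.3f}")
```

A smaller bandwidth gives a bumpier curve that follows individual samples, while a larger one smooths over real structure, so the bandwidth plays the same role for a KDE that bin width plays for a histogram.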


## Observations / Results

- The histogram of petal length is bimodal: a group of short petals is clearly separated from a broader group of longer petals.  
- The joint plot shows that **Petal Length** and **Petal Width** increase together, with the density concentrated in distinct clusters rather than spread evenly.  
- Distributions vary by species: *Iris setosa* has the shortest petals, while *versicolor* and *virginica* have longer ones.  
- KDE plots make it easier to compare species-specific distributions and to see where they overlap, as *versicolor* and *virginica* do.
