### Task 1: Exploring and Visualizing a Simple Dataset

The objective of this task is to load, inspect, and visualize the Iris dataset.
You will practice:
- Reading datasets with pandas
- Performing exploratory data analysis (EDA)
- Understanding distributions and relationships
- Visualizing data using matplotlib and seaborn


In [None]:
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

# Apply seaborn's white-grid theme to every plot in the notebook
sns.set_theme(style="whitegrid")


# Load the Iris dataset that ships with seaborn (downloaded and cached on first use)
iris = sns.load_dataset("iris")
iris.head()

# Inspect the dataset

print("Shape of dataset:", iris.shape)
print("\nColumn names:", iris.columns.tolist())

print("\n--- First 5 rows ---")
display(iris.head())

print("\n--- Dataset Info ---")
iris.info()

print("\n--- Summary Statistics ---")
iris.describe()
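
The inspection above relies on reading the `info()` output by eye. As a minimal sketch, the same checks can be made explicit with a small helper. The name `data_quality_report` and its `label_col` parameter are hypothetical and not part of the original notebook; the helper assumes the frame has a class-label column such as `species`.

```python
import pandas as pd


def data_quality_report(df: pd.DataFrame, label_col: str = "species") -> dict:
    """Summarize missing values, duplicate rows, and class balance.

    Hypothetical helper for illustration; not part of the original notebook.
    """
    return {
        # Number of missing entries in each column
        "missing_per_column": df.isna().sum().to_dict(),
        # Rows that exactly repeat an earlier row
        "duplicate_rows": int(df.duplicated().sum()),
        # How many samples belong to each class
        "class_counts": df[label_col].value_counts().to_dict(),
    }
```

In the notebook this would be called as `data_quality_report(iris)`. Returning a plain dictionary keeps the result easy to print or to compare in a later cell.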

In [None]:
# Scatter plot (relationships between features)
plt.figure(figsize=(8, 6))
sns.scatterplot(
    data=iris,
    x="sepal_length",
    y="petal_length",
    hue="species",
    s=80
)
plt.title("Sepal Length vs Petal Length")
plt.show()

# Histograms (feature distributions of the numeric columns)
iris.hist(figsize=(10, 8), bins=20)
plt.suptitle("Feature Distributions", fontsize=16)
plt.show()

# Box plots (outlier detection); seaborn draws one box per numeric column
plt.figure(figsize=(10, 6))
sns.boxplot(data=iris)
plt.title("Box Plots of Iris Features")
plt.show()
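
Box plots mark points beyond the whiskers, which by default sit 1.5 times the interquartile range (IQR) outside the quartiles. As a rough sketch, the same rule can be applied numerically to count flagged values per feature. The helper name `iqr_outlier_counts` and the parameter `k` are assumptions made for illustration, not part of the original notebook.

```python
import pandas as pd


def iqr_outlier_counts(df: pd.DataFrame, k: float = 1.5) -> pd.Series:
    """Count values outside [Q1 - k*IQR, Q3 + k*IQR] for each numeric column.

    Hypothetical helper for illustration; not part of the original notebook.
    """
    # Ignore non-numeric columns such as the species label
    numeric = df.select_dtypes(include="number")
    q1 = numeric.quantile(0.25)
    q3 = numeric.quantile(0.75)
    iqr = q3 - q1
    # Boolean frame: True where a value falls outside the whisker range
    outside = (numeric < q1 - k * iqr) | (numeric > q3 + k * iqr)
    return outside.sum()
```

Calling `iqr_outlier_counts(iris)` returns one count per numeric feature, which can be read alongside the box plots. A flagged value is only a candidate outlier and may be a legitimate measurement.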

### Key Insights
- In the sepal length vs. petal length scatter plot, setosa separates cleanly from the other two species, while versicolor and virginica overlap slightly.
- Petal length and petal width differ markedly between species.
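- Sepal length and sepal width look roughly bell-shaped, while petal length and petal width are bimodal because setosa forms its own cluster. The box plots flag only a few potential outliers, mainly in sepal width.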
- The dataset is clean: `iris.info()` reports 150 non-null entries in every column, so there are no missing values.
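
The insights about differences between species can be backed with numbers rather than plots alone. One possible sketch is a per-species table of means and standard deviations. The helper name `species_summary` and its `label_col` parameter are hypothetical and not part of the original notebook.

```python
import pandas as pd


def species_summary(df: pd.DataFrame, label_col: str = "species") -> pd.DataFrame:
    """Return the per-class mean and standard deviation of each numeric feature.

    Hypothetical helper for illustration; not part of the original notebook.
    """
    # Numeric feature columns, excluding the label column itself
    features = [
        col for col in df.select_dtypes(include="number").columns
        if col != label_col
    ]
    # One row per class, with (feature, statistic) column pairs
    return df.groupby(label_col)[features].agg(["mean", "std"]).round(2)
```

In the notebook, `display(species_summary(iris))` would show the table. Features whose class means lie far apart relative to their standard deviations are the ones that separate the species most clearly.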