π Analyzing Data with Pandas & Visualizing with Matplotlib π Overview
This project demonstrates how to analyze and visualize a dataset using Pandas for data analysis and Matplotlib/Seaborn for data visualization. The classic Iris dataset is used for this assignment.
π― Objectives
Load and explore a dataset (Iris).
Perform basic data analysis (statistics, groupings, patterns).
Create visualizations (line chart, bar chart, histogram, scatter plot).
Handle missing values and errors gracefully.
π οΈ Requirements
Install the dependencies before running:
pip install pandas matplotlib seaborn scikit-learn
π How to Run Jupyter Notebook
Open the provided .ipynb file in Jupyter Notebook or JupyterLab.
Run the cells step by step to see the analysis and visualizations.
Python Script
If you have the .py version:
python analysis.py
π Visualizations Included
Line Chart β Trend of sepal length across samples.
Bar Chart β Average petal length per species.
Histogram β Distribution of sepal width.
Scatter Plot β Sepal length vs petal length (with species color-coded).
π Findings
No missing values in the Iris dataset.
Petal length & width are strong indicators of flower species.
Setosa species clearly separates from others in scatter plots.
Virginica shows larger petal dimensions compared to Versicolor and Setosa.
β Ubuntu-Inspired Principles Applied
Community: Used a globally shared dataset.
Respect: Handled potential errors in loading/cleaning data.
Sharing: Visualizations make findings accessible to others.