Skip to content

MelissaMatindi/iris_dataset_analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

4 Commits
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐ŸŒธ Iris Dataset Analysis

This repository contains a Jupyter Notebook (iris_analysis.ipynb) that explores and analyzes the Iris dataset ๐ŸŒฑโ€”a classic dataset in machine learning and statistics. Using Python (pandas, matplotlib, seaborn, scikit-learn), the notebook demonstrates best practices in data analysis, visualization, and insights sharing.

โœจ Inspired by Ubuntuโ€™s principle of community, this project emphasizes clarity, simplicity, and shareability.


๐Ÿ“Š Project Overview

The Iris dataset includes 150 samples of iris flowers, each described by:

  • 4 numerical features: sepal length, sepal width, petal length, petal width
  • 1 categorical label: species (setosa, versicolor, virginica)

The notebook walks through:

  • โœ… Loading and exploring the dataset
  • ๐Ÿ“ˆ Computing descriptive statistics & grouping by species
  • ๐ŸŽจ Creating four visualizations (line chart, bar chart, histogram, scatter plot)
  • ๐Ÿ”Ž Summarizing key findings (e.g. petal length is a strong discriminator between species)

โš™๏ธ Requirements

  • Python 3.10+
  • Libraries:
    pandas
    matplotlib
    seaborn
    scikit-learn
    jupyter

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published