Skip to content

Jason-W-K/python_wk7_assignment

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

2 Commits
ย 
ย 
ย 
ย 

Repository files navigation

๐ŸŒธ Iris Dataset Analysis

๐Ÿ“ Project Overview

This project explores the classic Iris dataset using Python. It demonstrates foundational data science skills including data loading, cleaning, statistical analysis, and visualization. The goal is to uncover patterns across iris species and present insights through expressive plots and structured code.


๐Ÿ“Œ Tasks Completed

1. Data Loading & Exploration

  • Loaded the Iris dataset via sklearn.datasets.load_iris()
  • Converted it into a pandas DataFrame
  • Inspected structure, data types, and missing values
  • Verified dataset cleanliness (no missing values)

2. Basic Data Analysis

  • Computed descriptive statistics (mean, median, std)
  • Grouped data by species to compare feature averages
  • Identified key patterns in petal and sepal dimensions

3. Data Visualization

  • ๐Ÿ“ˆ Line chart: Sepal vs Petal Length across index
  • ๐Ÿ“Š Bar chart: Average Petal Length per species
  • ๐Ÿ“‰ Histogram: Sepal Width distribution
  • ๐Ÿ”ฌ Scatter plot: Sepal Length vs Petal Length by species

All plots are customized with titles, axis labels, and legends using matplotlib and seaborn.


๐Ÿง  Key Findings

  • Setosa species has the smallest petal and sepal dimensions.
  • Virginica shows the largest petal length and width.
  • Petal dimensions are highly discriminative across speciesโ€”ideal for classification tasks.
  • Visualizations reveal strong clustering potential.

๐Ÿ› ๏ธ Technologies Used

  • Python 3.x
  • pandas
  • numpy
  • matplotlib
  • seaborn
  • scikit-learn

๐Ÿ“‚ File Structure

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages