Skip to content

Ojem2000/Data-Analysis_Pandas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

2 Commits
ย 
ย 
ย 
ย 

Repository files navigation

Iris Dataset Analysis This project performs data loading, cleaning, basic analysis, and visualization using the popular Iris dataset. It is designed for beginners who want to learn pandas, matplotlib, and seaborn for data analysis. ๐Ÿ“‚ Project Structure

  • iris_analysis.py โ€“ Main Python script containing:
    • Data loading (using seaborn's built-in Iris dataset)
    • Data cleaning (handling missing values)
    • Basic statistics and grouping
    • Data visualization (line chart, bar chart, histogram, scatter plot) ๐Ÿ› ๏ธ Requirements Make sure you have Python 3 installed. Then install dependencies using:

pip install pandas matplotlib seaborn โ–ถ๏ธ How to Run

  1. Clone this repository: git clone https://github.com/yourusername/iris-analysis.git cd iris-analysis

  2. Run the script: python iris_analysis.py

  3. View the output directly in your terminal (and see charts pop up in separate windows). ๐Ÿ“Š Features

  • Exploratory Data Analysis: Summarizes dataset statistics, groups by species, and identifies patterns.
  • Visualizations: Line chart, bar chart, histogram, and scatter plot to explore data relationships.
  • Beginner-Friendly Code: Fully commented for easy understanding. ๐Ÿ“ท Example Output Species Avg Petal Length Avg Sepal Length setosa 1.464 5.006 versicolor 4.260 5.936 virginica 5.552 6.588 ๐Ÿค Contributing Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change. ๐Ÿ“œ License This project is licensed under the MIT License - feel free to use and modify.

About

Analyzing Data with Pandas and Visualizing Results with Matplotlib

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages