🌍 Air Pollution Data Analysis

This repository contains a data analysis and visualization notebook built using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. The dataset used contains information about pollution levels across various cities, stations, and countries.

📁 Dataset

The dataset used in this project is a .csv file (example: 3b01bcb8-0b14-4abf-b6f2-c1bfd384ba69.csv) that includes fields such as:

station
city
country
pollutant_avg
latitude
longitude

🔧 Technologies Used

Python
Pandas
NumPy
Matplotlib
Seaborn

📊 Visualizations Included

This project generates multiple types of visualizations to understand the dataset:

Bar Plot – Average pollutant by station
Pie Chart – Distribution of entries by city
Histogram – Distribution of pollutant_avg values
Scatter Plot – Pollutant Average vs Latitude with country-wise hue
Line Plot – Pollutant Average across Longitudes
Correlation Heatmap – Relationship among numeric columns
Box Plot – Pollutant Average distribution per country
Pair Plot – Country-wise scatter relationships
Outlier-Free Box Plot – Refined pollutant averages by country

🧼 Data Cleaning

Missing numeric values are filled with column means.
Categorical missing values are filled with the mode.
Outliers are handled using the IQR (Interquartile Range) method.

📈 Summary Statistics

The notebook also prints descriptive statistics using df.describe() to get insights into the dataset's distribution, central tendency, and spread.

📂 Usage

To run this project:

Clone the repository or download the .ipynb/.py file.
Make sure you have the required libraries installed:

pip install pandas numpy matplotlib seaborn

Replace the CSV path in the read_csv() method with your dataset path.
Run the script or notebook.

🧪 Sample Output

Visual plots for easy understanding of trends
Insights into pollution averages by location
Detection and removal of outliers
Heatmaps showing correlation between numerical columns

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
merged.pdf		merged.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🌍 Air Pollution Data Analysis

📁 Dataset

🔧 Technologies Used

📊 Visualizations Included

🧼 Data Cleaning

📈 Summary Statistics

📂 Usage

🧪 Sample Output

About

Uh oh!

Releases

Packages

Aagaj/pythonproject

Folders and files

Latest commit

History

Repository files navigation

🌍 Air Pollution Data Analysis

📁 Dataset

🔧 Technologies Used

📊 Visualizations Included

🧼 Data Cleaning

📈 Summary Statistics

📂 Usage

🧪 Sample Output

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages