### Project Brief: Analysis of Municipal Waste Management in Urban Regions  

---

#### **Overview**  
Municipal waste is waste collected and treated by or for municipalities. It comes from households, commerce, small businesses, and public institutions, and typically includes yard and garden waste, street sweepings, and the contents of litter containers. It excludes waste from sewage systems and from construction and demolition activities.

This project analyzes municipal waste management across different regions, focusing on costs, waste composition, population characteristics, and municipal infrastructure. The study uses municipal-level data to identify the key factors influencing waste sorting, treatment, and disposal practices. The findings will help students develop insights into sustainable waste management and identify areas for improvement.

---

#### **Data Description**  
The dataset contains municipal-level data covering various regions and provinces. The variables are described below:  

**Demographics and Geography:**  
- **region:** Region of the municipality (e.g., South, Center, North)  
- **province:** Province of the municipality  
- **name:** Name of the municipality  
- **pop:** Total population  
- **pden:** Population density (people per km²)  
- **alt:** Altitude above mean sea level (m)  
- **isle:** Indicator if the municipality is on an island (1: Yes, 0: No)  
- **sea:** Indicator if the municipality is coastal (1: Yes, 0: No)  

**Waste and Urbanization:**  
- **urb:** Urbanization index (ordinal scale from 1: Low to 3: High)  
- **wden:** Waste generation per km²  
- **msw:** Total municipal solid waste generated (kg)  
- **msw_so:** Sorted waste (kg)  
- **msw_un:** Unsorted waste (kg)  
- **sor:** Share of sorted waste (%)  

**Economic Factors and Infrastructure:**  
- **tc:** Cost of waste management per capita (€)  
- **cres:** Residual cost per capita (€)  
- **csor:** Cost of sorted waste per capita (€)  
- **fee:** Fee scheme in place for waste management  
- **d_fee:** Indicator for municipalities using a PAYT (Pay-As-You-Throw) scheme  
- **roads:** Total kilometers of roads within the municipality  
- **proads:** People per kilometer of road (log-transformed)  
- **gdp:** Municipal revenues (€) (log-transformed)  
- **wage:** Taxable income per capita (€) (log-transformed)  

**Waste Composition (%):**  
- **organic:** Organic waste  
- **paper:** Paper waste  
- **glass:** Glass waste  
- **wood:** Wood waste  
- **metal:** Metal waste  
- **plastic:** Plastic waste  
- **raee:** Electronic waste  
- **texile:** Textile waste  
- **other:** Other types of waste  

**Regional Waste Management Indicators:**  
- **s_wteregio:** Share of solid waste sent to Waste-to-Energy (W2E) plants (regional level)  
- **s_landfill:** Share of waste sent to landfills (regional level)  

---

#### **Objectives**  

1. **Analyze Cost Drivers of Waste Management**:
   - Investigate the relationship between **waste management costs per capita (`tc`)** and municipal factors such as **population density (`pden`)**, **urbanization index (`urb`)**, and **waste generation per km² (`wden`)**.
   - Identify which demographic, economic, and geographic factors most significantly influence waste management costs.

2. **Evaluate Waste Sorting Efficiency**:
   - Examine the impact of sorted waste share (`sor`) on the **cost of sorted waste per capita (`csor`)** and overall waste management costs.
   - Determine the role of fee schemes (`fee`, `d_fee`) and regional waste management practices (`s_wteregio`, `s_landfill`) in improving sorting efficiency and reducing costs.

3. **Assess Regional Variations in Waste Management Practices**:
   - Compare waste management costs across different regions (`region`, `province`) and evaluate how **regional infrastructure (`roads`, `proads`)** and **economic indicators (`gdp`, `wage`)** affect these costs.
   - Identify disparities in waste-to-energy usage (`s_wteregio`) and landfill dependency (`s_landfill`) between regions and their impact on waste management efficiency.

4. **Model and Predict Waste Management Costs**:
   - Develop a regression model to predict **waste management costs per capita (`tc`)** based on key demographic, geographic, and economic features.
   - Use feature importance analysis to recommend strategies for reducing costs while maintaining sustainable practices.

5. **Provide Actionable Recommendations**:
   - Based on the analysis, propose data-driven recommendations to optimize waste management costs, improve sorting practices, and encourage sustainable waste treatment methods.
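
As one possible starting point for Objectives 1 and 4, the sketch below fits an ordinary least squares model of `tc` on a few municipal features using only NumPy and pandas. The helper `fit_ols` and the synthetic `demo` data are hypothetical; the coefficients used to generate `demo` are arbitrary and say nothing about the real dataset. Treating `urb` as a numeric feature is a simplification, since it is an ordinal index.

```python
import numpy as np
import pandas as pd


def fit_ols(df, target, features):
    """Least-squares fit with an intercept; returns (coefficients, R^2)."""
    data = df[[target] + features].dropna()
    X = np.column_stack([np.ones(len(data)), data[features].to_numpy(dtype=float)])
    y = data[target].to_numpy(dtype=float)
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    r2 = 1 - (resid ** 2).sum() / ((y - y.mean()) ** 2).sum()
    return pd.Series(coef, index=["intercept"] + features), r2


# Synthetic data for illustration only; not drawn from the project dataset.
rng = np.random.default_rng(0)
n = 200
demo = pd.DataFrame({
    "pden": rng.uniform(50, 5000, n),
    "urb": rng.integers(1, 4, n),  # values 1, 2, 3
})
demo["tc"] = 120 + 0.01 * demo["pden"] + 8 * demo["urb"] + rng.normal(0, 5, n)

coef, r2 = fit_ols(demo, "tc", ["pden", "urb"])
print(coef.round(3))
print(round(r2, 3))
```

The feature list can be extended with `wden` or other columns. Raw coefficients depend on each feature's units, so standardizing the features first makes their magnitudes easier to compare when discussing which factors matter most.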


---

<h1 style="color:yellow;">Your code below 👇</h1>


In [None]:
# import packages
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
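
A first exploratory step toward Objective 3 could be a grouped summary of costs and sorting shares by region. This is a minimal sketch: the helper `summarize_by_group` is hypothetical, and the `toy` values are invented for illustration rather than taken from the dataset.

```python
import pandas as pd

# Toy rows for illustration only: these values are invented.
toy = pd.DataFrame({
    "region": ["North", "North", "South", "South"],
    "tc": [100.0, 200.0, 150.0, 250.0],
    "sor": [60.0, 80.0, 30.0, 50.0],
})


def summarize_by_group(df, group_col, value_cols):
    """Mean, median, and row count of each value column within each group."""
    return df.groupby(group_col)[value_cols].agg(["mean", "median", "count"])


summary = summarize_by_group(toy, "region", ["tc", "sor"])
print(summary)
```

The same helper works with `province` or `d_fee` as the grouping column, which gives a quick view of how costs differ between fee schemes before fitting any model.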