# **2401PTDS Regression Project: Analyzing CO2 Emissions from the Agri-food Sector**

## **Introduction**

In this project, we will explore and analyze the impact of agricultural activities on climate change, focusing on the role of CO2 emissions in the agri-food sector. Agriculture is a major contributor to global greenhouse gas emissions, and understanding these emissions is critical for developing sustainable practices that can mitigate climate change.

This project is part of a larger initiative by a coalition of agricultural stakeholders, including policymakers, environmental organizations, and agricultural businesses. Its primary objective is to analyze how the different emission sources in the agri-food sector relate to climate change, measured here by increases in average temperature.

We will utilize a dataset compiled from reliable sources such as the Food and Agriculture Organization (FAO) and the Intergovernmental Panel on Climate Change (IPCC). This dataset includes information on various emission sources such as:

- **Savanna Fires**: Emissions from fires in savanna ecosystems.
- **Forest Fires**: Emissions from forest fires.
- **Crop Residues**: Emissions from burning or decomposing plant material after harvest.
- **Rice Cultivation**: Emissions from methane release in rice paddies.
- **Drained Organic Soils**: Emissions of CO2 from draining organic soils.
- **Food Transport**: Emissions from the transportation of food products.
- **On-farm Energy Use**: Emissions from energy consumed on farms.
- **Food Packaging**: Emissions from the production and disposal of food packaging materials.

Additionally, we will examine the relationship between these emissions and the **average temperature** of the areas studied, as temperature change is one of the most significant indicators of climate change.

The dataset also includes features such as **forestland**, which serves as a natural carbon sink and offsets part of the emissions from agriculture. Understanding the balance between emissions and carbon sequestration is key to identifying strategies for sustainable agricultural practices.
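
To make the balance between emissions and sequestration concrete, the sketch below sums a handful of source columns and then adds `Forestland`, which is negative in the dataset preview and therefore behaves as a removal. The three rows are copied from the Afghanistan records in the preview later in this notebook. The choice of columns and the names `gross_subset` and `net_subset` are illustrative assumptions, not the project's final feature set.

```python
import pandas as pd

# Values copied from the first three Afghanistan rows of the dataset preview.
sample = pd.DataFrame({
    "Year": [1990, 1991, 1992],
    "Savanna fires": [14.7237, 14.7237, 14.7237],
    "Forest fires": [0.0557, 0.0557, 0.0557],
    "Crop Residues": [205.6077, 209.4971, 196.5341],
    "Rice Cultivation": [686.0, 678.16, 686.0],
    "Forestland": [-2388.803, -2388.803, -2388.803],
})

# Illustrative subset of emission sources (not every source in the dataset).
source_cols = ["Savanna fires", "Forest fires", "Crop Residues", "Rice Cultivation"]

# Gross emissions from the selected sources only; this is NOT total_emission.
sample["gross_subset"] = sample[source_cols].sum(axis=1)

# Forestland is negative, so adding it subtracts the sequestered amount.
sample["net_subset"] = sample["gross_subset"] + sample["Forestland"]

print(sample[["Year", "gross_subset", "net_subset"]])
```

For these rows the forestland sink is larger than the selected sources combined, so `net_subset` comes out negative. The full dataset has many more sources, so this says nothing about the country's overall balance.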

Our goal is to use regression analysis to understand how these emission sources correlate with average temperature increases. By the end of this project, we aim to provide actionable insights and recommendations that will help stakeholders in the agri-food sector mitigate their environmental impact and promote sustainability.
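
As a rough illustration of what the regression step might look like, the sketch below fits an ordinary least squares model with NumPy on five rows copied from the dataset preview. The two features are an arbitrary subset chosen for illustration, and five observations are far too few to support any conclusion. The real analysis would use the full dataset and may rely on a different library or model.

```python
import numpy as np
import pandas as pd

# Five Afghanistan rows copied from the dataset preview (illustration only).
sample = pd.DataFrame({
    "Crop Residues": [205.6077, 209.4971, 196.5341, 230.8175, 242.0494],
    "Rice Cultivation": [686.0, 678.16, 686.0, 686.0, 705.6],
    "Average Temperature °C": [0.536167, 0.020667, -0.259583, 0.101917, 0.37225],
})

features = ["Crop Residues", "Rice Cultivation"]  # illustrative subset
target = "Average Temperature °C"

# Design matrix with a leading column of ones for the intercept.
X = np.column_stack([np.ones(len(sample)), sample[features].to_numpy()])
y = sample[target].to_numpy()

# Ordinary least squares: choose beta to minimise ||X @ beta - y||^2.
beta, *_ = np.linalg.lstsq(X, y, rcond=None)

predictions = X @ beta
residuals = y - predictions

# Map each coefficient to a readable name.
print(dict(zip(["intercept"] + features, beta)))
```

Each coefficient estimates the change in the temperature column per unit change in that feature, holding the other feature fixed. This is an association in the data, not evidence of causation.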


## **Data Loading**

In this section, we load the dataset containing emissions data from the agri-food sector. Using Pandas, we read the CSV file into a DataFrame to examine its structure and the first few records. This allows us to verify the dataset's contents and ensure it's properly loaded for further analysis.




In [3]:
# Import the library needed for loading the data
import pandas as pd

# Load the dataset (adjust the path if the CSV is stored elsewhere)
df = pd.read_csv("co2_emissions_from_agri.csv")

# Display the first 5 rows to understand the structure of the data
df.head()


| Unnamed: 0 | Area | Year | Savanna fires | Forest fires | Crop Residues | Rice Cultivation | Drained organic soils (CO2) | Pesticides Manufacturing | Food Transport | Forestland | ... | Manure Management | Fires in organic soils | Fires in humid tropical forests | On-farm energy use | Rural population | Urban population | Total Population - Male | Total Population - Female | total_emission | Average Temperature °C |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Afghanistan | 1990 | 14.7237 | 0.0557 | 205.6077 | 686.0 | 0.0 | 11.807483 | 63.1152 | -2388.803 | ... | 319.1763 | 0.0 | 0.0 | NaN | 9655167.0 | 2593947.0 | 5348387.0 | 5346409.0 | 2198.963539 | 0.536167 |
| 1 | Afghanistan | 1991 | 14.7237 | 0.0557 | 209.4971 | 678.16 | 0.0 | 11.712073 | 61.2125 | -2388.803 | ... | 342.3079 | 0.0 | 0.0 | NaN | 10230490.0 | 2763167.0 | 5372959.0 | 5372208.0 | 2323.876629 | 0.020667 |
| 2 | Afghanistan | 1992 | 14.7237 | 0.0557 | 196.5341 | 686.0 | 0.0 | 11.712073 | 53.317 | -2388.803 | ... | 349.1224 | 0.0 | 0.0 | NaN | 10995568.0 | 2985663.0 | 6028494.0 | 6028939.0 | 2356.304229 | -0.259583 |
| 3 | Afghanistan | 1993 | 14.7237 | 0.0557 | 230.8175 | 686.0 | 0.0 | 11.712073 | 54.3617 | -2388.803 | ... | 352.2947 | 0.0 | 0.0 | NaN | 11858090.0 | 3237009.0 | 7003641.0 | 7000119.0 | 2368.470529 | 0.101917 |
| 4 | Afghanistan | 1994 | 14.7237 | 0.0557 | 242.0494 | 705.6 | 0.0 | 11.712073 | 53.9874 | -2388.803 | ... | 367.6784 | 0.0 | 0.0 | NaN | 12690115.0 | 3482604.0 | 7733458.0 | 7722096.0 | 2500.768729 | 0.37225 |
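
The preview shows two things to handle before modeling: an `Unnamed: 0` column and missing `On-farm energy use` values in the first rows. The sketch below assumes `Unnamed: 0` is simply a row index that was written out when the CSV was saved. That is a common cause but is not confirmed here. It uses a small in-memory excerpt of the preview (a subset of columns) so that it runs without the data file.

```python
import io

import pandas as pd

# Three preview rows with a subset of columns; the first, unnamed column
# plays the role of the saved row index.
csv_text = """,Area,Year,Savanna fires,On-farm energy use,Average Temperature °C
0,Afghanistan,1990,14.7237,,0.536167
1,Afghanistan,1991,14.7237,,0.020667
2,Afghanistan,1992,14.7237,,-0.259583
"""

# index_col=0 reads the first column as the index instead of
# creating an "Unnamed: 0" data column.
df_sample = pd.read_csv(io.StringIO(csv_text), index_col=0)

# Count missing values per column to see which features need attention.
missing_counts = df_sample.isna().sum()
print(missing_counts)
```

On the real file, the same idea would be `pd.read_csv("co2_emissions_from_agri.csv", index_col=0)` followed by `df.isna().sum()`.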


## **Data Overview**


In [2]:
# Import necessary libraries
import pandas as pd

# Load the dataset (same file as in the Data Loading section)
df = pd.read_csv('co2_emissions_from_agri.csv')

# Data overview
print("Data Shape:", df.shape)  # (number of rows, number of columns)
print("Columns:", df.columns.tolist())  # Column names as a plain list

# Preview the first 5 rows of the dataset
df.head()
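
A mistyped filename only surfaces as a `FileNotFoundError` when the file is read, so a small guard can make the mistake easier to spot. The helper below is a hypothetical convenience function (`load_emissions` is not part of pandas or of this project). It checks that the path exists and, if it does not, lists the CSV files found in the same directory.

```python
from pathlib import Path

import pandas as pd


def load_emissions(path):
    """Read the CSV at `path`, failing early with a hint if it is missing."""
    csv_path = Path(path)
    if not csv_path.is_file():
        # List CSV files in the same directory so a typo is easy to spot.
        candidates = sorted(p.name for p in csv_path.parent.glob("*.csv"))
        raise FileNotFoundError(
            f"{csv_path} not found; CSV files in "
            f"{csv_path.parent.resolve()}: {candidates}"
        )
    return pd.read_csv(csv_path)
```

A call such as `df = load_emissions("co2_emissions_from_agri.csv")` behaves like `pd.read_csv` when the file exists. Otherwise it raises an error that names the CSV files actually present.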