# 🏭 Air Emissions and Climate - OECD data
---
<a href="https://zupimages.net/viewer.php?id=23/11/ittd.jpg"><img src="https://zupimages.net/up/23/11/ittd.jpg" alt="" /></a>

As concern over the state of our planet grows, the need to analyze and understand environmental data becomes more <b>pressing</b>. In this notebook, we will delve into air and climate data, exploring datasets that cover <b>air emission accounts</b>, <b>air transport CO2 emissions</b>, <b>air pollutant emissions</b>, <b>greenhouse gases</b>, and more.

The data contained within this notebook provides insight into the <b>health</b> of our planet's air and atmosphere. With information on a variety of air pollutants and greenhouse gases, we can gain a better understanding of how human <b>activity</b> impacts the environment and what measures can be taken to <b>reduce</b> our carbon footprint.

Through careful analysis of this data, we can identify <b>patterns</b>, <b>correlations</b>, and <b>trends</b> that will help us make <b>informed decisions</b> about environmental policy and sustainable living. By studying these datasets, we can also gain a deeper appreciation for the <b>complexities</b> of the Earth's climate and the importance of taking <b>action</b> to protect it.

Overall, this notebook is a <b>valuable resource</b> for anyone interested in exploring air and climate data. Whether you are a <b>scientist</b>, <b>policy maker</b>, or simply someone concerned about the <b>health of our planet</b>, the insights contained within these datasets are sure to be enlightening and thought-provoking. So, let's dive in and see what we can discover about the air we breathe and the climate we inhabit.

# Imports

In [1]:
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

# Load and clean the data

In [2]:
# Load the data
df = pd.read_csv('/kaggle/input/oecd-air-and-climate/Air Emission Accounts.csv')

# Drop columns that hold only a single value and so carry no information
df = df.drop(['Unit Code', 'Unit', 'PowerCode Code', 'PowerCode', 'Reference Period Code',
              'Reference Period', 'Flag Codes', 'Flags'], axis=1)

# Map each abbreviation to its full definition.
# This assumes every code has exactly one label, so unique() returns both in the same order.
mapping = {}
abv_columns = ['COUNTRY', 'POLLUTANT', 'ACTIVITY', 'MEASURE']
def_columns = ['Country', 'Pollutant', 'Activity ', 'Measure']
for col1, col2 in zip(abv_columns, def_columns):
    for val1, val2 in zip(df[col1].unique(), df[col2].unique()):
        mapping[val1] = val2

# Drop the definition columns now covered by `mapping`; 'YEA' is dropped as redundant with 'Year'
df = df.drop(['Country', 'Pollutant', 'Activity ', 'Measure', 'YEA'], axis=1)

df.head()


Unnamed: 0,COUNTRY,POLLUTANT,ACTIVITY,MEASURE,Year,Value
0,AUS,GHG,IND-TOTAL,TCO2,2009,529836900.0
1,AUS,GHG,IND-TOTAL,TCO2,2010,505860300.0
2,AUS,GHG,IND-TOTAL,TCO2,2011,507211900.0
3,AUS,GHG,IND-TOTAL,TCO2,2012,486884000.0
4,AUS,GHG,IND-TOTAL,TCO2,2013,480808400.0
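The mapping loop above relies on each code and its definition appearing in the same order. A minimal sketch of a stricter alternative, which pairs codes and labels row by row and fails if a code has more than one label, could look like this. The toy frame and the `build_mapping` helper are hypothetical and only stand in for the real file.

```python
import pandas as pd

# Toy stand-in for the real file; rows are illustrative only.
toy = pd.DataFrame({
    "POLLUTANT": ["GHG", "CO2", "GHG", "CH4"],
    "Pollutant": ["Greenhouse gases Total", "Carbon dioxide",
                  "Greenhouse gases Total", "Methane"],
})


def build_mapping(frame, code_col, label_col):
    """Return {code: label}, raising if a code is paired with several labels."""
    # Keep one row per distinct (code, label) pair
    pairs = frame[[code_col, label_col]].drop_duplicates()
    # A code that still appears twice has conflicting labels
    if pairs[code_col].duplicated().any():
        raise ValueError(f"{code_col} maps to several labels in {label_col}")
    return dict(zip(pairs[code_col], pairs[label_col]))


pollutant_names = build_mapping(toy, "POLLUTANT", "Pollutant")
print(pollutant_names)
```

Because the pairs come from the same rows, the result does not depend on the order in which values first appear.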


# Data exploration

## Number of unique values

In [3]:
print('Columns: N uniques\n')
for col in df.columns:
    print(f'{col}: {df[col].nunique()}')

Columns: N uniques

COUNTRY: 40
POLLUTANT: 16
ACTIVITY: 113
MEASURE: 5
Year: 13
Value: 136915
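The same `nunique` call can also find single-valued columns programmatically instead of listing them by hand. This is a minimal sketch on a made-up frame; the column contents are assumptions and not taken from the real file.

```python
import pandas as pd

# Made-up frame: 'Unit' repeats one value and 'Flags' is entirely empty.
toy = pd.DataFrame({
    "COUNTRY": ["AUS", "AUT", "BEL"],
    "Unit": ["Tonnes", "Tonnes", "Tonnes"],
    "Flags": [None, None, None],
    "Value": [1.0, 2.0, 3.0],
})

# dropna=False counts a missing value as a value, so an all-empty column scores 1
n_unique = toy.nunique(dropna=False)
constant_cols = n_unique[n_unique <= 1].index.tolist()

# Drop every column that cannot distinguish one row from another
trimmed = toy.drop(columns=constant_cols)
print(constant_cols)
print(trimmed.columns.tolist())
```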


## Pollutants

In [4]:
for pol in df.POLLUTANT.unique():
    print(f'{pol}: {mapping[pol]}')

GHG: Greenhouse gases Total 
CO2: Carbon dioxide
CH4: Methane
N2O: Nitrous oxide
HFC: Hydrofluorocarbons
PFC: Perfluorocarbons
SF6: Sulphur hexafluoride
PM2_5: Particulates (less than 2.5µm)
CO: Carbon monoxide
NMVOC: Non-methane volatile organic compounds
NOX: Nitrogen oxides
SOX: Sulphur oxides
O3PR: Ozone precursors (NMVOC, NOX in NMVOC equivalent, CO in NMVOC equivalent, CH4 in NMVOC equivalent)
ACG: Acidifying gases (SOX in SO2 equivalent, NOX in SO2 equivalent, NH3 in SO2 equivalent)
NH3: Ammonia
PM10: Particulates (less than 10µm)


### Information

- **GHG** - Gases that trap heat in the atmosphere, contributing to the greenhouse effect and climate change.

- **CO2** - A colorless, odorless gas emitted through the burning of fossil fuels and other activities, responsible for a significant portion of the greenhouse effect.

- **CH4** - A colorless, odorless gas and the main component of natural gas, emitted by fossil fuel production, livestock, and landfills, contributing to the greenhouse effect.

- **N2O** - A gas emitted through agricultural and industrial activities, contributing to the greenhouse effect.

- **HFC** - Synthetic gases used as refrigerants, solvents, and aerosol propellants, with high global warming potential.

- **PFC** - Synthetic gases used in the electronics industry and other applications, with high global warming potential.

- **SF6** - A synthetic gas used in electrical equipment, with high global warming potential.

- **PM2_5** - Fine particles smaller than 2.5 µm, typically emitted by combustion and other industrial activities, that can harm human health and contribute to climate change.

- **CO** - A poisonous gas produced through incomplete combustion of fossil fuels, contributing to air pollution.

- **NMVOC** - Organic compounds that easily evaporate into the air, contributing to the formation of ground-level ozone and other air pollutants.

- **NOX** - Gases emitted from combustion processes, contributing to smog and acid rain.

- **SOX** - Gases emitted from burning coal and oil, contributing to acid rain and respiratory problems.

- **O3PR** - Substances that contribute to the formation of ground-level ozone, including NMVOC, NOX, CO, and CH4.

- **ACG** - Gases that contribute to acidification of soil and water, including SOX, NOX, and NH3.

- **NH3** - A gas emitted from agricultural activities and other sources, contributing to air pollution and acidification.

- **PM10** - Particles smaller than 10 µm, typically emitted by construction and other activities, that can harm human health and contribute to climate change.
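Three of these codes (GHG, O3PR, ACG) are totals built from other pollutants, so summing every row would count some emissions twice. A minimal sketch of a grouping helper follows; the group names and the `pollutant_group` function are our own labels, derived from the descriptions above, and the rows are made up.

```python
# Grouping derived from the descriptions above; the group names are our own labels.
AGGREGATES = {"GHG", "O3PR", "ACG"}
GREENHOUSE_GASES = {"CO2", "CH4", "N2O", "HFC", "PFC", "SF6"}


def pollutant_group(code):
    """Classify a pollutant code as an aggregate, a greenhouse gas, or an air pollutant."""
    if code in AGGREGATES:
        return "aggregate"
    if code in GREENHOUSE_GASES:
        return "greenhouse gas"
    return "air pollutant"


# Made-up rows: (pollutant code, value)
rows = [("GHG", 100.0), ("CO2", 80.0), ("CH4", 20.0), ("NOX", 5.0)]

# Keep only individual pollutants so the totals are not counted twice
individual = [(code, value) for code, value in rows
              if pollutant_group(code) != "aggregate"]
print(individual)
```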

## Measures

In [5]:
for measure in df.MEASURE.unique():
    print(f'{measure}: {mapping[measure]}')

TCO2: tonnes of CO2-equivalent
T: tonnes of pollutant
TNO2: tonnes of NO2-equivalent
TNMVOC: tonnes of NMVOC equivalent
TSO2: tonnes of SO2 equivalent


### Information

**The most commonly used metrics are:**

- TCO2 expresses emissions of a greenhouse gas, or of all greenhouse gases together, as the mass of CO2 with the same warming effect.
- T expresses the mass of a single pollutant emitted during the year, with no equivalence factor applied.
- TNMVOC expresses ozone precursors (NMVOC, NOX, CO, and CH4) as the mass of NMVOC with the same ozone-forming potential.

**TNO2** (tonnes of NO2-equivalent) and **TSO2** (tonnes of SO2 equivalent) are less commonly used metrics. TSO2 is used for acidifying gases (see ACG above), which makes it relevant for acid rain assessments, while TNO2 is relevant for air quality management.

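**A note on conversions between these metrics:**

The Global Warming Potential (GWP) is the factor that converts the warming effect of a given mass of a greenhouse gas into the mass of CO2 with the same effect. It therefore only applies when converting a greenhouse gas from T to TCO2. TNO2 and TSO2 are equivalence units for nitrogen oxides and acidifying gases, not for warming, so they have no GWP-based conversion to TCO2.

For example, with the 100-year GWP values of the IPCC Fourth Assessment Report:

- 1 T of CH4 = 25 TCO2
- 1 T of N2O = 298 TCO2 (N2O is nitrous oxide, not to be confused with the NO2 of TNO2)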
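A GWP-based conversion applies to a mass of a greenhouse gas. Here is a minimal sketch, assuming the 100-year GWP values of the IPCC Fourth Assessment Report (AR4). The dataset may rely on a different GWP set, and the `to_tco2e` helper is hypothetical.

```python
# 100-year GWP values from the IPCC Fourth Assessment Report (AR4).
# Assumption: the dataset itself may use a different GWP set.
GWP_AR4 = {"CO2": 1, "CH4": 25, "N2O": 298, "SF6": 22800}


def to_tco2e(tonnes, gas, gwp=GWP_AR4):
    """Convert tonnes of a greenhouse gas to tonnes of CO2-equivalent."""
    if gas not in gwp:
        # Pollutants such as NOX or SOX have no GWP in this table
        raise KeyError(f"no GWP available for {gas!r}")
    return tonnes * gwp[gas]


print(to_tco2e(10, "CH4"))
print(to_tco2e(2, "N2O"))
```

Asking for a pollutant without a GWP, such as NOX, raises an error instead of returning a misleading number.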

# TO CONTINUE