# **Project 2 - Power Generation Data Set**
* Jonathan Chartrand and Robert Boutette

[Data source](https://www.kaggle.com/datasets/ccanb23/iea-monthly-electricity-statistics)
![Energy Consumption](https://drive.google.com/file/d/12lsszlB6pnYUz13XAnn3iywbazUI7GE1/view?usp=drive_link)

The dataset analyzed in this project comes from the International Energy Agency (IEA) Monthly Electricity Statistics tool. It covers monthly electricity production by country from 2010 to 2022, measured in gigawatt-hours (GWh), across a wide range of sources: hydro, wind, solar, geothermal, nuclear, and both renewable and non-renewable combustible fuels. The data was compiled with a dedicated scraper available on GitHub, and it supports analysis of how the generation mix shifts across regions and over time.

In [2]:
import pandas as pd

# Load the data
df = pd.read_csv('data.csv')

# Print the number of rows and columns
print(f'Number of rows: {df.shape[0]}')
print(f'Number of columns: {df.shape[1]}')

# Print the data types of the columns
print('\nData types:')
print(df.dtypes)

# Display the first 5 rows
print('\nFirst 5 rows:')
print(df.head())

Number of rows: 181915
Number of columns: 12

Data types:
COUNTRY                object
CODE_TIME              object
TIME                   object
YEAR                    int64
MONTH                   int64
MONTH_NAME             object
PRODUCT                object
VALUE                 float64
DISPLAY_ORDER           int64
yearToDate            float64
previousYearToDate    float64
share                 float64
dtype: object

First 5 rows:
     COUNTRY CODE_TIME          TIME  YEAR  MONTH MONTH_NAME  \
0  Australia   JAN2010  January 2010  2010      1    January   
1  Australia   JAN2010  January 2010  2010      1    January   
2  Australia   JAN2010  January 2010  2010      1    January   
3  Australia   JAN2010  January 2010  2010      1    January   
4  Australia   JAN2010  January 2010  2010      1    January   

                   PRODUCT      VALUE  DISPLAY_ORDER  yearToDate  \
0                    Hydro    990.728              1   16471.891   
1                     Wind    40

The dataset records how much electricity countries around the world produced each month from 2010 to 2022 and which sources they relied on: water (hydro), wind, the sun (solar), heat from the ground (geothermal), and nuclear power.

It also tracks production from conventional sources such as coal, oil, and natural gas, collectively known as fossil fuels. Beyond production volumes, it reports the mix of sources (renewables vs. non-renewables), electricity imports and exports, distribution losses, and final consumption.
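The renewables vs. non-renewables mix can be computed directly from the `PRODUCT` and `VALUE` columns. The following is a minimal sketch on a small hand-made frame, not the real data: all values are invented for illustration, and the `Solar` and `Coal` labels, the `RENEWABLE` set, and the `renewable_share` helper are assumptions about how products might be named and grouped. Only `Hydro` and `Wind` appear in the output shown earlier.

```python
import pandas as pd

# Toy rows mimicking the dataset's layout; all values are illustrative only.
toy = pd.DataFrame(
    {
        "COUNTRY": ["Australia"] * 4,
        "YEAR": [2010] * 4,
        "MONTH": [1] * 4,
        "PRODUCT": ["Hydro", "Wind", "Solar", "Coal"],
        "VALUE": [100.0, 50.0, 50.0, 300.0],  # GWh
    }
)

# Assumed labels for renewable products; compare against df["PRODUCT"].unique().
RENEWABLE = {"Hydro", "Wind", "Solar", "Geothermal"}


def renewable_share(frame: pd.DataFrame) -> pd.Series:
    """Return the renewable fraction of generation per country and month."""
    keys = ["COUNTRY", "YEAR", "MONTH"]
    total = frame.groupby(keys)["VALUE"].sum()
    renewable = frame[frame["PRODUCT"].isin(RENEWABLE)].groupby(keys)["VALUE"].sum()
    # Groups with no renewable rows are absent from `renewable`; count them as 0.
    renewable = renewable.reindex(total.index, fill_value=0.0)
    return (renewable / total).rename("renewable_share")


shares = renewable_share(toy)
print(shares)
```

If the real file also contains aggregate categories alongside individual sources, those rows would need to be filtered out first, otherwise the totals would double count.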

The data covers a broad range of countries, from Argentina to the United States, so it reflects both local and global trends in energy production. Each row gives the country, the month and year, the type of energy produced (`PRODUCT`), and its quantity in GWh (`VALUE`). This breakdown shows how energy production has evolved over the years and how the contributions of individual countries differ.
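Because each row carries a country, a month, a product, and a value in GWh, building a per-country time series is a short reshaping step. The sketch below uses made-up rows with column names taken from the `dtypes` listing above. Parsing `TIME` with the `%B %Y` format is an assumption based on the `January 2010` values in the printed rows, and the `DATE` column is introduced here for illustration.

```python
import pandas as pd

# Made-up rows in the dataset's layout; values are illustrative, not real data.
toy = pd.DataFrame(
    {
        "COUNTRY": ["Australia", "Australia", "Canada", "Canada"],
        "TIME": ["January 2010", "February 2010", "January 2010", "February 2010"],
        "PRODUCT": ["Hydro"] * 4,
        "VALUE": [10.0, 20.0, 30.0, 40.0],  # GWh
    }
)

# Convert "January 2010" style strings into timestamps (first day of the month).
toy["DATE"] = pd.to_datetime(toy["TIME"], format="%B %Y")

# One row per month, one column per country, restricted to a single product.
hydro = toy[toy["PRODUCT"] == "Hydro"].pivot_table(
    index="DATE", columns="COUNTRY", values="VALUE", aggfunc="sum"
)

# Annual totals per country, a starting point for year-over-year comparisons.
annual = hydro.groupby(hydro.index.year).sum()

print(hydro)
print(annual)
```

The wide layout (dates as rows, countries as columns) is one reasonable choice for plotting and comparison; keeping the data in long form and grouping on `COUNTRY` and `YEAR` would work equally well.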

In short, the dataset is a monthly record of where electricity has come from, how much of it has been used, and how these patterns have changed between 2010 and 2022. That makes it a useful starting point for anyone studying global energy production and the challenges and opportunities involved in moving toward sustainable and reliable energy sources.