Global_sustainable_energy

Detailed analysis of global sustainable energy data spanning from 2000 to 2020, sourced from Kaggle. The code encompasses essential tasks such as data cleaning, visualization, correlation analysis, and developing a linear regression model for predicting CO2 emissions in Nepal.

Important

Most output could be generated using simple functions, But I have used loops to understand better key concepts of reproducible programming.

the format of files folders is:

├───figs
├───raw_data
└───rendered_document_files
    ├───figure-html
    └───libs
        ├───bootstrap
        ├───clipboard
        ├───htmlwidgets-1.6.2
        ├───jquery-1.12.4
        ├───leaflet-1.3.1
        │   └───images
        ├───leaflet-binding-2.1.2
        ├───leafletfix-1.0.0
        ├───proj4-2.6.2
        ├───Proj4Leaflet-1.0.1
        ├───quarto-html
        └───rstudio_leaflet-1.3.1
            └───images

Packages used

-library(tidyverse)

library(here)
library(janitor)
library(rnaturalearth)
library(sf)
library(kableExtra)

Key Steps:

Data Cleaning:
- The code reads energy data from a CSV file, checks for duplicate rows, and ensures correct data types for each column.
- Column names are shortened to enhance model interpretation.
Visualization:
- The code maps global energy data onto a world map using the rnaturalearth and sf libraries.
- Correlation analysis is performed to understand associations between variables.
Country-Specific Analysis (Nepal):
- Nepal's energy data is isolated, and trends for various variables are visualized over the years.
- Columns with all NA values and redundant data are removed.
Regression Modeling:
- A linear regression model is built to predict CO2 emissions in Nepal.
- Step-wise regression is performed to select significant predictors.
- Feature selection results in six significant variables influencing CO2 emissions.
- The dataset is split into training and testing sets, and a linear regression model is fitted on the training data.
Model Evaluation:
- The accuracy of the model is assessed using Mean Absolute Error (MAE), normalized for better interpretation.
- The code also includes a forecast using a naive method and evaluates its accuracy.
Summary and Documentation:
- The code provides a comprehensive summary of the entire analysis, making it suitable for inclusion in a readme.md file.
- Key findings, model accuracy, and forecasting results are highlighted.

This code serves as a robust framework for exploring, analyzing, and modeling sustainable energy data, specifically tailored for the case of Nepal.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
raw_data		raw_data
rendered_document_files/figure-html		rendered_document_files/figure-html
.gitignore		.gitignore
README.md		README.md
energy_data.Rproj		energy_data.Rproj
rendered_document.R		rendered_document.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Global_sustainable_energy

Important

the format of files folders is:

Packages used

Key Steps:

About

Releases

Packages

Languages

Sujan-Bhattarai12/global_sustainable_energy

Folders and files

Latest commit

History

Repository files navigation

Global_sustainable_energy

Important

the format of files folders is:

Packages used

Key Steps:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages