Using the World Bank datasets, download and examine the effect of income group of a country on its GDP value. This includes downloading and cleansing of the data and then perform the analysis.
- Folders: -
- source: - This folder contains the R source code used for tidying the raw datasets.
- graphs: - This folder contains all the graphs that are generated during this study.
- data: - This folder contains raw data files, the tidied files and merged file.
- Files: -
- data/edu.csv: - This is the raw educational statistics file from World Bank containing some of the internationally comparable indicators that describes education access, progression, completion, literacy, teachers, population, and expenditures. The indicator that we will be using in this study is the Income Group.
- data/gdp.csv: - This is the raw GDP file from World Bank which contains GDP data of 190 different countries in the world. 3.data/cleaned_EDU_Data.txt: - This is the tidy version of education statistics file (edu.csv) generated by the execution of the R code 'EDU_data.R'.
- data/cleaned_GDP_Data.txt: - This is the tidy version of GDP data file (gdp.csv) generated by the execution of the R code 'GDP_data.R'.
- data/merged_Data.txt: - This file contains the merged data of the two files cleaned_EDU_Data and cleaned_GDP_Data, merged by Country Code.
- graphs/CountryCode_vs_GDP.png: - This is the plot of the GDP for all the countries. Countries with different income group have different colors.
- graphs/Modified CountryCode_vs_GDP.png: - This is the plot splitted vertically to represent the GDP of countries with same income group in each splitted section.
- source/EDU_data.R: - R code that performs the tidying of the raw education statistics file (edu.csv).
- source/GDP_data.R: - R code that performs the tidying of the raw GDP data file (gdp.csv).
- README.md: - This file includes an explanation of the purpose of the project and the other files.
- README.Rmd: - The R markdown file that creates the README.md file.
- report.md: - This markdown file provides the details of the analysis performed on the cleaned data files including the introduction, analysis code, output of the code and conclusion derived from the analysis.
- report.Rmd: - The R markdown file that creates the report.md file.
- Makefile: - It lists all the steps to regenerate this study aimed to find relationship between the income group of a country with its GDP value.
- GDP.Rproj: - This is the R project file for this project.
- .gitignore: - This file specifies intentionally untracked files that Git should ignore.
## R version 3.3.2 (2016-10-31)
## Platform: x86_64-w64-mingw32/x64 (64-bit)
## Running under: Windows 10 x64 (build 14393)
##
## locale:
## [1] LC_COLLATE=English_Canada.1252 LC_CTYPE=English_Canada.1252
## [3] LC_MONETARY=English_Canada.1252 LC_NUMERIC=C
## [5] LC_TIME=English_Canada.1252
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## loaded via a namespace (and not attached):
## [1] backports_1.0.4 magrittr_1.5 rprojroot_1.1 tools_3.3.2
## [5] htmltools_0.3.5 yaml_2.1.14 Rcpp_0.12.8 stringi_1.1.2
## [9] rmarkdown_1.3 knitr_1.15.1 stringr_1.1.0 digest_0.6.10
## [13] evaluate_0.10