Benjamin Snow and Oliver Bott
This is the repository for the 'Collaborative Social Science Data Analysis' course taught by @christophergandrud at the Hertie School of Governance. This is the repository for Assignment 3, which focuses on the data gathering, cleaning, and early analyses for a final research project.
The assignment calls for: gathering web-based data from at least two sources, as well as cleaning and merging said data. Additionally basic descriptive and inferential statistics should be performed towards the motive of answering the previous assignments research question. Lastly a description of results with accompanying tables and figures should be included.
The proposal can be found in the Assignment3.Rmd file and is structured as follows:
- Background
- Data Gathering
- Data Sources
- Data Selection
- Descriptive Statistics
- Inferential Statistics
- Outlook
- References
The proposal includes dynamically linked academic references (see Main.bib) and citations for R packages (see Packages.bib).
The project proposal can be found in Assignment.Rmd.
- Data_Gathering.R file contains the code for the gathering and cleaning of the two data sources and is dynamically linked to the Assignment3.Rmd file.
- Dataset.csv contains the merged .csv dataset for the project which is dynamically linked to the Assignment.Rmd file. Due to the length of time gathering data using the Github API takes, this was deemed more efficient and easier than repeated pulling in of data.
- License is a basic MIT license for the project.
- Main.bib is the main bibliographic file which is dynamically linked to Assignment3.Rmd.
- Packages.bib is a bibliographic file dynamically linked to Proposal.Rmd with citations for R and R packages.
- README.md is the file for the file currently being read, which offers introduction and context to the project, as interestingly as possible.
Comments and feedback are encouraged, and can be sent to [Benjamin Snow](mailto: benjamin.aaron.snow@gmail.com(benjamin.aaron.snow@gmail.com)) or Oliver Bott.