This repository contains an R project to analyze sessional paper data for the Large Government of Canada IT projects website.
The primary repository for the website is located here.
This repository includes:
- Table extraction from an MS Word adaptation of the source PDF file
- Data cleanup and parsing (supporting several manual spreadsheet-editing steps)
- Merging the extracted data with the 2016 and 2019 data, which are already in CSV format
- Combining the three years' datasets and grouping related projects for analysis
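The combining and grouping steps listed above can be sketched roughly as follows. This is a minimal illustration in base R, not the code used in this repository: the function names (`combine_years`, `count_years_per_project`), the column names (`project_name`, `total_budget`, `source_year`), the `"latest"` label, and the sample values are all hypothetical, and the sketch assumes every year's data frame has the same columns.

```r
# Hypothetical sketch: stack per-year datasets and group rows by project.
# All names and values below are assumptions for illustration only.

combine_years <- function(datasets) {
  # datasets: a named list of data frames; each name is used as the year label
  tagged <- Map(function(df, label) {
    df$source_year <- label
    df
  }, datasets, names(datasets))
  combined <- do.call(rbind, tagged)
  rownames(combined) <- NULL  # drop the "2016.1"-style row names added by rbind
  combined
}

count_years_per_project <- function(combined) {
  # Group rows that share a project name and count how many years each appears in
  counts <- aggregate(source_year ~ project_name, data = combined, FUN = length)
  names(counts)[names(counts) == "source_year"] <- "years_reported"
  counts
}

# Made-up sample data standing in for the three datasets
datasets <- list(
  "2016" = data.frame(project_name = c("Project A", "Project B"),
                      total_budget = c(10, 20), stringsAsFactors = FALSE),
  "2019" = data.frame(project_name = c("Project A", "Project C"),
                      total_budget = c(12, 5), stringsAsFactors = FALSE),
  "latest" = data.frame(project_name = c("Project A", "Project B"),
                        total_budget = c(15, 22), stringsAsFactors = FALSE)
)

combined <- combine_years(datasets)
counts <- count_years_per_project(combined)
print(counts)
```

In practice, related projects may not share an exact name across years, so a real grouping step would likely need a manually maintained lookup key rather than a direct match on the name column.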
This repository also includes the logic determining each “estimated status” value.
For more information, see each individual R file. See the helpers.R file for the set of libraries used in this project.
An Ottawa Civic Tech project
This is a volunteer project and is not affiliated with the Government of Canada.