Converting Toronto demographic data to Hive tables, via R

Data downloaded in CSV format.

"The Census of Population is held across Canada every 5 years and collects data about age and sex, families and households, language, immigration and internal migration, ethnocultural diversity, Aboriginal peoples, housing, education, income, and labour. City of Toronto Neighbourhood Profiles use this Census data to provide a portrait of the demographic, social and economic characteristics of the people and households in each City of Toronto neighbourhood. The profiles present selected highlights from the data, but these accompanying data files provide the full data set assembled for each neighbourhood."

The original CSV was explored, cleaned, and restructured in Excel and R so that it can be loaded into Hive as seamlessly as possible. A comprehensive walkthrough is at
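
As a rough illustration of the kind of restructuring described above, the sketch below reshapes a wide table (one column per neighbourhood) into a long one and writes it in a form Hive's default text format can read. It is not the code used in this repository: the toy data, the column names (`Characteristic`, `Neighbourhood A`, `Neighbourhood B`), and the output file name `toronto_profiles.tsv` are all assumptions made for the example, and the real file's layout may differ.

```r
# Toy stand-in for the downloaded file (hypothetical names and values).
raw <- data.frame(
  Characteristic    = c("Population, 2016", "Total private dwellings"),
  `Neighbourhood A` = c("1,200", "500"),
  `Neighbourhood B` = c("3,450", "1,100"),
  check.names = FALSE,
  stringsAsFactors = FALSE
)

# Reshape wide -> long: one row per (characteristic, neighbourhood) pair.
hoods <- setdiff(names(raw), "Characteristic")
long <- data.frame(
  characteristic = rep(raw$Characteristic, times = length(hoods)),
  neighbourhood  = rep(hoods, each = nrow(raw)),
  value          = unlist(raw[hoods], use.names = FALSE),
  stringsAsFactors = FALSE
)

# Strip thousands separators so the column can be loaded as a number.
long$value <- as.numeric(gsub(",", "", long$value, fixed = TRUE))

# Hive's default delimited text format has no quoting, so write
# tab-separated, unquoted, without a header, and with \N for missing values.
write.table(long, "toronto_profiles.tsv", sep = "\t", quote = FALSE,
            row.names = FALSE, col.names = FALSE, na = "\\N")
```

Tabs are used as the delimiter here because some characteristic labels contain commas. A file written this way could then be loaded into a Hive table declared with `ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'`.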