Skip to content

Hongwei-Z/CanadianImmigrationStudy

Repository files navigation

CanadianImmigrationStudy

A Study of Potential Influencing Factors on Canadian Immigrants

CSCI 6405 -- Data Mining Project, Dalhousie University.

By Hongwei Zhang, January - April 2023.

Experiment 1: Study the reasons for Canadian immigrants leaving their home countries based on economic and social indicators for their countries.

Data mining methods: Kmeans Clustering, Hierarchical Clustering

Experiment 2: Study the reasons for attracting immigrants to Canada based on Canadian economic and social indicators.

Data mining methods: Decision Tree, Association Rule Mining

Immigration

Other files:
Experiment_1.5.ipynb: Clustering based on data in 2011.
render.html: Interactive world map of Canadian immigration sources

Datasets:

  1. Immigrant status and period of immigration by place of birth and citizenship: Canada, provinces and territories and census metropolitan areas with parts: https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=9810030201
  2. Net migration - Canada: https://data.worldbank.org/indicator/SM.POP.NETM?locations=CA
  3. GDP per capita (current US$): https://data.worldbank.org/indicator/NY.GDP.PCAP.CD
  4. Life expectancy at birth, total (years): https://data.worldbank.org/indicator/SP.DYN.LE00.IN
  5. Employment to population ratio, 15+, total (%): https://data.worldbank.org/indicator/SL.EMP.TOTL.SP.ZS
  6. Labor force participation rate, female (% of female population ages 15+): https://data.worldbank.org/indicator/SL.TLF.CACT.FE.ZS?view=chart
  7. Inflation, consumer prices (annual %): https://data.worldbank.org/indicator/FP.CPI.TOTL.ZG?view=chart
  8. Government expenditure on education, total (% of GDP): https://data.worldbank.org/indicator/SE.XPD.TOTL.GD.ZS
  9. Current health expenditure (% of GDP): https://data.worldbank.org/indicator/SH.XPD.CHEX.GD.ZS