### This notebook will be used for the Applied Data Science Capstone Project.

### The Capstone project is the culminating activity for the IBM Data Science Professional Certificate. The certificate covers a broad range of topics, from data science methodology and data analysis with Python to data visualization and machine learning.


### Table of Contents
1. [Introduction](#introduction)
2. [Literature Review](#literature-review)
3. [Data](#data)
4. [Methodology](#methodology)
5. [Results](#results)
6. [Discussion](#discussion)
7. [Conclusion](#conclusion)
8. [Bibliography](#bibliography)


### Introduction

Toronto is a city with a large immigrant population. However, migration is an expensive and risky endeavor for many migrants, so an informed approach to immigration is wise. Two broad strategies for the migration effort are trend following and trend setting.

In trend following, the prospective migrant tries to find out how others have migrated. The basics have to be covered: a source of income, typically through employment, and a place of residence. The presence of food sellers catering to particular cuisines within a neighborhood may help in understanding who lives in the area.

The other strategy, trend setting, is probably more appealing to those with an entrepreneurial mindset. This approach involves identifying potential growth areas, for example products and services that are needed but missing from a neighborhood. Because it is entrepreneurial in approach, it is also riskier: a missing product or service can simply indicate that there is no demand for it.

Whichever strategy is adopted, a deeper understanding of the target neighborhood is better than immigrating with no knowledge of the destination.

This project seeks to answer the question of what it takes to immigrate successfully into Canada, and into Toronto in particular.

As a strategy, this project will profile the neighborhoods in terms of demographic information and compare this against employment and business licensing information. It will attempt to see whether useful success patterns can be established, and to find possible areas of growth that may provide business opportunities.

### Literature Review

Canada is considered one of the more successful models in terms of immigration.
Canada has a relatively large immigrant labour force, both temporary and permanent (Liebig, 2016).
In Canada, migrants comprise about 20.6% of the entire population. In Toronto alone, immigrants account
for 33% of the entire population (Government of Canada, 2020).

A paper on immigrant integration suggests four main factors that contribute to successful migrant integration (Government of Canada, 2020).
These factors are migrant skills, stakeholder attitudes, early access to information, and settlement location.
Setting aside the challenge of attracting the right skills and talent, larger companies have more bandwidth and resources to
welcome migrants than smaller enterprises. Access to information and to relevant work networks depends
on the immigrant support organizations in the area assigned to the migrants. According to the same paper, the region of first settlement is
not necessarily the place of eventual settlement, and the right conditions have to be in place for migrants to feel
a sense of connection and community. Another article suggests that tax relief and assistance in acquiring a residence would help migrants settle better (Hosseini, 2017).
In either case, the new migrant labourer requires access to employment, which would probably be provided by larger
employers because of their capacity and their networks with the supporting organizations. The exception might be nursing care workers, who use a specialized route for migration (Liston & Carens, 2008)
and would gravitate toward areas where larger populations of elderly people reside.

On the flip side, survival entrepreneurs need access to people of a similar culture and ethnic background to support their business (Chrysostome, 2010).
This means that successful survival entrepreneurs would tend to be found close to areas with a large settlement of a particular ethnic group.

### Data

One advantage of using Toronto as the city of study is the availability of data published through the city's open data initiative.

Using Foursquare in combination with business registrations in the city, we can home in on businesses that could potentially provide employment for would-be migrants. We will detail the nature of the businesses in the city to get a good picture for matching with the migrants. Census data can show the concentration of elderly residents in the city, another source of employment for aspiring migrants.

Census data would also help in identifying residential areas and concentrations of particular ethnic groups, which are vital for survival entrepreneurs.

The primary data tools of choice would be descriptive in nature, describing what is already on the ground: the communities and the possible employment opportunities. However, it might also be useful to run some clustering analysis to validate some of the assumptions drawn from the literature review. Cluster analysis may also provide some serendipitous insights.
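
One way the clustering idea could be explored is sketched here. This is only an illustration under stated assumptions, not the analysis used in this notebook: the area labels and licence counts are made up, the `kmeans` helper is hypothetical, and k-means is just one possible choice of clustering method. The category names are taken from the business licence data.

```python
import numpy as np
import pandas as pd

# Made-up licence counts per area and category (illustrative values only).
counts = pd.DataFrame(
    {
        "EATING ESTABLISHMENT": [40, 35, 5, 8],
        "RETAIL STORE (FOOD)": [30, 25, 4, 6],
        "BUILDING RENOVATOR": [5, 8, 30, 35],
        "MASTER PLUMBER": [3, 4, 20, 25],
    },
    index=["AREA_1", "AREA_2", "AREA_3", "AREA_4"],  # hypothetical area labels
)

# Convert counts to row-wise shares so large and small areas are comparable.
shares = counts.div(counts.sum(axis=1), axis=0)


def kmeans(points, k, n_iter=20, seed=0):
    """Plain k-means: returns one cluster label per row of `points`."""
    rng = np.random.default_rng(seed)
    # Start from k distinct rows chosen at random.
    centers = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(n_iter):
        # Euclidean distance from every point to every center.
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Move each center to the mean of its assigned points.
        for j in range(k):
            if np.any(labels == j):
                centers[j] = points[labels == j].mean(axis=0)
    return labels


labels = kmeans(shares.to_numpy(dtype=float), k=2)
clusters = pd.Series(labels, index=shares.index, name="cluster")
print(clusters)
```

In this toy data the two food-heavy areas should land in one cluster and the two trades-heavy areas in the other. On real data the number of clusters would need to be chosen and justified.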

Aside from Foursquare and map data, the following datasets would be used:
- Toronto Open Data: https://open.toronto.ca
    - Ward Profiles, 2018 (25-Ward Model): https://open.toronto.ca/dataset/ward-profiles-2018-25-ward-model/
    - Municipal Licensing and Standards - Business Licences and Permits: https://open.toronto.ca/dataset/municipal-licensing-and-standards-business-licences-and-permits/
    - Toronto Employment Survey Summary Tables: https://open.toronto.ca/dataset/toronto-employment-survey-summary-tables/


### Methodology

In [1]:
import pandas as pd
import numpy as np

In [2]:
# Import business licence and permit data, loading only the columns used below
fields_used = ["Client Name", "Category", "Licence Address Line 3"]
df_biz_raw = pd.read_csv("./data/Business-licences-data.csv", usecols=fields_used)
df_biz_raw

Unnamed: 0,Category,Client Name,Licence Address Line 3
0,PRIVATE TRANSPORTATION COMPANY,TAXIFY CANADA INC,M9N 1A1
1,PRIVATE TRANSPORTATION COMPANY,INSTARYDE INC,M3J 2T8
2,PRIVATE TRANSPORTATION COMPANY,UBER CANADA INC,M4W 3M5
3,PRIVATE TRANSPORTATION COMPANY,FACEDRIVE INC,M2J 4R4
4,PRIVATE TRANSPORTATION COMPANY,RIDE INC,M8Z 3B1
...,...,...,...
160261,NON-MOTORIZED REFRESHMENT VEHICLE OWNER,FOSTEN INC,L7E 4K8
160262,NON-MOTORIZED REFRESHMENT VEHICLE OWNER,1990618 ONTARIO INC,M6B 2S3
160263,NON-MOTORIZED REFRESHMENT VEHICLE OWNER,"KESEROVIC, OLGA",M4A
160264,NON-MOTORIZED REFRESHMENT VEHICLE OWNER,"JAFARI BERENJI, MEHDI",L4S


In [3]:

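# Strip stray whitespace from every string cell; the result must be assigned back to take effect
df_biz_raw = df_biz_raw.applymap(lambda x: x.strip() if isinstance(x, str) else x)
# The first token of the postal code (the forward sortation area) serves as the area key
df_tmp = df_biz_raw['Licence Address Line 3'].str.split(" ", n=1, expand=True)
df_biz_raw['area'] = df_tmp[0]
df_biz_raw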

Unnamed: 0,Category,Client Name,Licence Address Line 3,area
0,PRIVATE TRANSPORTATION COMPANY,TAXIFY CANADA INC,M9N 1A1,M9N
1,PRIVATE TRANSPORTATION COMPANY,INSTARYDE INC,M3J 2T8,M3J
2,PRIVATE TRANSPORTATION COMPANY,UBER CANADA INC,M4W 3M5,M4W
3,PRIVATE TRANSPORTATION COMPANY,FACEDRIVE INC,M2J 4R4,M2J
4,PRIVATE TRANSPORTATION COMPANY,RIDE INC,M8Z 3B1,M8Z
...,...,...,...,...
160261,NON-MOTORIZED REFRESHMENT VEHICLE OWNER,FOSTEN INC,L7E 4K8,L7E
160262,NON-MOTORIZED REFRESHMENT VEHICLE OWNER,1990618 ONTARIO INC,M6B 2S3,M6B
160263,NON-MOTORIZED REFRESHMENT VEHICLE OWNER,"KESEROVIC, OLGA",M4A,M4A
160264,NON-MOTORIZED REFRESHMENT VEHICLE OWNER,"JAFARI BERENJI, MEHDI",L4S,L4S
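
The output above includes postal codes such as `L7E` and `L4S`, which fall outside Toronto (Toronto postal codes begin with `M`). A possible way to extract the three-character area code more defensively and keep only Toronto licences is sketched below. The sample rows are a small hand-made frame, the lowercase and unspaced value is a made-up edge case, and the names `sample`, `postal` and `toronto_only` are hypothetical.

```python
import pandas as pd

# Small hand-made sample; the last two values are made-up edge cases.
sample = pd.DataFrame(
    {"Licence Address Line 3": ["M9N 1A1", " M4A", "L7E 4K8", "m6b2s3", None]}
)

# Normalise whitespace and case before matching.
postal = sample["Licence Address Line 3"].str.strip().str.upper()

# A Canadian forward sortation area is letter-digit-letter at the start of the code.
sample["area"] = postal.str.extract(r"^([A-Z]\d[A-Z])", expand=False)

# Keep only areas starting with "M"; missing values are treated as non-matches.
toronto_only = sample[sample["area"].str.startswith("M", na=False)]
print(toronto_only)
```

Using a pattern rather than splitting on a space also handles codes written without a space, at the cost of dropping rows whose value does not look like a postal code.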


In [4]:
# Count licences per category (count() tallies non-null values in each column), largest first
df_biz_counts = df_biz_raw.groupby(['Category']).count().reset_index().sort_values(['Client Name'], ascending=False)
df_biz_counts.head(20)

Unnamed: 0,Category,Client Name,Licence Address Line 3,area
27,EATING ESTABLISHMENT,39322,39318,39318
62,RETAIL STORE (FOOD),26588,26587,26587
60,PUBLIC GARAGE,14441,14441,14441
11,BUILDING RENOVATOR,12022,12015,12015
52,PERSONAL SERVICES SETTINGS,9699,9699,9699
72,TAXICAB OWNER,9121,9119,9119
85,TOW TRUCK OWNER,5536,5536,5536
24,DRIVING INSTRUCTOR (V),3239,3239,3239
79,TEMPORARY SIGN - MOBILE,3138,3137,3137
41,MASTER PLUMBER,3088,3088,3088


In [5]:
df_biz_counts.tail(20)

Unnamed: 0,Category,Client Name,Licence Address Line 3,area
35,INSULATION INSTALLER,37,37,37
19,CURBLANE VENDING,34,34,34
82,TEMPORARY SIGN PROVIDER,33,33,33
51,PERMANENT FIREWORKS VENDOR,27,27,27
13,CHIMNEY REPAIRMAN,27,27,27
12,CARNIVAL,27,27,27
47,PARKLET CAFE,23,23,23
4,BATH HOUSE,17,17,17
16,CLOTHING DROP BOX OPERATOR,14,14,14
5,BILL DISTRIBUTOR,12,12,12
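
The category counts above could also be broken down by area, which is closer to the neighborhood profiling described in the introduction. The sketch below assumes a frame with the same `Category` and `area` columns as `df_biz_raw`; the five rows are made up for illustration, and `category_counts` and `area_profile` are hypothetical names.

```python
import pandas as pd

# Made-up rows using the same column names as df_biz_raw.
df = pd.DataFrame(
    {
        "Category": [
            "EATING ESTABLISHMENT",
            "EATING ESTABLISHMENT",
            "RETAIL STORE (FOOD)",
            "TAXICAB OWNER",
            "EATING ESTABLISHMENT",
        ],
        "area": ["M9N", "M4W", "M9N", "M4W", "M9N"],
    }
)

# One count per category, sorted from most to least common.
category_counts = df["Category"].value_counts()

# Rows are areas, columns are categories, cells are licence counts.
area_profile = pd.crosstab(df["area"], df["Category"])

print(category_counts)
print(area_profile)
```

Unlike `groupby(...).count()`, `value_counts()` returns a single count per category rather than one non-null count per column.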


### Results

- section to follow -


### Discussion

- section to follow -

### Conclusion

- section to follow -

### Bibliography

“8 Things Immigrants Should Know about Working in Canada.” Randstad Canada, n.d. https://www.randstad.ca/job-seeker/career-resources/working-in-canada/8-things-immigrants-should-know-about-working-in-canada/.
    
Canada, Employment and Social Development. “Survival to Success: Transforming Immigrant Outcomes.” Canada.ca. Government of Canada, June 8, 2020. https://www.canada.ca/en/employment-social-development/programs/foreign-credential-recognition/consultations.html.
        
Chrysostome, Elie. “The Success Factors of Necessity Immigrant Entrepreneurs: In Search of a Model.” ResearchGate. Thunderbird International Business Review, March 2010. https://www.researchgate.net/publication/229907244_The_success_factors_of_necessity_immigrant_entrepreneurs_In_search_of_a_model.
        
Griffith, Andrew. “Building a Mosaic: The Evolution of Canada's Approach to Immigrant Integration.” migrationpolicy.org. Migration Policy Institute, April 3, 2019. https://www.migrationpolicy.org/article/building-mosaic-evolution-canadas-approach-immigrant-integration.
    
Hosseini, Mana. “Is Canada Doing All It Can To Integrate New Immigrants?” Canada Immigration and Visa Information. Canadian Immigration Services and Free Online Evaluation. Canadian Citizenship & Immigration Resource Center (CCIRC), August 21, 2017. https://www.immigration.ca/canada-can-integrate-new-immigrants.
    
Liebig, Thomas. “Recruiting for Success Challenges for Canada’s Labour Migration System.” OECD.org, November 2016. http://www.oecd.org/migration/mig/recruiting-for-success-Canada.pdf.

Liston, Mary, and Joseph Carens. “Immigration and Integration in Canada.” Allard Research Commons. University of British Columbia, 2008. https://commons.allard.ubc.ca/cgi/viewcontent.cgi?article=1208&context=fac_pubs.