Skip to content

Build connection graphs of ultra-high net worth people in Canada.

Notifications You must be signed in to change notification settings

yuehuca/schulich-cibc-acp-21

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 

Repository files navigation

schulich-cibc-acp-21

This project is intended to build connection graphs of ultra-high net worth people in Canada.

  1. Run the namelist crawler to get the names and basic profile information from yahoo finance. This crawler is modified from yahoo_finance_scrap.
  2. Send a name list to search engine crawler. (When quotation marks are added to the key words, search engine will only return the results that exactly contain the targets that you need, meaning that the search amount can represent the connection strength between 2 targets. In this project I use bing to get the search amounts, and the reason is that the search results of bing are more accurate than the ones of google, and the anti-crawler algorithm of google is too powerful). proxy ips and random pseudo headers are applied, and the crawler can be run on AWS instance.
  3. Draw the graphs with pyecharts. You will need to prepare 3 variables: categories, links, and nodes.
  4. 2 different types of graphs are provided: circular and float
  5. The strength level is defined by the total search amount of a people, and the size of a dot is defined by the total number of connections with other people.

Screenshots of samples

  1. The outputs are based on the executives (126 people) whose anual salaries are higher than 2 million CAD and coming from listed Canadian companies.
  2. The final interactive outputs from pyecharts are coded in html. Please download them circular float to read in full functions.

alt text alt text alt text alt text alt text alt text

About

Build connection graphs of ultra-high net worth people in Canada.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published