## this is the test branch

After reading [@KyleOS's post](https://kyso.io/KyleOS/unicorns) about the breakdown of global unicorns, I wanted to see how the numbers stack up per capita.

Using the [CB Insights global unicorn tracker](https://www.cbinsights.com/research-unicorn-companies) and [UN population data for 2019](https://population.un.org/wpp/DataQuery/), I calculated the number of unicorns each country has per 10 million people.

First, let's show the ranking by total number of unicorns.

In [1]:
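import pandas as pd
import plotly.graph_objs as go
from plotly.offline import iplot

# plot.py presumably defines make_plot(), used for the choropleth map further down
%run plot.py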

In [None]:
# read the data
df_population = pd.read_csv('./data/populations.txt', delimiter=',')
df_cb = pd.read_excel('./data/CB-Insights_Global-Unicorn-Club_2019.xlsx')

# group the startups by country and count them
df_startups = df_cb.groupby('Country')['Company'].count().to_frame()

# rename the USA so it matches the name used in the population data
df_startups.rename(index={'United States': 'United States of America'}, inplace=True)

# merge the dataframes
df = df_startups.merge(df_population, on=['Country'])

# calculate unicorns per 10m people
df['Unicorns per 10m people'] = df['Company']/(df['2019']/10000000)
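
The merge above is an inner join, so any country whose name is spelled differently in the two files drops out without warning. One way to spot such rows is a left merge with `indicator=True`. The sketch below is a minimal illustration on hypothetical toy data (the country spellings and population figures are made up for the example, not taken from the real files):

```python
import pandas as pd

# Hypothetical toy data; the real notebook reads these from ./data/
df_startups = pd.DataFrame(
    {"Country": ["United States", "Estonia", "South Korea"], "Company": [3, 1, 2]}
)
df_population = pd.DataFrame(
    {
        "Country": ["United States of America", "Estonia", "Republic of Korea"],
        "2019": [300_000_000, 1_000_000, 50_000_000],
    }
)

# A left merge keeps every startup row; indicator=True adds a "_merge"
# column that says whether a matching population row was found
check = df_startups.merge(df_population, on="Country", how="left", indicator=True)

# Rows marked "left_only" had no population match and would be lost in an inner join
unmatched = check.loc[check["_merge"] == "left_only", "Country"].tolist()
print(unmatched)
```

Any name that shows up in `unmatched` can then be added to the `rename` mapping before the real merge.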

## Total Unicorns per country 2019

In [None]:
df = df.sort_values('Company')
bar = go.Bar(x=df['Company'], y=df['Country'], orientation='h')
fig = go.Figure(data=[bar], layout=go.Layout(height=700, width=700, margin={"l": 150}))
iplot(fig)

We get the results we expect: the USA and China dominate the total number of unicorns. They are the world's two largest economies, and both have large populations and homogeneous markets to support such companies.

## Unicorns per 10m people per country 2019

In [None]:
df = df.sort_values('Unicorns per 10m people')
bar = go.Bar(x=df['Unicorns per 10m people'], y=df['Country'], orientation='h')
fig = go.Figure(data=[bar], layout=go.Layout(height=700, width=700, margin={"l": 150}))
iplot(fig)

Here we get a different picture: the top three countries have only one unicorn each and very small populations. After Estonia, though, every country on the list has a population of at least 7 million. The USA still ranks highly even after accounting for population size. Smaller, high-tech countries do really well: Israel, Switzerland, Hong Kong, Singapore. Then we get a cluster of European countries. It's surprising that China still does well even with high inequality and such a large population.
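
To see why a single unicorn can put a tiny country at the top of a per-capita ranking, it helps to run the arithmetic by hand. The sketch below uses the same formula as the notebook, with a hypothetical helper name and made-up round numbers rather than real country figures:

```python
def unicorns_per_10m(unicorns: int, population: int) -> float:
    """Unicorns per 10 million people, same formula as the notebook column."""
    return unicorns / (population / 10_000_000)


# Hypothetical figures, chosen only to illustrate the effect
small_country = unicorns_per_10m(unicorns=1, population=500_000)
large_country = unicorns_per_10m(unicorns=200, population=330_000_000)

print(round(small_country, 2), round(large_country, 2))
```

With these numbers, the small country's one unicorn gives it a rate several times that of the large country's 200, which is why the very top of the ranking is so sensitive to population size.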

Finally, out of curiosity, let's have a look at a global choropleth map.

In [None]:
make_plot(df['Country'], df['Unicorns per 10m people'])

Unicorns seem pretty spread out. But if we look at this map again in 10 years, I'd expect Asian and African countries to be better represented than they are now.

Finally, let's print the entire table, sorted alphabetically by country, for reference:

In [None]:
with pd.option_context('display.max_rows', None, 'display.max_columns', None):  # temporarily lift the row and column display limits
    display(df_cb.sort_values('Country'))