# Instructions


● **Preprocess the dataset**

○ Clean (preprocess all “odd” columns, and pre-process values)

○ Enrich (get data from grouping operations, maybe enrich in different ways 👀)

● Perform EDA (exploratory data analysis) on the dataset

○ Generate visualizations

○ Share your insights and conclusions

● Preferably use Python


**Open Questions (optional):**

● Any conclusions based on the number of companies created by date, month, year etc?

● Is the number of branches for a CNPJ provided in the dataset equal to the actual
number produced by the dataset?

● What about the business activities (CNAE)? What are the main types? Can they be
aggregated into fewer groups?

● What are the differences between the cities / zip codes?

● Is it possible to catch any spatial relationships? Which visualizations would be best in
this case?

● If you were to make any model from the data, which one do you think makes sense?



# Importing libraries and reading data

In [1]:
#Imports 
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline
import seaborn as sns
from sqlalchemy import create_engine
from sklearn.model_selection import train_test_split
import pymysql
import pandas_profiling
import re
from datetime import datetime

In [2]:
df = pd.read_csv('../data/data_case.csv')



In [3]:
df.head()

Unnamed: 0,document_number,cnpj_basico,establishment_type,razao_social,nome_fantasia,opening_date,cnae,cnae_description,total_branches_and_socios,city_state,city_code,zip_code,capital_social,size_company,juri_description,email_provider
0,4363642000106,4363642,MATRIZ,TRES GERACOES EM MODA LTDA,TRES GERACOES,29 de março de 2001,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 6, 'tot_branches': '1 branches'}",Curitiba - PR,4106902,80410200,0.0,NOT_INFORMED,Sociedade Empresária Limitada,
1,17001584000465,17001584,FILIAL,FAGE COMERCIO VAREJISTA DE BIJUTERIAS LTDA,FAGE COMERCIO VAREJISTA DE BIJUTERIAS LTDA,20 de setembro de 2018,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 7, 'tot_branches': '4 branches'}",Curitiba - PR,4106902,80610905,100000.0,SMALL_COMPANY,Sociedade Empresária Limitada,HOTMAIL.COM
2,6931423000454,6931423,FILIAL,FONTANA COMERCIO DE JOIAS LTDA,,30 de julho de 2014,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '4 branches'}",Curitiba - PR,4106902,81200240,10000.0,NOT_INFORMED,Sociedade Empresária Limitada,
3,39842321000179,39842321,MATRIZ,ZAZULE ESTACAO CURITIBA LTDA,ZAZULE ESTACAO CURITIBA,19 de novembro de 2020,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '1 branches'}",Curitiba - PR,4106902,80230903,10000.0,NOT_INFORMED,Sociedade Empresária Limitada,ZAZULE.COM.BR
4,81671679000223,81671679,FILIAL,JOSE PELAYO SANCHEZ & CIA LTDA,PELAYO CASA,09 de outubro de 2013,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '2 branches'}",Londrina - PR,4113700,86020000,50000.0,SMALL_COMPANY,Sociedade Empresária Limitada,


# Data cleaning

In [4]:
# Checking data types

df.dtypes

document_number                int64
cnpj_basico                    int64
establishment_type            object
razao_social                  object
nome_fantasia                 object
opening_date                  object
cnae                           int64
cnae_description              object
total_branches_and_socios     object
city_state                    object
city_code                      int64
zip_code                      object
capital_social               float64
size_company                  object
juri_description              object
email_provider                object
dtype: object

In [5]:
# Checking duplicates

df.duplicated().sum() / len(df)

0.0

In [6]:
# Checking nulls

df.isna().sum() / len(df)

document_number              0.000000
cnpj_basico                  0.000000
establishment_type           0.000000
razao_social                 0.000000
nome_fantasia                0.328898
opening_date                 0.000000
cnae                         0.000000
cnae_description             0.000000
total_branches_and_socios    0.000000
city_state                   0.000000
city_code                    0.000000
zip_code                     0.000000
capital_social               0.000000
size_company                 0.000000
juri_description             0.000000
email_provider               0.341263
dtype: float64

In [7]:
# Checking column values (most and least frequent values per column)
for column in df.columns:
    counts = df[column].value_counts()
    print(counts.head(20))
    print(counts.tail(10))
    print()
    print()

4363642000106     1
38353734000127    1
19747444000113    1
40390008000126    1
40455467000140    1
32936513000103    1
32551625000138    1
31774522000174    1
32474742000145    1
40275830000146    1
41722445000162    1
42194867000175    1
35678579000120    1
21978902000177    1
37365409000111    1
41894040000101    1
31157942000101    1
29988808000100    1
20879673000170    1
29953048000104    1
Name: document_number, dtype: int64
32172944000132    1
35032781000180    1
19209988000121    1
2380579000190     1
6168511000101     1
25168658000138    1
7947121000194     1
27153062000190    1
17964993000114    1
81722795000143    1
Name: document_number, dtype: int64


15913061    370
77500049    156
78586674    100
75584110     88
2905110      86
78614278     85
2869763      73
75923185     66
88212113     61
11137051     60
2031555      60
82658840     58
33098658     57
10302127     57
751089       54
76189406     54
77856995     52
2834982      52
18192964     49
9248166      49
Name: 

{'tot_socios': 19, 'tot_branches': '5 branches'}     1
{'tot_socios': 52, 'tot_branches': '2 branches'}     1
{'tot_socios': 14, 'tot_branches': '14 branches'}    1
{'tot_socios': 64, 'tot_branches': '3 branches'}     1
{'tot_socios': 61, 'tot_branches': '1 branches'}     1
{'tot_socios': 7, 'tot_branches': '41 branches'}     1
{'tot_socios': 5, 'tot_branches': '55 branches'}     1
{'tot_socios': 11, 'tot_branches': '59 branches'}    1
{'tot_socios': 51, 'tot_branches': '3 branches'}     1
{'tot_socios': 57, 'tot_branches': '1 branches'}     1
Name: total_branches_and_socios, dtype: int64


Curitiba - PR                305883
Londrina - PR                 71330
Maringá - PR                  68031
Cascavel - PR                 40407
Ponta Grossa - PR             39580
São José dos Pinhais - PR     38732
Foz do Iguaçu - PR            33285
Colombo - PR                  25420
Pinhais - PR                  18644
Guarapuava - PR               17380
Toledo - PR                   17159
Araucá

**Preprocessing To Do**

document_number: OK.

cnpj_basico: some do not have 8 digits, should extract from document number.

establishment_type: OK.

razao_social: Some LTDA suffixes have dots, others don't. Otherwise OK.

nome_fantasia: values made only of * or dots are probably placeholders for missing data. 32% nulls. Capitalization is not consistent.

opening_date: Date is expressed as string.

cnae: OK, probably can be grouped differently.

cnae_description: OK, probably can be grouped differently.

total_branches_and_socios: JSON-Like. tot_socios and tot_branches should be different columns. Only numbers should be extracted.

city_state: OK.

city_code: OK.

zip_code: OK.

capital_social: some 0s and 1s that do not make much sense.

size_company: OK.

juri_description: OK.

email_provider: Caps and names are not consistent, can be grouped better. 
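For the "can be grouped differently" note on `cnae`, one option is to collapse the codes to a coarser level. The sketch below assumes the values are 7-digit CNAE subclass codes whose first two digits identify the division, and that a leading zero was lost because the column is stored as an integer. The series name `cnae_division` is hypothetical.

```python
import pandas as pd

# Three cnae codes taken from the sample rows shown above.
cnae = pd.Series([4789001, 2550101, 133411])

# Assumption: codes are 7-digit subclasses; pad the lost leading zero back
# and keep the first two digits as a coarser grouping key.
cnae_division = cnae.astype(str).str.zfill(7).str[:2]

print(cnae_division.tolist())
```

Grouping on such a key would reduce the number of activity categories considerably, at the cost of mixing quite different subclasses within one group.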

In [8]:
# Correcting cnpj_basico column with the first 8 digits of the document_number

df['cnpj_basico'] = df.document_number.apply(lambda x: int(str(x)[0:8]))

In [9]:
df['cnpj_basico'].value_counts()

15913061    370
77500049    156
78586674    100
75584110     88
78614278     85
           ... 
36808588      1
10423294      1
12369448      1
25307438      1
81722795      1
Name: cnpj_basico, Length: 1238260, dtype: int64
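One caveat with the slicing above: `document_number` is an int64 column, so a CNPJ that starts with zero has fewer than 14 digits, and taking the first 8 characters then pulls in a digit from the branch suffix (4363642000106 yields 43636420 instead of 04363642). A minimal sketch of a zero-padded alternative follows, assuming the usual 14-digit CNPJ layout (8-digit root, 4-digit branch, 2 check digits). The column names `cnpj_root` and `branch_suffix` are hypothetical.

```python
import pandas as pd

# Two document numbers from the sample rows; the first one has lost its
# leading zero because the column is stored as an integer.
sample = pd.DataFrame({"document_number": [4363642000106, 17001584000465]})

# Pad back to the full 14 digits before slicing.
padded = sample["document_number"].astype(str).str.zfill(14)
sample["cnpj_root"] = padded.str[:8]        # hypothetical column name
sample["branch_suffix"] = padded.str[8:12]  # hypothetical column name

print(sample)
```

Keeping the root as a string preserves the leading zeros; if an integer is preferred, `document_number // 10**6` gives the same root without any string handling.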

In [10]:
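# Replacing nulls (assigning back instead of using inplace on a column accessor,
# which is not reliable in recent pandas versions)

df['nome_fantasia'] = df['nome_fantasia'].fillna('Missing_Name')
df['email_provider'] = df['email_provider'].fillna('Missing_Email')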




In [11]:
# Replacing odd nome_fantasia values ('.' or a run of 3 to 8 asterisks)

placeholders = ['.'] + ['*' * n for n in range(3, 9)]
df.loc[df.nome_fantasia.isin(placeholders), 'nome_fantasia'] = 'Missing_Name'

In [12]:
# Modifying dates

# Extracting year
df.opening_date[0]

# Extracting day 

# Extracting month

# Replacing month by number with dictionary (?)

'29 de março de 2001'

In [13]:
# Extracting date year
df['opening_year'] = df.opening_date.apply(lambda x: x[-4:])


In [14]:
# Extracting date day
df['opening_day'] = df.opening_date.apply(lambda x: x[0:2])


In [15]:
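# Extracting date month: the format is 'DD de <month> de YYYY', so the month is
# the middle piece (a letters-only regex would drop accented characters such as 'ç')
df['opening_month'] = df.opening_date.apply(lambda x: x.split(' de ')[1])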

In [16]:
# Creating month names list
month_names = list(df.opening_month.value_counts().index)
 

In [17]:
# Creating month numbers list
month_numbers= ['03', '05', '02', '01', '04', '08', '06', '07', '10', '09', '11', '12']

In [18]:
# Creating function to replace values
def replacer(to_replace, list_1, list_2):
    for i in range(len(list_1)):
        if to_replace == list_1[i]:
            to_replace = list_2[i]
    return to_replace
    

In [19]:
# Testing function
foo = 'julho'
replacer(foo, month_names, month_numbers)

'07'

In [20]:
# Replacing month names for numbers
df['opening_month'] = df.opening_month.apply(lambda x: replacer(x, month_names, month_numbers))

In [21]:
# Creating clean date column
df['opening_date_clean'] = df.opening_year + '/' + df.opening_month + '/' + df.opening_day

In [22]:
# Changing to datetime format
df['opening_date_clean'] = pd.to_datetime(df['opening_date_clean'], format='%Y/%m/%d')
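The month lists above rely on the order returned by `value_counts()`, which would change if the data changed. As an alternative, here is a minimal sketch using an explicit dictionary, assuming every date follows the 'DD de <month> de YYYY' pattern seen in the sample. The names `MONTHS_PT` and `parse_pt_date` are hypothetical.

```python
import pandas as pd

# Portuguese month names mapped to month numbers. The mapping is explicit,
# so it does not depend on how often each month appears in the data.
MONTHS_PT = {
    "janeiro": 1, "fevereiro": 2, "março": 3, "abril": 4,
    "maio": 5, "junho": 6, "julho": 7, "agosto": 8,
    "setembro": 9, "outubro": 10, "novembro": 11, "dezembro": 12,
}

def parse_pt_date(text):
    """Parse 'DD de <month> de YYYY' into a pandas Timestamp."""
    day, month_name, year = text.split(" de ")
    return pd.Timestamp(year=int(year), month=MONTHS_PT[month_name.lower()], day=int(day))

dates = pd.Series(["29 de março de 2001", "12 de dezembro de 2000"]).apply(parse_pt_date)
print(dates)
```

An unknown month name raises a `KeyError` here, which makes malformed rows visible instead of silently passing them through.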

In [23]:
df

Unnamed: 0,document_number,cnpj_basico,establishment_type,razao_social,nome_fantasia,opening_date,cnae,cnae_description,total_branches_and_socios,city_state,city_code,zip_code,capital_social,size_company,juri_description,email_provider,opening_year,opening_day,opening_month,opening_date_clean
0,4363642000106,43636420,MATRIZ,TRES GERACOES EM MODA LTDA,TRES GERACOES,29 de março de 2001,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 6, 'tot_branches': '1 branches'}",Curitiba - PR,4106902,80410200,0.0,NOT_INFORMED,Sociedade Empresária Limitada,Missing_Email,2001,29,03,2001-03-29
1,17001584000465,17001584,FILIAL,FAGE COMERCIO VAREJISTA DE BIJUTERIAS LTDA,FAGE COMERCIO VAREJISTA DE BIJUTERIAS LTDA,20 de setembro de 2018,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 7, 'tot_branches': '4 branches'}",Curitiba - PR,4106902,80610905,100000.0,SMALL_COMPANY,Sociedade Empresária Limitada,HOTMAIL.COM,2018,20,09,2018-09-20
2,6931423000454,69314230,FILIAL,FONTANA COMERCIO DE JOIAS LTDA,Missing_Name,30 de julho de 2014,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '4 branches'}",Curitiba - PR,4106902,81200240,10000.0,NOT_INFORMED,Sociedade Empresária Limitada,Missing_Email,2014,30,07,2014-07-30
3,39842321000179,39842321,MATRIZ,ZAZULE ESTACAO CURITIBA LTDA,ZAZULE ESTACAO CURITIBA,19 de novembro de 2020,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '1 branches'}",Curitiba - PR,4106902,80230903,10000.0,NOT_INFORMED,Sociedade Empresária Limitada,ZAZULE.COM.BR,2020,19,11,2020-11-19
4,81671679000223,81671679,FILIAL,JOSE PELAYO SANCHEZ & CIA LTDA,PELAYO CASA,09 de outubro de 2013,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '2 branches'}",Londrina - PR,4113700,86020000,50000.0,SMALL_COMPANY,Sociedade Empresária Limitada,Missing_Email,2013,09,10,2013-10-09
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
1282534,3535330000250,35353300,FILIAL,AEQ ALIANCA ELETROQUIMICA LTDA,Missing_Name,27 de julho de 2011,2550101,"Fabricação de equipamento bélico pesado, excet...","{'tot_socios': 3, 'tot_branches': '4 branches'}",Quatro Barras - PR,4120804,83420000,12437843.0,OTHER,Sociedade Empresária Limitada,Missing_Email,2011,27,07,2011-07-27
1282535,3535330000412,35353300,FILIAL,AEQ ALIANCA ELETROQUIMICA LTDA,Missing_Name,31 de julho de 2019,2550101,"Fabricação de equipamento bélico pesado, excet...","{'tot_socios': 3, 'tot_branches': '4 branches'}",Quatro Barras - PR,4120804,83420000,12437843.0,OTHER,Sociedade Empresária Limitada,AEQAEROESPACIAL.COM.BR,2019,31,07,2019-07-31
1282536,4186294000149,41862940,MATRIZ,CWI AGRO FRUTICOLA LTDA,CWI AGRO & FRUTI,12 de dezembro de 2000,133411,Cultivo de pêssego,"{'tot_socios': 3, 'tot_branches': '2 branches'}",Lapa - PR,4113205,83750000,550000.0,SMALL_COMPANY,Sociedade Empresária Limitada,CWIAGRO.COM.BR,2000,12,12,2000-12-12
1282537,4186294000220,41862940,FILIAL,CWI AGRO FRUTICOLA LTDA,CWI AGRO & FRUTI,13 de setembro de 2016,133411,Cultivo de pêssego,"{'tot_socios': 3, 'tot_branches': '2 branches'}",Campo Largo - PR,4104204,83604365,550000.0,SMALL_COMPANY,Sociedade Empresária Limitada,CWIAGRO.COM.BR,2016,13,09,2016-09-13


In [24]:
# Creating total socios and total branches columns (as integers, so they can be aggregated)

df['total_socios'] = df.total_branches_and_socios.apply(lambda x: int(re.findall(r'\d+', x)[0]))

df['total_branches'] = df.total_branches_and_socios.apply(lambda x: int(re.findall(r'\d+', x)[1]))

In [25]:
df

Unnamed: 0,document_number,cnpj_basico,establishment_type,razao_social,nome_fantasia,opening_date,cnae,cnae_description,total_branches_and_socios,city_state,...,capital_social,size_company,juri_description,email_provider,opening_year,opening_day,opening_month,opening_date_clean,total_socios,total_branches
0,4363642000106,43636420,MATRIZ,TRES GERACOES EM MODA LTDA,TRES GERACOES,29 de março de 2001,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 6, 'tot_branches': '1 branches'}",Curitiba - PR,...,0.0,NOT_INFORMED,Sociedade Empresária Limitada,Missing_Email,2001,29,03,2001-03-29,6,1
1,17001584000465,17001584,FILIAL,FAGE COMERCIO VAREJISTA DE BIJUTERIAS LTDA,FAGE COMERCIO VAREJISTA DE BIJUTERIAS LTDA,20 de setembro de 2018,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 7, 'tot_branches': '4 branches'}",Curitiba - PR,...,100000.0,SMALL_COMPANY,Sociedade Empresária Limitada,HOTMAIL.COM,2018,20,09,2018-09-20,7,4
2,6931423000454,69314230,FILIAL,FONTANA COMERCIO DE JOIAS LTDA,Missing_Name,30 de julho de 2014,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '4 branches'}",Curitiba - PR,...,10000.0,NOT_INFORMED,Sociedade Empresária Limitada,Missing_Email,2014,30,07,2014-07-30,5,4
3,39842321000179,39842321,MATRIZ,ZAZULE ESTACAO CURITIBA LTDA,ZAZULE ESTACAO CURITIBA,19 de novembro de 2020,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '1 branches'}",Curitiba - PR,...,10000.0,NOT_INFORMED,Sociedade Empresária Limitada,ZAZULE.COM.BR,2020,19,11,2020-11-19,5,1
4,81671679000223,81671679,FILIAL,JOSE PELAYO SANCHEZ & CIA LTDA,PELAYO CASA,09 de outubro de 2013,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '2 branches'}",Londrina - PR,...,50000.0,SMALL_COMPANY,Sociedade Empresária Limitada,Missing_Email,2013,09,10,2013-10-09,5,2
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
1282534,3535330000250,35353300,FILIAL,AEQ ALIANCA ELETROQUIMICA LTDA,Missing_Name,27 de julho de 2011,2550101,"Fabricação de equipamento bélico pesado, excet...","{'tot_socios': 3, 'tot_branches': '4 branches'}",Quatro Barras - PR,...,12437843.0,OTHER,Sociedade Empresária Limitada,Missing_Email,2011,27,07,2011-07-27,3,4
1282535,3535330000412,35353300,FILIAL,AEQ ALIANCA ELETROQUIMICA LTDA,Missing_Name,31 de julho de 2019,2550101,"Fabricação de equipamento bélico pesado, excet...","{'tot_socios': 3, 'tot_branches': '4 branches'}",Quatro Barras - PR,...,12437843.0,OTHER,Sociedade Empresária Limitada,AEQAEROESPACIAL.COM.BR,2019,31,07,2019-07-31,3,4
1282536,4186294000149,41862940,MATRIZ,CWI AGRO FRUTICOLA LTDA,CWI AGRO & FRUTI,12 de dezembro de 2000,133411,Cultivo de pêssego,"{'tot_socios': 3, 'tot_branches': '2 branches'}",Lapa - PR,...,550000.0,SMALL_COMPANY,Sociedade Empresária Limitada,CWIAGRO.COM.BR,2000,12,12,2000-12-12,3,2
1282537,4186294000220,41862940,FILIAL,CWI AGRO FRUTICOLA LTDA,CWI AGRO & FRUTI,13 de setembro de 2016,133411,Cultivo de pêssego,"{'tot_socios': 3, 'tot_branches': '2 branches'}",Campo Largo - PR,...,550000.0,SMALL_COMPANY,Sociedade Empresária Limitada,CWIAGRO.COM.BR,2016,13,09,2016-09-13,3,2


In [26]:
# Uppercasing email provider's column
df['email_provider'] = df['email_provider'].str.upper()

In [27]:
# Creating a new column only with the email provider's company name

df['email_provider_company'] = df.email_provider.apply(lambda x: x.split('.')[0])

In [28]:
df

Unnamed: 0,document_number,cnpj_basico,establishment_type,razao_social,nome_fantasia,opening_date,cnae,cnae_description,total_branches_and_socios,city_state,...,size_company,juri_description,email_provider,opening_year,opening_day,opening_month,opening_date_clean,total_socios,total_branches,email_provider_company
0,4363642000106,43636420,MATRIZ,TRES GERACOES EM MODA LTDA,TRES GERACOES,29 de março de 2001,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 6, 'tot_branches': '1 branches'}",Curitiba - PR,...,NOT_INFORMED,Sociedade Empresária Limitada,MISSING_EMAIL,2001,29,03,2001-03-29,6,1,MISSING_EMAIL
1,17001584000465,17001584,FILIAL,FAGE COMERCIO VAREJISTA DE BIJUTERIAS LTDA,FAGE COMERCIO VAREJISTA DE BIJUTERIAS LTDA,20 de setembro de 2018,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 7, 'tot_branches': '4 branches'}",Curitiba - PR,...,SMALL_COMPANY,Sociedade Empresária Limitada,HOTMAIL.COM,2018,20,09,2018-09-20,7,4,HOTMAIL
2,6931423000454,69314230,FILIAL,FONTANA COMERCIO DE JOIAS LTDA,Missing_Name,30 de julho de 2014,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '4 branches'}",Curitiba - PR,...,NOT_INFORMED,Sociedade Empresária Limitada,MISSING_EMAIL,2014,30,07,2014-07-30,5,4,MISSING_EMAIL
3,39842321000179,39842321,MATRIZ,ZAZULE ESTACAO CURITIBA LTDA,ZAZULE ESTACAO CURITIBA,19 de novembro de 2020,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '1 branches'}",Curitiba - PR,...,NOT_INFORMED,Sociedade Empresária Limitada,ZAZULE.COM.BR,2020,19,11,2020-11-19,5,1,ZAZULE
4,81671679000223,81671679,FILIAL,JOSE PELAYO SANCHEZ & CIA LTDA,PELAYO CASA,09 de outubro de 2013,4789001,"Comércio varejista de suvenires, bijuterias e ...","{'tot_socios': 5, 'tot_branches': '2 branches'}",Londrina - PR,...,SMALL_COMPANY,Sociedade Empresária Limitada,MISSING_EMAIL,2013,09,10,2013-10-09,5,2,MISSING_EMAIL
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
1282534,3535330000250,35353300,FILIAL,AEQ ALIANCA ELETROQUIMICA LTDA,Missing_Name,27 de julho de 2011,2550101,"Fabricação de equipamento bélico pesado, excet...","{'tot_socios': 3, 'tot_branches': '4 branches'}",Quatro Barras - PR,...,OTHER,Sociedade Empresária Limitada,MISSING_EMAIL,2011,27,07,2011-07-27,3,4,MISSING_EMAIL
1282535,3535330000412,35353300,FILIAL,AEQ ALIANCA ELETROQUIMICA LTDA,Missing_Name,31 de julho de 2019,2550101,"Fabricação de equipamento bélico pesado, excet...","{'tot_socios': 3, 'tot_branches': '4 branches'}",Quatro Barras - PR,...,OTHER,Sociedade Empresária Limitada,AEQAEROESPACIAL.COM.BR,2019,31,07,2019-07-31,3,4,AEQAEROESPACIAL
1282536,4186294000149,41862940,MATRIZ,CWI AGRO FRUTICOLA LTDA,CWI AGRO & FRUTI,12 de dezembro de 2000,133411,Cultivo de pêssego,"{'tot_socios': 3, 'tot_branches': '2 branches'}",Lapa - PR,...,SMALL_COMPANY,Sociedade Empresária Limitada,CWIAGRO.COM.BR,2000,12,12,2000-12-12,3,2,CWIAGRO
1282537,4186294000220,41862940,FILIAL,CWI AGRO FRUTICOLA LTDA,CWI AGRO & FRUTI,13 de setembro de 2016,133411,Cultivo de pêssego,"{'tot_socios': 3, 'tot_branches': '2 branches'}",Campo Largo - PR,...,SMALL_COMPANY,Sociedade Empresária Limitada,CWIAGRO.COM.BR,2016,13,09,2016-09-13,3,2,CWIAGRO
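Many providers are company-specific domains that appear only a handful of times, so the column could be grouped further by collapsing rare values into one bucket. A minimal sketch on toy data follows. The helper `group_rare`, the `"OTHER"` label and the `min_count` threshold are all assumptions, not part of the original analysis.

```python
import pandas as pd

# Toy provider names in the same style as email_provider_company.
providers = pd.Series([
    "HOTMAIL", "GMAIL", "GMAIL", "ZAZULE",
    "MISSING_EMAIL", "HOTMAIL", "CWIAGRO", "MISSING_EMAIL",
])

def group_rare(values, min_count):
    """Keep values seen at least min_count times; label the rest 'OTHER'."""
    counts = values.value_counts()
    keep = counts[counts >= min_count].index
    return values.where(values.isin(keep), "OTHER")

grouped = group_rare(providers, min_count=2)
print(grouped.value_counts())
```

On the full dataset the threshold would need tuning, for example by looking at how much of the data the top providers cover.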


# Data Enrichment

In [29]:
# Grouping by month-year

# Grouping by economic activity

# Grouping by zip_code or city

# Grouping by capital levels

# Grouping by cnpj_basico (company)

# Adding economic data (economic activity, consumption, commodity prices: sugar, soybeans, corn, wheat, coffee)
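As a first pass at the company-level grouping, and at the open question about branch counts, the declared `total_branches` can be compared with the number of rows actually present per `cnpj_basico`. The sketch below uses toy values shaped like the rows shown earlier. The output column names are hypothetical, and it assumes `total_branches` has already been converted to an integer.

```python
import pandas as pd

# Toy data: one row per establishment, with the declared branch count
# repeated on every row of the same company.
toy = pd.DataFrame({
    "cnpj_basico": [3535330, 3535330, 4186294, 4186294, 39842321],
    "total_branches": [4, 4, 2, 2, 1],
})

per_company = toy.groupby("cnpj_basico").agg(
    declared_branches=("total_branches", "first"),  # value stated in the data
    observed_rows=("total_branches", "size"),       # rows actually present
)
per_company["matches"] = (
    per_company["declared_branches"] == per_company["observed_rows"]
)

print(per_company)
```

A mismatch does not necessarily mean the declared number is wrong: the dataset may only cover part of a company's establishments (for example, a single state).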

In [None]:
# Indicators

# Openings per month, social activity, zipcode or city

# Capital per month, avg capital per company

# Average socios, branches per company

# Big companies openings over total openings
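The monthly indicators listed above could be computed with a single `groupby` on the month of `opening_date_clean`. A minimal sketch on toy data follows, assuming the date column is already a datetime. The output column names `openings` and `avg_capital` are hypothetical.

```python
import pandas as pd

# Toy data with the two columns needed for the monthly indicators.
toy = pd.DataFrame({
    "opening_date_clean": pd.to_datetime(["2019-07-31", "2019-07-02", "2020-11-19"]),
    "capital_social": [12437843.0, 10000.0, 10000.0],
})

# Group by calendar month and compute openings and average capital.
monthly = toy.groupby(toy["opening_date_clean"].dt.to_period("M")).agg(
    openings=("capital_social", "size"),
    avg_capital=("capital_social", "mean"),
)

print(monthly)
```

The same pattern extends to the other indicators by swapping the grouping key (city, zip code, economic activity) or the aggregated column.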



