Addressable market challange

The solution was developed in a juputer notebook with the following structure:

Load libs and modules
Load raw data sets
Split actual and addressable customers
EDA
- Descriptive statistics analsysis
- Customer type size analysis
- Outlier analysis
- Univariate distribution analysis
Model building
- Customer segmentation
- Decision-tree classifier
- Customer scorer (pearson similiarity)
Generate Deliverables
- Deliverable 1
- Deliverable 2
- Sanity Check
Improvements (TODO)

The notebook source artifacts:

Raw Data

Addressable customers' ids: addressable_ids.csv
Training dataset: training_ids.csv
Validation dataset: testing_ids.csv
Addressable Market in the format id/score, ordered by score: addressable_ranking.csv

Powered by IBM Watson Studio using the following hadware and software config:

Fernando Felix do Nascimento Junior

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Addressable Market_files		Addressable Market_files
Addressable Market.html		Addressable Market.html
Addressable Market.ipynb		Addressable Market.ipynb
Neoway_database_2019-05-17.csv		Neoway_database_2019-05-17.csv
README.md		README.md
addressable_ids.csv		addressable_ids.csv
addressable_ranking.csv		addressable_ranking.csv
classifier.py		classifier.py
clustering.py		clustering.py
config.py		config.py
customer_CRM_2019-05-17.csv		customer_CRM_2019-05-17.csv
requirements.txt		requirements.txt
testing_ids.csv		testing_ids.csv
training_ids.csv		training_ids.csv
utils.py		utils.py
viz.py		viz.py