autoInsight is a library that helps researchers and developers see patterns in data that have two classes at a glance.
copy autoInsight.py to your working directory
requirement: pandas, numpy, xgboost, re, IPython
Import data and use the library in your jupyter notebook as follows. (XGBoost_Data_Exploration.ipynb)
import autoInsight as ai
import pandas as pd
import numpy as np
dataset = pd.read_csv('framingham.csv')
dataset['male'] = dataset['male'].astype('bool')
ai.binary_auto_insight(dataset, labelCol = 'TenYearCHD', positive_class = 1)
Examples of Results (from kaggle data for 10-Year Coronary Heart Disease Risk Prediction 'framingham.csv')
These are results created from autoInsight. Each feature is chosen automatically with a machine learning algorithm.
. . .
Note: missing = No information appears on that feature (NA value)
Data: https://www.kaggle.com/amanajmera1/framingham-heart-study-dataset