Skip to content

Library for displaying insight of data automatically

Notifications You must be signed in to change notification settings

thadaJ/autoInsight

Repository files navigation

autoInsight

autoInsight is a library that helps researchers and developers see patterns in data that have two classes at a glance.

Step1:

copy autoInsight.py to your working directory

requirement: pandas, numpy, xgboost, re, IPython

Step2:

Import data and use the library in your jupyter notebook as follows. (XGBoost_Data_Exploration.ipynb)

import autoInsight as ai
import pandas as pd
import numpy as np
dataset = pd.read_csv('framingham.csv')
dataset['male'] = dataset['male'].astype('bool')
ai.binary_auto_insight(dataset, labelCol = 'TenYearCHD', positive_class = 1)

Examples of Results (from kaggle data for 10-Year Coronary Heart Disease Risk Prediction 'framingham.csv')

These are results created from autoInsight. Each feature is chosen automatically with a machine learning algorithm. result

. . .

result

Note: missing = No information appears on that feature (NA value)

References

Data: https://www.kaggle.com/amanajmera1/framingham-heart-study-dataset

About

Library for displaying insight of data automatically

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published