Fast data exploration using Sweetviz

personx000/fast_eda

Fast And Easy Data Exploration For Machine Learning

Where there is data smoke, there is business fire.

-- Thomas Redman

Table of Contents

  1. What is this repo about?
  2. Quick setup
  3. Example
  4. Let's connect!

What is this repo about?

Tired of spending “too much time” doing data exploration before training your Machine Learning models?

Looking for a faster way to understand data issues and patterns, before you dive into the fun part of training your ML model?

Wanna learn how to train better ML models by finding and fixing issues in your data?

In this repo you can find a short Python script that uses the Sweetviz library to do data exploration at the speed of light.

I also invite you to read the original article I published on my site:

📝 Fast And Easy Data Exploration For Machine Learning

Quick setup

You need Python >= 3.7 and the Sweetviz library:

$ pip install sweetviz

Example

I attached a sample dataset for a churn prediction model that you can quickly explore with a one-liner:

$ python eda.py --file v1.csv --target Churn
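
For readers curious about what a script like this might do under the hood, here is a minimal sketch. It is not the actual contents of eda.py: the function names `parse_args` and `explore`, the `report.html` output name, and the Yes/No-to-boolean conversion are all assumptions made for illustration. The only Sweetviz calls used are `sv.analyze` and `show_html`.

```python
import argparse


def parse_args(argv=None):
    """Parse the two flags used in the command above (--file and --target)."""
    parser = argparse.ArgumentParser(description="Quick EDA report with Sweetviz")
    parser.add_argument("--file", required=True, help="path to the CSV file")
    parser.add_argument("--target", default=None, help="name of the target column")
    return parser.parse_args(argv)


def explore(csv_path, target=None, output="report.html"):
    """Build a Sweetviz HTML report for one CSV file (hypothetical helper)."""
    # Imported here so that argument parsing works without the heavy dependencies.
    import pandas as pd
    import sweetviz as sv

    df = pd.read_csv(csv_path)

    # Assumption: Sweetviz accepts only boolean or numerical targets, so a
    # two-valued text column such as Yes/No is mapped to True/False first.
    if (
        target is not None
        and not pd.api.types.is_numeric_dtype(df[target])
        and df[target].nunique() == 2
    ):
        positive = sorted(df[target].dropna().unique())[-1]  # e.g. "Yes"
        df[target] = df[target] == positive

    report = sv.analyze(df, target_feat=target)
    report.show_html(output)  # writes the HTML file and opens it in a browser
    return output
```

With this sketch, calling `explore("v1.csv", target="Churn")` would play the same role as the one-liner above, and `parse_args()` could feed it the command-line flags.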

Let's connect!

If you want to learn more about real-world ML topics and become a better data scientist:

👉 Subscribe to the datamachines newsletter.

👉 Follow me on Twitter and LinkedIn.
