JohnMcCambridge/flenser

Flenser

Have you ever been handed a dataset you've never seen before?

Flenser is a simple, minimal, automated exploratory data analysis tool. It runs a set of simple tests against each column in a dataset and outputs an HTML file noting which tests trigger for each column, alongside relevant outputs.

Flenser is intended to be run at the earliest stages of data exploration, when you have no familiarity with the dataset. It will do its best to tell you what is actually going on in the dataset, regardless of what is supposed to be going on in the dataset.

Flenser is designed to be helpful, not 'helpful': it will not attempt to modify or make assumptions about your dataset. Instead, it applies each simple test to every column and shows you outputs that let your human brain decide what is actually going on.

Additional tests can be added by modifying the Test dataclass.
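As a rough illustration of what a per-column test could look like, here is a minimal sketch. The field names (`name`, `description`, `check`), the two example tests, and the `run_tests` helper are all assumptions made for this example; the actual Test dataclass in flenser.py may be shaped differently, so check the source before adding your own.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass(frozen=True)
class Test:
    """Hypothetical shape; the real Test dataclass in flenser.py may differ."""
    name: str
    description: str
    # Receives every value in one column and returns True if the test triggers.
    check: Callable[[Sequence[str]], bool]


def has_duplicates(values: Sequence[str]) -> bool:
    # Triggers when at least one value appears more than once.
    return len(set(values)) < len(values)


def has_blank(values: Sequence[str]) -> bool:
    # Triggers when any cell is empty or whitespace-only.
    return any(v.strip() == "" for v in values)


# Illustrative tests only; these are not Flenser's built-in tests.
TESTS = (
    Test("duplicates", "Column contains repeated values", has_duplicates),
    Test("blanks", "Column contains empty or whitespace-only cells", has_blank),
)


def run_tests(columns: Dict[str, Sequence[str]],
              tests: Sequence[Test] = TESTS) -> Dict[str, List[str]]:
    # Apply every test to every column and record which tests triggered.
    return {
        col: [t.name for t in tests if t.check(values)]
        for col, values in columns.items()
    }


if __name__ == "__main__":
    sample = {"id": ["1", "2", "3"], "city": ["Oslo", "", "Oslo"]}
    print(run_tests(sample))
```

In this sketch, adding a test means writing one function that takes a column's values and appending one `Test(...)` entry. The data itself is never modified.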

How to run

python3 flenser.py "filename.csv"

Flenser will print its default list of NaN values. You may specify one or more additional values to treat as NaN, as follows:

python3 flenser.py "filename.csv" "nan1" "nan2" "nan3" ...
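The command line above can be pictured with the sketch below, which merges extra NaN markers with a default list. It is a guess at the behaviour, not Flenser's actual code: `DEFAULT_NANS` holds placeholder values (Flenser prints its real defaults when run), and `parse_args` is a hypothetical helper.

```python
from typing import List, Sequence, Tuple

# Placeholder defaults assumed for illustration; not Flenser's real list.
DEFAULT_NANS = ("", "nan", "NaN", "NULL")


def parse_args(argv: Sequence[str]) -> Tuple[str, List[str]]:
    """Split an argv-style list into the filename and the full NaN list."""
    if len(argv) < 2:
        raise ValueError('usage: python3 flenser.py "filename.csv" ["nan1" ...]')
    filename = argv[1]
    nans = list(DEFAULT_NANS)
    # Every argument after the filename is treated as an extra NaN marker.
    for value in argv[2:]:
        if value not in nans:
            nans.append(value)
    return filename, nans


# Example: what the script might see for
#   python3 flenser.py "filename.csv" "-999" "missing"
filename, nans = parse_args(["flenser.py", "filename.csv", "-999", "missing"])
print(filename, nans)
```

If the file is loaded with pandas, a list like this could be passed to `pandas.read_csv` through its `na_values` parameter. Whether Flenser does so is not stated here.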

With thanks to

Recurse
Kelly F
Rebecca H
Azhad S
Shivam S
Christina M
Adam K
Edith V
Justin R
