Skip to content
An extension to pandas dataframes describe function.
Jupyter Notebook Python
Branch: master
Clone or download
Pull request Compare This branch is 7 commits ahead of AlfonsoRReyes:master.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
data
notebooks
pandas_overview
tests
.gitignore
LICENSE
MANIFEST.in
README.md
pytest.ini
setup.py
tox.ini

README.md

pandas_overview

A tool that goes deeper than pandas dataframe's describe function.

The module contains DataFrameSummary and a DataFrameOverview objects.

The DataFrameSummary is an extended version of pandas' describe() method with:

  • methods
    • summary(columns=None): extends the describe() function with detailed info:
      • count, mean, std, min, 25%, 50%, 75%, max, counts, uniques, mising, missing %, type
    • dfs.type_summary(type=None): summary for a specific type, it has a more specialized info:
      • numeric: iqr, kurtosis, skewness, sum, mad, cv, zeros_num, zeros %, deviating of mean, deviating of mean %
      • constant: top row
      • categorical: values, top row
      • bool: left value, left value %, right value, right value %
      • unique: is not that unique :/
  • properties
    • dfs.columns_types: a count of the types of columns
    • dfs.types: internal list of supported types
    • dfs[column]: more in depth summary of the column

The DataFrameOverview displays an overview of the data, including:

  • methods
    • overview(dfs, first_level=None): Data overview showing:
      • Summary
      • Columns
      • Head
      • Tail
      • Correlations
      • Histogram for numeric columns
      • Range for unique and date columns
      • Values for constant columns

Installation (Not published yet)

The module can be easily installed with pip:

> pip install pandas-overview

This module depends on numpy and pandas. Optionally you can get also some nice visualisations if you have matplotlib installed.

Tests

To run the tests, execute the command python setup.py test.

Usage

For detailed information check this this and this out.

Origins

This is a forked version of the original pandas-summary plus great additions from pandas-summary-master

Contribution and License Agreement

If you contribute code to this project, you are implicitly allowing your code to be distributed under the MIT license. You are also implicitly verifying that all code is your original work.

license

You can’t perform that action at this time.