Skip to content

User-friendly Python implementation of distance correlation, a better alternative to Pearson correlation for preliminary data analysis

License

Notifications You must be signed in to change notification settings

colemai/correlade

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Correlade Package

Pearson correlation only really detects linear relationships. Distance Correlation is a better measure, detecting more varied dependency relationships between variables.

This package aims to give a one-liner that'll be super useful for preliminary data analysis in Pandas, producing a matrix of pairwise distance correlations and also a heatmap of these reults.

Because this correlation measure is more computationally expensive, if you feed it more than 2000 observations it will randomly select 2000 rows on which to base the calculation.

To install:

pip3 install cython
pip3 install git+https://github.com/hoihui/distcorr
pip3 install correlade 

To use:

from correlade import correlade
correlade.dcorr(df)

About

User-friendly Python implementation of distance correlation, a better alternative to Pearson correlation for preliminary data analysis

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages