Skip to content
master
Go to file
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
src
 
 
 
 
 
 
 
 
 
 

README.md

Ball Statistics

Travis Build Status AppVeyor Build Status CRAN Status Badge PyPI version

Introdution

The fundamental problems for data mining, statistical analysis, and machine learning are:

  • whether several distributions are different?
  • whether random variables are dependent?
  • how to pick out useful variables/features from a high-dimensional data?

These issues can be tackled by Ball statistics, which enjoy following admirable advantages:

  • available for most of datasets (e.g., traditional tabular data, brain shape, functional connectome, wind direction and so on)
  • insensitive to outliers, distribution-free and model-free;
  • theoretically guaranteed and computationally efficient.

Softwares

R package

Install the Ball package from CRAN:

install.packages("Ball")

Compared with selective R packages available for datasets in metric spaces:

fastmit energy HHG Ball
Test of equal distributions ✔️ ✔️ ✔️
Test of independence ✔️ ✔️ ✔️ ✔️
Test of joint independence ✔️
Feature screening / Sure Independence Screening (SIS) ✔️
Iterative Feature screening / Iterative SIS ✔️
Datasets in metric spaces ✔️ SNT ✔️ ✔️
Robustness ✔️ ✔️ ✔️
Parallel programming ✔️ ✔️
Computational efficiency 🏃🏃🏃 🏃🏃🏃 🏃🏃 🏃🏃🚶

SNT is the abbreviation of strong negative type.

See the following documents for more details about the Ball package:

Python package

Install the Ball package from PyPI:

pip install Ball

References

Bug report

Open an issue or send an email to Jin Zhu at zhuj37@mail2.sysu.edu.cn

You can’t perform that action at this time.