Skip to content

locluclak/SCaDA

Repository files navigation

Statistical Inference for k-means Clustering after Domain Adaptation

This package provides a statistical inference framework for k-means clustering after domain adaptation (DA). It leverages the SI framework and employs a divide-and-conquer strategy to efficiently compute the p-value of selected features. Our method ensures reliable feature selection by controlling the false positive rate (FPR) while simultaneously maximizing the true positive rate (TPR), effectively reducing the false negative rate (FNR).

Environment Setup

pip install -r requirements.txt

Usage

We provide several Jupyter notebooks demonstrating how to use the SCaDA.

PyPI package

The SCaDA is available on the PyPI and can be installed as follows:

pip install PySCaDA

About

Statistical Inference for k-means Clustering after Domain Adaptation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published