Multi-variable AUC for Sifting Complementary Features and Its Biomedical Application

UCI data and multi-class data were downloaded from the UC Irvine Machine Learning Repository.
TCGA datasets are preprocessed data from the Xena platform, in the “gene expression RNAseq -IlluminaHiSeq pancan normalized” version. The files are too large to upload to GitHub. Please see the provided web links for details and to download the datasets.

data_process.py is code for data preprocessing.
feature_selection.py is code for sifting an optimal feature set.
feature_ranking.py is code for ranking features from high to low and for sorting features by their frequency.
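
As a rough illustration of sorting features by frequency, the sketch below counts how often each feature appears across several selected feature sets and orders the features by that count. It is not taken from feature_ranking.py: the function name `rank_by_frequency`, the definition of frequency as "number of selected sets containing the feature", and the feature names are all assumptions made for the example.

```python
from collections import Counter


def rank_by_frequency(selected_sets):
    """Return (feature, count) pairs, most frequent first.

    Hypothetical helper: a feature's frequency is assumed to be the
    number of selected sets it appears in. Ties are broken by name.
    """
    # set(s) ensures a feature is counted at most once per selected set.
    counts = Counter(f for s in selected_sets for f in set(s))
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


# Made-up feature sets, e.g. the outputs of three selection runs.
runs = [
    ["gene_a", "gene_b"],
    ["gene_a", "gene_c"],
    ["gene_a", "gene_b", "gene_d"],
]

ranking = rank_by_frequency(runs)
for feature, count in ranking:
    print(feature, count)
```

Breaking ties by name keeps the ordering reproducible between runs; any other deterministic tie-break would work equally well.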

Globally evaluate the complementarity among features.
Screen discriminative combinations of features that are complementary to each other from a global view.
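
To make the idea of complementarity concrete, here is a minimal toy sketch in which two features are weak on their own but separate the classes perfectly when combined. It is not the multi-variable AUC defined by this repository: the pairwise `auc` helper, the toy data, and the hand-picked combination (the difference `x1 - x2`) are assumptions chosen only for illustration.

```python
from itertools import product


def auc(scores, labels):
    """Pairwise (Mann-Whitney) AUC: P(pos score > neg score), ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one sample of each class")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p, n in product(pos, neg))
    return wins / (len(pos) * len(neg))


# Toy data (made up): a sample is in class 1 whenever x1 > x2.
x1 = [1, 3, 5, 0, 2, 4]
x2 = [0, 2, 4, 1, 3, 5]
y = [1, 1, 1, 0, 0, 0]

auc_x1 = auc(x1, y)  # each feature alone separates the classes poorly
auc_x2 = auc(x2, y)

# Hand-picked combination for this toy case; a real method would have to
# search for or learn a suitable combination.
combined = [a - b for a, b in zip(x1, x2)]
auc_combined = auc(combined, y)

print(auc_x1, auc_x2, auc_combined)
```

Here neither feature alone ranks the classes well, while the combined score ranks every class-1 sample above every class-0 sample. A feature set evaluated only by single-feature scores could miss such a pair.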

Note: when applied to gene expression datasets, features are genes.
