OVBLR-SFE: An Optimal Variational Bayesian Logistic Regression (OVBLR) model with a Salient Feature Estimation (SFE) strategy.
OVBLR-SFE is an interpretable machine learning method that leverages feature importance to enhance interpretability. It combines variational inference with a Bayesian framework to approximate the posterior distribution, and uses the estimated posterior parameters as the regression coefficients' weights. In addition, we define salient features via a 95% confidence interval (95% CI) to support the selection of important features in high-dimensional datasets.
The OVBLR-SFE code is implemented on top of PRML and scikit-learn.
- Feature Importance: OVBLR-SFE focuses on identifying and quantifying the importance of features within a dataset.
- Variational Inference: The method utilizes variational inference techniques to approximate the posterior probability distribution.
- Bayesian Framework: OVBLR-SFE adopts a Bayesian framework, allowing for a principled approach to modeling and inference.
- Weighted Regression Coefficients: The estimated parameters of the posterior probability distribution are employed as weights for the regression coefficients.
- Significance-based Feature Selection: The concept of significant features is defined based on a 95% confidence interval (95%CI) to aid in selecting important features in high-dimensional datasets.
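The significance-based selection rule can be sketched as follows. This is a minimal illustration, not the package's implementation: it assumes the 95% CI of each coefficient is formed as posterior mean ± 1.96 × posterior standard deviation, and that a feature counts as salient when that interval excludes zero. The helper name `salient_features` is hypothetical.

```python
import numpy as np

def salient_features(names, means, stds, z=1.96):
    """Flag coefficients whose 95% CI (mean +/- z * std) excludes zero.

    Hypothetical helper illustrating the salient-feature idea; the
    actual criterion used by OVBLR-SFE lives inside the package.
    """
    means, stds = np.asarray(means, float), np.asarray(stds, float)
    lower, upper = means - z * stds, means + z * stds
    # The interval excludes zero iff it lies entirely above or below 0.
    salient = (lower > 0) | (upper < 0)
    return [(n, round(float(lo), 4), round(float(hi), 4), bool(s))
            for n, lo, hi, s in zip(names, lower, upper, salient)]

print(salient_features(['Bare_Nuclei'], [1.9137], [0.2416]))
# -> [('Bare_Nuclei', 1.4402, 2.3872, True)]
```

Note that the interval (1.4402, 2.3872) matches the sample output shown later in this README, consistent with the mean ± 1.96 × std assumption.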
pip = "*"
ipykernel = {version = "*", index = "https://pypi.douban.com/simple"}
ffmpeg = "*"
matplotlib = "*"
scikit-learn = "*"
pandas = "*"
numpy = "*"
imblearn = "*"
seaborn = "*"
openpyxl = "*"
polling = "*"
socks = "*"
lime = "*"
shap = "*"
eli5 = "*"
ipython = "*"
jupyter = "*"
mglearn = "*"
self-paced-ensemble = "*"
tabulate = "*"
pymoo = "*"
python setup.py install
- Import the OVBLR-SFE module:
  from prml.linear import VariationalLogisticRegression
- Prepare your dataset and ensure it is in the appropriate format.
- Train the OVBLR-SFE model:
  vlr = VariationalLogisticRegression(a0=1, b0=1)
  vlr.fit(X_train, y_train, feature_names)
- Obtain feature importance scores:
  importance_scores = vlr.feature_importance()
  Example output, as tuples of (feature_name, weight, lower, upper, is_salient_feature):
  ('Bare_Nuclei', '1.9137 ± 0.2416', Decimal('1.4402'), Decimal('2.3872'), True),
  ('Clump_Thickness', '1.4302 ± 0.2088', Decimal('1.0210'), Decimal('1.8394'), True),
  ...
- Predict with the trained model:
  y_pred_prob = vlr.proba(X_test)
  y_pred = vlr.predict(X_test)
  score = vlr.score(X_test, y_test)
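The proba → predict → score pattern above follows standard logistic-regression conventions. A minimal NumPy sketch of that pattern, assuming a 0.5 decision threshold and mean accuracy as the score (a sketch, not the package's implementation):

```python
import numpy as np

def proba(X, w):
    """Class-1 probability via the logistic sigmoid of X @ w."""
    return 1.0 / (1.0 + np.exp(-X @ w))

def predict(X, w, threshold=0.5):
    """Hard 0/1 labels from probabilities (0.5 threshold assumed)."""
    return (proba(X, w) >= threshold).astype(int)

def score(X, y, w):
    """Mean accuracy, mirroring scikit-learn's classifier score()."""
    return float(np.mean(predict(X, w) == y))

# Toy check on a 1-D problem separable at x = 0
X = np.array([[-2.0], [-1.0], [1.0], [2.0]])
y = np.array([0, 0, 1, 1])
w = np.array([3.0])
print(score(X, y, w))  # prints 1.0
```

In OVBLR-SFE the weight vector comes from the approximate posterior rather than a point estimate, but the prediction interface behaves the same way.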
You can run single_test.py for a complete example. For more results, see the images and image_acme folders.