Computationally Efficient Feature Significance and Importance for Machine Learning Models (SFIT)

Introduction

This repo implements the SFIT method from the paper "Computationally Efficient Feature Significance and Importance for Machine Learning Models".

The Single Feature Introduction Test (SFIT) is a simple and computationally efficient significance test for the features of a machine learning model. Our forward-selection approach applies to any model specification, learning task and variable type. The test is non-asymptotic, straightforward to implement, and does not require model refitting. It identifies the statistically significant features as well as feature interactions of any order in a hierarchical manner. For more details, please refer to the full paper.

Requirements

The sfit functions require numpy, scipy, statsmodels, keras and sklearn.

The main file that illustrates use cases of the code and replicate the results of the simulations from the paper require sklearn, statsmodels, tensorflow and keras.

Running the code

python main.py

This generates simulated data as described in the paper and fit a linear model and a neural network on them. These models are then used to run the SFIT method. The expected printed output has been saved in this file.

Contact and cite

If you have any questions, please contact Enguerrand Horel (ehorel at stanford dot edu).

If you use this code in your work, please cite:

@article{horel2019computationally, title={Computationally Efficient Feature Significance and Importance for Machine Learning Models}, author={Horel, Enguerrand and Giesecke, Kay}, journal={arXiv preprint arXiv:1905.09849}, year={2019} }

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
test		test
LICENSE		LICENSE
README.md		README.md
expected_output.txt		expected_output.txt
main.py		main.py
sfit.py		sfit.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Computationally Efficient Feature Significance and Importance for Machine Learning Models (SFIT)

Introduction

Requirements

Running the code

Contact and cite

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Computationally Efficient Feature Significance and Importance for Machine Learning Models (SFIT)

Introduction

Requirements

Running the code

Contact and cite

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages