S-CLASSY

Discovering Rule Lists with Preferred Variables (S-CLASSY)

This repository contains code that uses preferred variables to discover Hybrid Rule Lists (HRL) for univariate or multivariate classification in data mining. Candidate models are compared and selected using the Minimum Description Length (MDL) principle.
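To give a rough intuition for MDL-based selection, the sketch below scores two candidate rule lists by a two-part code length (model bits plus data bits) and prefers the shorter one. It is only an illustration and not the encoding used in this repository: the function names, the fixed `bits_per_rule` cost, and the use of the empirical class distribution as the data code are all simplifying assumptions made for this example.

```python
import math
from collections import Counter


def data_code_length(labels):
    """Bits needed to encode `labels` with a Shannon code built from
    their own empirical class distribution (a simplification)."""
    n = len(labels)
    counts = Counter(labels)
    return -sum(c * math.log2(c / n) for c in counts.values())


def two_part_score(partitions, bits_per_rule=8.0):
    """Hypothetical two-part MDL score for a rule list.

    `partitions` holds the class labels covered by each rule, with the
    default rule last. Every rule except the default is charged a fixed,
    made-up cost of `bits_per_rule` bits.
    """
    model_bits = bits_per_rule * (len(partitions) - 1)
    data_bits = sum(data_code_length(p) for p in partitions if p)
    return model_bits + data_bits


# Toy data: 20 instances of class "a" and 20 of class "b".
labels = ["a"] * 20 + ["b"] * 20

# Candidate 1: only the default rule, which covers everything.
score_default = two_part_score([labels])

# Candidate 2: one rule that isolates class "a"; the default rule takes the rest.
score_rule = two_part_score([["a"] * 20, ["b"] * 20])

# The candidate with the smaller total code length is preferred.
best = "one rule" if score_rule < score_default else "default only"
print(score_default, score_rule, best)
```

In this toy case the extra rule costs a few model bits but makes both partitions pure, so the total description is shorter and the one-rule list wins. With a rule that barely changes the class distribution, the model cost would outweigh the gain and the default-only list would be kept.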

Dependencies

This project was written for Python>=3.7. All required PyPI packages are listed in requirements.txt. Note that this list includes the gmpy2 package. Install the dependencies locally with:

pip install -r requirements.txt
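After installing, a quick sanity check can confirm that the interpreter meets the stated minimum version and that gmpy2 can be found. The helper below is a minimal sketch, not part of this repository: `check_environment` is a hypothetical name, and it only checks gmpy2 by default because that is the one package named here.

```python
import sys
from importlib.util import find_spec


def check_environment(min_version=(3, 7), packages=("gmpy2",)):
    """Return a list of human-readable problems (empty if none found)."""
    problems = []
    if sys.version_info < min_version:
        wanted = ".".join(str(part) for part in min_version)
        problems.append(f"Python >= {wanted} is required")
    for name in packages:
        # find_spec returns None when a top-level package is not installed.
        if find_spec(name) is None:
            problems.append(f"missing package: {name}")
    return problems


if __name__ == "__main__":
    issues = check_environment()
    print("Environment looks fine." if not issues else "\n".join(issues))
```

An empty list means both checks passed. Extend `packages` with other import names from requirements.txt if you want a fuller check.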

Execution guide

Open run_S-CLASSY.ipynb or run_S-CLASSY_cross_validation.ipynb from the Jupyter Notebook dashboard by clicking the file name.
Update the configuration for the datasets that you want to load (see the comments in the .ipynb file for details).
Run the notebook.

References

[1] Proença, H.M., Grünwald, P., Bäck, T., van Leeuwen, M.: Robust subgroup discovery. Data Mining and Knowledge Discovery 36(5), 1885–1970 (2022)

[2] Proença, H.M., van Leeuwen, M.: Interpretable multiclass classification by MDL-based rule lists. Information Sciences 512, 1372–1393 (2020)

