sinpe13/tubesML

 
 

TubesML

A package that allows for flexible ML pipelines, model validation, and model inspection.

The project started as a way to productize code written for Kaggle competitions, and it is mostly designed to facilitate the creation of flexible processing and modeling pipelines. Everything is built around scikit-learn transformers and methods, but the concept can be extended to other ML packages.

The main characteristic that makes the pipeline components of tubesML flexible is that they always preserve the pandas DataFrame structure. This makes it easy to, for example, create a feature within a pipeline and test it in a grid search.
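To illustrate the idea of a DataFrame-preserving step, here is a minimal sketch that uses only scikit-learn and pandas. It does not use the tubesML API: the `RatioFeature` class, the column names, and the toy data are all hypothetical and exist only for this example. The transformer returns a DataFrame, so the new feature can be built from named columns and switched on or off from a grid search.

```python
import pandas as pd
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline


class RatioFeature(BaseEstimator, TransformerMixin):
    """Hypothetical transformer (not part of tubesML).

    Optionally adds an 'area_per_room' column and always returns a DataFrame.
    """

    def __init__(self, add_ratio=True):
        self.add_ratio = add_ratio

    def fit(self, X, y=None):
        # Nothing to learn; the transformer is stateless.
        return self

    def transform(self, X):
        X = X.copy()  # avoid mutating the caller's DataFrame
        if self.add_ratio:
            X["area_per_room"] = X["area"] / X["rooms"]
        return X


# Toy data, invented for illustration only.
df = pd.DataFrame(
    {
        "area": [50, 80, 120, 65, 95, 150, 70, 110],
        "rooms": [2, 3, 4, 2, 3, 5, 3, 4],
    }
)
y = pd.Series([100, 150, 260, 130, 190, 330, 135, 230])

pipe = Pipeline([("ratio", RatioFeature()), ("model", LinearRegression())])

# The grid search compares the pipeline with and without the new feature.
grid = GridSearchCV(pipe, param_grid={"ratio__add_ratio": [True, False]}, cv=2)
grid.fit(df, y)
print(grid.best_params_)
```

Because the step keeps column names, the choice "with or without the feature" becomes an ordinary hyperparameter that is evaluated alongside the model's own settings.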

Installation

Dependencies

tubesML requires:

  • Python (>= 3.7)
  • Matplotlib (>= 3.7.5)
  • Pandas (>= 2.2.2)
  • Scikit-Learn (>= 1.2.2)
  • Seaborn (>= 0.12.0)

These requirements are sufficient for a Kaggle notebook; however, the package has been developed with the following requirements:

  • Python (>= 3.10)
  • Matplotlib (>= 3.9.0)
  • Pandas (>= 2.2.2)
  • Numpy (>= 2.0.0)
  • Scikit-Learn (>= 1.5.0)
  • Seaborn (>= 0.12.0)
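As a rough way to check an environment against the minimum versions in the first list above, the following sketch uses only the standard library. The helper names (`parse_version`, `meets_minimum`, `check_dependencies`) are hypothetical and not part of tubesML, and the version comparison is deliberately simplified: it looks only at the leading numeric parts and ignores pre-release suffixes.

```python
from importlib.metadata import PackageNotFoundError, version

# Minimum versions copied from the first dependency list (PyPI distribution names).
MINIMUMS = {
    "matplotlib": "3.7.5",
    "pandas": "2.2.2",
    "scikit-learn": "1.2.2",
    "seaborn": "0.12.0",
}


def parse_version(text):
    """Turn '1.5.0' into (1, 5, 0), stopping at the first non-numeric part."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Compare two version strings numerically, padding the shorter with zeros."""
    a, b = parse_version(installed), parse_version(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def check_dependencies(minimums=MINIMUMS):
    """Return {name: (installed_version_or_None, is_ok)} for each dependency."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            report[name] = (None, False)
        else:
            report[name] = (installed, meets_minimum(installed, minimum))
    return report


for name, (installed, ok) in check_dependencies().items():
    status = "ok" if ok else "missing or too old"
    print(f"{name}: {installed} ({status})")
```

For strict comparisons that handle pre-releases correctly, a dedicated version parser would be a better choice than this simplified tuple comparison.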

User installation

If you already have a working installation of pandas and scikit-learn, the easiest way to install tubesML is using pip:

pip install -U tubesml

Important links
