Skip to content
master
Switch branches/tags
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 

Hydra-Sklearn Preprocessing Pipelines

Sklearn-Hydra

This repository accompanying the blog post:

Creating Configurable Data Pre-Processing Pipelines by Combining Hydra and Sklearn - by Eli Simhayev & Benjamin Bodner

Running Different Pipelines

Run:

python main.py preprocessing_pipeline=decision_tree

to execute the decision_tree preprocessing pipeline. You might also run other pipelines (from configs/preprocessing_pipeline) by just changing:

python main.py preprocessing_pipeline=<your-pipeline>

Hydra also supports Tab completion to complete config.

Adding New Pipelines

Adding new pipelines can be easily done using a yaml configuration in configs/preprocessing_pipeline. You might add another configurations: which model to use, which visualizations, etc. - learn more here: Hydra — A fresh look at configuration for machine learning projects

We hope this will help you to better organize your data preprocessing pipelines 🙂

About

Hydra-Sklearn preprocessing pipelines. Code accompanying the blogpost: Creating Configurable Data Pre-Processing Pipelines by Combining Hydra and Sklearn

Topics

Resources