Skip to content

A generalizable, loss-minimizing, automated sklearn system for simplifying the process of deploying high-performance model ensembles for tabular data.

License

Notifications You must be signed in to change notification settings

varunlakshmanan/glass

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

51 Commits
 
 
 
 
 
 
 
 

Repository files navigation

What is Glass?

Glass is the Generalizable, Loss-minimizing, Automated Sklearn System. Glass is a template for quickly building and deploying a high-performance model ensemble on tabular and numerical data. It returns the best possible fitted model ensemble out of every possible combination of estimators whose parameters have been hyperoptimized. Ordinarily, machine learning requires five core steps:

  • Data Preprocessing
  • Feature Engineering
  • Model Fitting
  • Hyperparameter Optimization
  • Model Evaluation and Ensembling

Using Glass greatly minimizes the steps of model fitting, hyperparameter optimization, and model ensembling. Because it's built on top of native scikit-learn functions, it's immediately familiar to understand. You can deploy a high-performance model in under 5 lines of code.

How do I use Glass?

Preprocess your dataset, and perform initial feature engineering. When preprocessing, impute missing data and encode categorical features as necessary. Ensure that your data has no null or NaN values. When feature engineering, create custom features and drop unnecessary columns based on feature importance.

After that, follow the instructions in the Documentation to get started. Take a look at the example provided for further usage guidelines.

How does Glass work?

Glass works by initializing many different scikit-learn models with varying architectures. It fits each of them to your input data while training, and hyperoptimizes the parameters of each model. The accuracy of every possible ensembled combination of these models is then ranked, and the best fitted model ensemble is returned for your use.

What technologies does Glass use?

Languages:

  • Python

Frameworks:

  • Scikit-learn - for most of the estimators, predictors, and transformers that compose the final model
  • Pandas - for data manipulation
  • XGBoost
  • LightGBM
  • Optuna - for hyperparameter optimization

About

A generalizable, loss-minimizing, automated sklearn system for simplifying the process of deploying high-performance model ensembles for tabular data.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages