
Autobinary Framework version 1.0.11

The Autobinary library is a set of tools for automating the process of building a model to solve specific business problems.

Autobinary allows you to:

  1. Conduct a primary exploratory analysis and process the factors;
  2. Make a primary selection of factors from all those available;
  3. Run initial training according to the required cross-validation scheme;
  4. Search for an optimal set of hyperparameters;
  5. Perform a deep selection of factors to finalize the model;
  6. Calibrate the final model if necessary;
  7. Visualize optimization and business metrics;
  8. Conduct an interpretative analysis of the factors.

How to use:


Installation script:

  1. Move the installation file autobinary-1.0.10.tar.gz to the required folder
  2. Install with: !pip install autobinary-1.0.10.tar.gz
  3. Import the library with: import autobinary

Manual setup:

  1. Copy the "autobinary" folder to your local workspace;
  2. Add the path to the "autobinary" folder to the Python path;
  3. Import the necessary tools from the autobinary library.
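
A minimal sketch of the manual setup, assuming the "autobinary" folder was copied into an illustrative local directory such as ./libs (the path is an assumption, adjust it to your layout):

    import sys

    # Illustrative path: the directory that contains the copied "autobinary" folder.
    sys.path.insert(0, "./libs")

    import autobinary  # the library is now importable from the local copy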

Requirements:


  • pandas >= 1.3.1
  • numpy >= 1.21.5
  • catboost >= 0.25.1
  • matplotlib >= 3.1.0
  • scikit-learn >= 0.24.2, < 1.2.0
  • pdpbox == 0.2.0

The repository folders provide detailed examples of using the library:


  1. 01_Feature_vs_Target:
* Examples of analyzing the target variable with respect to a factor for classification problems;

* Examples of analyzing the target variable with respect to a factor for regression problems.
  2. 02_CV_importances_for_trees:
* Examples of training various algorithms for classification problems according to a cross-validation scheme;

* Examples of training various algorithms for regression problems according to a cross-validation scheme;

* Examples of training various algorithms for multiclass classification problems according to a cross-validation scheme;

* Calculation of factor importances after the algorithm has been trained.
  3. 03_Tuning_parameters_Optuna:
* Examples of finding an optimal set of hyperparameters using the Optuna library (a generic sketch of this step follows the list below).
  4. 04_Explaining_output:
* Examples of interpreting the influence of factors on the target variable using the Shap library;

* Examples of interpreting the influence of factors on the target variable using the PDPbox library.
  5. 05_Uplift_models:
* Examples of a Solo model for uplift problems with the necessary cross-validation scheme;

* Examples of Two models (Vanilla) for uplift problems with the necessary cross-validation scheme;

* Examples of Two models (DDR control) for uplift problems with the necessary cross-validation scheme;

* Examples of Two models (Treatment control) for uplift problems with the necessary cross-validation scheme.
  6. 06_Base_uplift_calibration:
* Calibration examples for response tasks;

* Calibration examples for uplift tasks;

* Calibration examples for other types of tasks.
  7. 07_Feature_selection:
* Examples of primary selection of factors from all available using gap analysis, correlation analysis, tree-depth analysis, and the Permutation Importance method (for binary classification, regression and multiclass classification); a generic permutation-importance sketch follows the list below;

* Examples of deep factor selection using the Forward and Backward selection methods;

* Examples of factor selection using the Target Permutation method.
  8. 08_Custom_metrics:
* Examples of visualizing standard and custom metrics for a detailed understanding of algorithm quality in binary classification and uplift tasks;

* Examples of visualizing standard and custom metrics for a detailed understanding of algorithm quality in regression problems.
  9. 09_Finalization_calibration:
* An example of finalizing and calibrating a model, starting from an existing model and training with the given parameters, for a binary classification problem;

* An example of finalizing and calibrating a model, starting from an existing model and training with the given parameters, for a regression problem.
  10. 10_Full_Fitting_model:
* An example of the entire process of building and finalizing a model in a single notebook for a binary classification problem (probability of surviving the Titanic disaster).
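
The repository notebooks demonstrate these steps with the library's own tooling; as a hedged, library-agnostic illustration of the hyperparameter search in 03_Tuning_parameters_Optuna, the sketch below tunes a CatBoost classifier with Optuna over a cross-validation scheme. The data, search space and trial budget are illustrative assumptions, not autobinary's API:

    import optuna
    from catboost import CatBoostClassifier
    from sklearn.datasets import make_classification
    from sklearn.model_selection import cross_val_score

    # Illustrative data; in practice this would be the prepared factor matrix.
    X, y = make_classification(n_samples=1000, n_features=20, random_state=42)

    def objective(trial):
        # The search space is illustrative; adjust the ranges for your own problem.
        params = {
            "depth": trial.suggest_int("depth", 3, 8),
            "learning_rate": trial.suggest_float("learning_rate", 0.01, 0.3, log=True),
            "iterations": trial.suggest_int("iterations", 100, 500),
            "verbose": 0,
        }
        model = CatBoostClassifier(**params)
        # Mean ROC AUC over a 5-fold cross-validation scheme.
        return cross_val_score(model, X, y, cv=5, scoring="roc_auc").mean()

    study = optuna.create_study(direction="maximize")
    study.optimize(objective, n_trials=20)
    print(study.best_params)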
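
Similarly, a minimal sketch of the permutation-importance idea behind the primary factor selection in 07_Feature_selection, built on scikit-learn's generic implementation rather than autobinary's own classes; all names and settings here are illustrative:

    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.inspection import permutation_importance
    from sklearn.model_selection import train_test_split

    # Illustrative data and split; real usage follows the chosen cross-validation scheme.
    X, y = make_classification(n_samples=1000, n_features=10, random_state=0)
    X_train, X_valid, y_train, y_valid = train_test_split(X, y, random_state=0)

    model = RandomForestClassifier(random_state=0).fit(X_train, y_train)

    # Shuffle each factor on the validation part and measure the drop in quality;
    # factors whose shuffling barely changes the score are candidates for removal.
    result = permutation_importance(model, X_valid, y_valid, n_repeats=10,
                                    scoring="roc_auc", random_state=0)
    for idx in result.importances_mean.argsort()[::-1]:
        print(f"feature_{idx}: {result.importances_mean[idx]:.4f}")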

Authors:
