A basic script that takes any dataset, makes a regression model, and then generates a PDF report about it!

augustvanhout/regressit

Regressit


I built an automated regression machine that prints out an analyst-worthy PDF report. It can be used to analyze a new dataset very quickly, judge whether we have enough data to move forward, and, if we do, show how the variables relate to the target!

The functions I wrote produce a reasonable regression model for whatever data we feed them, then generate a report on the independent variables and their statistical relationship to the target.


How to use Regressit:

You'll find an importable script in this directory. Just call:

model = model_workflow(X, y, features_desired)
generate_report(model, title = "fitting title")

The workflow runs through these steps:

Derive features

  • Use the training data to derive features through cleaning and engineering, then save the recipe we figured out so it can be reapplied later
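
As a rough illustration of what "deriving a recipe" could look like, the sketch below fills missing values with the training median and keeps, for each numeric column, whichever candidate transformation correlates best with the target. The function name `derive_recipe`, the `TRANSFORMS` table, and the correlation-based scoring are assumptions made for this example, not the script's actual code.

```python
import numpy as np
import pandas as pd

# Hypothetical candidate transformations; the real script may try others.
TRANSFORMS = {
    "identity": lambda s: s,
    "log1p": lambda s: np.log1p(s.clip(lower=0)),
    "square": lambda s: s ** 2,
}


def derive_recipe(X_train: pd.DataFrame, y_train: pd.Series) -> dict:
    """Record, per numeric column, a fill value and the best-scoring transform."""
    recipe = {}
    for col in X_train.select_dtypes(include="number").columns:
        median = X_train[col].median()  # learned from training data only
        filled = X_train[col].fillna(median)
        scores = {}
        for name, fn in TRANSFORMS.items():
            score = abs(fn(filled).corr(y_train))
            # A constant column has an undefined correlation; treat it as 0.
            scores[name] = 0.0 if np.isnan(score) else float(score)
        best = max(scores, key=scores.get)
        recipe[col] = {"fill": float(median), "transform": best, "score": scores[best]}
    return recipe


# Toy data: the target is the square of x, and one x value is missing.
X_train = pd.DataFrame({"x": [1.0, 2.0, None, 4.0, 5.0]})
y_train = pd.Series([1.0, 4.0, 9.0, 16.0, 25.0])
recipe = derive_recipe(X_train, y_train)
```

Because the recipe is a plain dictionary, it can be stored alongside the model and replayed on data the model has not seen.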

Create a column info DF

  • Build a dataframe of information about those columns so we can convey it to the audience with tables and graphs
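
A column info dataframe might hold one row per input column with a few summary statistics. The sketch below is only a guess at its shape: `build_column_df` and the column names `missing_frac`, `n_unique`, and `corr_with_target` are hypothetical, chosen for illustration.

```python
import pandas as pd


def build_column_df(X: pd.DataFrame, y: pd.Series) -> pd.DataFrame:
    """Summarize every column of X in one row of a new dataframe."""
    rows = []
    for col in X.columns:
        is_numeric = pd.api.types.is_numeric_dtype(X[col])
        rows.append(
            {
                "column": col,
                "dtype": str(X[col].dtype),
                "missing_frac": float(X[col].isna().mean()),
                "n_unique": int(X[col].nunique()),
                # Correlation is only defined for numeric columns.
                "corr_with_target": float(X[col].corr(y)) if is_numeric else float("nan"),
            }
        )
    return pd.DataFrame(rows).set_index("column")


X = pd.DataFrame({"size": [1.0, 2.0, 3.0, None], "city": ["a", "b", "a", "b"]})
y = pd.Series([2.0, 4.0, 6.0, 8.0])
column_df = build_column_df(X, y)
```

A table like this can feed both the report's tables and its plots, since every statistic sits in one place keyed by column name.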

Select top features

  • Take the top (n) features from the original data to be used in an OLS model.
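
One simple way to pick the top n features is to rank the numeric columns by the absolute value of their correlation with the target. The README does not say how the script ranks features, so both the ranking rule and the name `select_top_features` are assumptions here.

```python
import pandas as pd


def select_top_features(X: pd.DataFrame, y: pd.Series, n: int) -> list:
    """Return the n numeric columns most correlated (in absolute value) with y."""
    numeric = X.select_dtypes(include="number")
    scores = numeric.corrwith(y).abs().sort_values(ascending=False)
    return scores.head(n).index.tolist()


X = pd.DataFrame(
    {
        "strong": [2.0, 4.0, 6.0, 8.0, 10.0],
        "medium": [1.0, 2.0, 3.0, 5.0, 4.0],
        "weak": [1.0, -1.0, 1.0, -1.0, 1.0],
    }
)
y = pd.Series([1.0, 2.0, 3.0, 4.0, 5.0])
top_features = select_top_features(X, y, n=2)
```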

Apply feature engineering

  • Take the top-performing features we found, pull them from the source data again, and apply their best transformations
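
Replaying a saved recipe on fresh data could look like the sketch below. The recipe layout (a fill value plus a transform name per column), the `TRANSFORMS` table, and `apply_recipe` are all hypothetical and exist only for this example.

```python
import numpy as np
import pandas as pd

# Hypothetical lookup from transform name to function.
TRANSFORMS = {
    "identity": lambda s: s,
    "log1p": lambda s: np.log1p(s.clip(lower=0)),
    "square": lambda s: s ** 2,
}


def apply_recipe(X_raw: pd.DataFrame, recipe: dict) -> pd.DataFrame:
    """Keep only the recipe's columns, fill gaps, and apply each saved transform."""
    out = pd.DataFrame(index=X_raw.index)
    for col, step in recipe.items():
        filled = X_raw[col].fillna(step["fill"])  # reuse the training-time fill value
        out[col] = TRANSFORMS[step["transform"]](filled)
    return out


recipe = {"x": {"fill": 3.0, "transform": "square"}}
X_test = pd.DataFrame({"x": [2.0, None], "unused": [0.0, 1.0]})
X_test_ready = apply_recipe(X_test, recipe)
```

Nothing is re-estimated from the new data in this sketch, so test rows cannot leak into the fill values or the choice of transform.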

Model workflow

  • Do it all in one go: train/test split, build a model, transform the test data, test the new model, and fit a final one for production. Export a dictionary of the model items, such as the column_df and the model itself
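
The split, fit, evaluate, refit sequence can be sketched with plain NumPy. This is a minimal stand-in under stated assumptions: it fits ordinary least squares with `numpy.linalg.lstsq` rather than whatever library the script uses, and `model_workflow_sketch` and the keys of the dictionary it returns are hypothetical.

```python
import numpy as np


def fit_ols(X: np.ndarray, y: np.ndarray) -> np.ndarray:
    """Least-squares coefficients, with the intercept in position 0."""
    design = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef


def predict(coef: np.ndarray, X: np.ndarray) -> np.ndarray:
    return coef[0] + X @ coef[1:]


def r_squared(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return float(1.0 - ss_res / ss_tot)


def model_workflow_sketch(X, y, test_frac=0.25, seed=0) -> dict:
    # Shuffle row indices, then hold out the first chunk as the test set.
    order = np.random.default_rng(seed).permutation(len(X))
    n_test = int(len(X) * test_frac)
    test_idx, train_idx = order[:n_test], order[n_test:]

    # Fit on the training rows and score on the held-out rows.
    train_coef = fit_ols(X[train_idx], y[train_idx])
    test_r2 = r_squared(y[test_idx], predict(train_coef, X[test_idx]))

    # Refit on all rows for the "production" model.
    final_coef = fit_ols(X, y)
    return {
        "coef": final_coef,
        "test_r2": test_r2,
        "n_train": len(train_idx),
        "n_test": n_test,
    }


# Synthetic data: y = 1 + 2*x1 - 3*x2 plus a little noise.
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 2))
y = 1.0 + 2.0 * X[:, 0] - 3.0 * X[:, 1] + rng.normal(scale=0.1, size=200)
result = model_workflow_sketch(X, y)
```

Scoring on held-out rows before refitting on everything gives an honest estimate of fit while still letting the final model use all available data.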

Generate report

  • Create the PDF report from the model dictionary
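
The README does not say which library writes the PDF. One common option is matplotlib's `PdfPages`, sketched below with a text page followed by a chart page. The function `write_report` and its arguments are hypothetical and are not the script's `generate_report`.

```python
import os
import tempfile

import matplotlib

matplotlib.use("Agg")  # headless backend, so no display is required
import matplotlib.pyplot as plt
from matplotlib.backends.backend_pdf import PdfPages


def write_report(path: str, title: str, coefficients: dict) -> str:
    """Write a two-page PDF: a coefficient listing, then a bar chart."""
    with PdfPages(path) as pdf:
        # Page 1: the title and the coefficients as plain text.
        fig, ax = plt.subplots(figsize=(8.5, 11))
        ax.axis("off")
        ax.set_title(title)
        lines = [f"{name}: {value:.3f}" for name, value in coefficients.items()]
        ax.text(0.05, 0.9, "\n".join(lines), va="top",
                family="monospace", transform=ax.transAxes)
        pdf.savefig(fig)
        plt.close(fig)

        # Page 2: the same coefficients as a bar chart.
        fig, ax = plt.subplots()
        ax.bar(list(coefficients), list(coefficients.values()))
        ax.set_ylabel("coefficient")
        pdf.savefig(fig)
        plt.close(fig)
    return path


report_path = write_report(
    os.path.join(tempfile.mkdtemp(), "report.pdf"),
    "fitting title",
    {"intercept": 1.0, "x1": 2.0, "x2": -3.0},
)
```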
