Skip to content

Pipeable steps for feature engineering and data preprocessing to prepare for modeling

License

Unknown, MIT licenses found

Licenses found

Unknown
LICENSE
MIT
LICENSE.md
Notifications You must be signed in to change notification settings

walrossker/recipes

 
 

Repository files navigation

recipes

R-CMD-check Codecov test coverage CRAN_Status_Badge Downloads lifecycle

Introduction

With recipes, you can use dplyr-like pipeable sequences of feature engineering steps to get your data ready for modeling. For example, to create a recipe containing an outcome plus two numeric predictors and then center and scale (“normalize”) the predictors:

library(recipes)
data(ad_data, package = "modeldata")

ad_rec <- recipe(Class ~ tau + VEGF, data = ad_data) %>%
  step_normalize(all_numeric_predictors())

ad_rec
#> Recipe
#> 
#> Inputs:
#> 
#>       role #variables
#>    outcome          1
#>  predictor          2
#> 
#> Operations:
#> 
#> Centering and scaling for all_numeric_predictors()

More information on recipes can be found at the Get Started page of tidymodels.org.

You may consider recipes as an alternative method for creating and preprocessing design matrices (also known as model matrices) that can be used for modeling or visualization. While R already has long-standing methods for creating such matrices (e.g. formulas and model.matrix), there are some limitations to what the existing infrastructure can do.

Installation

There are several ways to install recipes:

# The easiest way to get recipes is to install all of tidymodels:
install.packages("tidymodels")

# Alternatively, install just recipes:
install.packages("recipes")

# Or the development version from GitHub:
# install.packages("devtools")
devtools::install_github("tidymodels/recipes")

Contributing

This project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.

About

Pipeable steps for feature engineering and data preprocessing to prepare for modeling

Resources

License

Unknown, MIT licenses found

Licenses found

Unknown
LICENSE
MIT
LICENSE.md

Code of conduct

Stars

Watchers

Forks

Packages

No packages published

Languages

  • HTML 65.2%
  • R 34.8%