twidlr: consistent data.frame and formula API for models

Overview

twidlr is an R package that exposes a consistent API for model functions and their corresponding predict methods such that they are specified as:

fit <- model(data, formula, ...)
predict(fit, data, ...)

Where "data" is a required data.frame (or able to be coerced to one) and "formula" describes the model to be fitted.

twidlr gets its name from the "twiddle" used in R formulas.

Installation

twidlr is available to install from github by running:

# install.packages("devtools")
devtools::install_github("drsimonj/twidlr")

Usage

library(twidlr) exposes model functions that you're already familiar with, but such that they accept a data.frame first, formula second, and then additional arguments. A robust method to predict data is also exposed.

For example, a typical linear model would be lm(hp ~ mpg * wt, mtcars, ...). Once twidlr is loaded, the same model would be run via lm(mtcars, hp ~ mpg * wt, ...).

Motivation

Modelling in R is messy! Some models take formulas and data frames while others require matrices and vectors. The same can be said of corresponding predict() methods, which can also be impure, returning unexpected or inconsistent results.

twidlr seeks to overcome these problems be providing:

Consistent API for model functions and their corresponding predict methods (helping to improve the generality of tidy modelling packages like piplearner)
Pure and available predictions by way of predict being made available for all methods (including unsupervised algorithms like kmeans) and making "data" a required argument
Tidyverse philosophy by working with data frames and being pipeable such as mtcars %>% lm(hp ~ wt)
Leverage formula operators where they may be valid but not originally available. For example, to specify select variables or include additional terms like interactions and dummy-coded variables with syntax such as glmnet(iris, Sepal.Width ~ Petal.Width * Petal.Length + Species).

twidlr models

Model functions exposed by twidlr:

Package	Function
glmnet	glmnet
lme4	glmer
lme4	lmer
quantreg	crq
quantreg	nlrq
quantreg	rq
quantreg	rqss
randomForest	randomForest
rpart	rpart
stats	glm
stats	kmeans
stats	lm
stats	t.test
xgboost	xgboost

Contributing

For conventions and best-practices when contributing to twidlr, please see CONTRIBUTING.md

Name		Name	Last commit message	Last commit date
Latest commit History 171 Commits
R		R
man		man
tests		tests
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
NAMESPACE		NAMESPACE
NEWS.md		NEWS.md
README.Rmd		README.Rmd
README.md		README.md
twidlr.Rproj		twidlr.Rproj
xgboost.model		xgboost.model

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

twidlr: consistent data.frame and formula API for models

Overview

Installation

Usage

Motivation

twidlr models

Contributing

About

Releases

Packages

Languages

License

guhjy/twidlr

Folders and files

Latest commit

History

Repository files navigation

twidlr: consistent data.frame and formula API for models

Overview

Installation

Usage

Motivation

twidlr models

Contributing

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages